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PREFACE 


The  word  mechatronics  was  first  coined  by  a  senior  engineer  of  a  Japanese  company;  Yaskawa,  in 
1969,  as  a  combination  of  "mecha"  of  mechanisms  and  "tronics"  of  electronics  and  the  company  was 
granted  the  trademark  rights  on  the  word  in  1971  [1-2].  The  word  soon  received  broad  acceptance  in 
industry  and,  in  order  to  allow  its  free  use,  Yaskawa  elected  to  abandon  its  rights  on  the  word  in  1982 
[3],  The  word  has  taken  a  wider  meaning  since  then  and  is  now  widely  being  used  as  a  technical  jargon 
to  describe  a  philosophy  in  engineering  technology,  more  than  the  technology  itself.  For  this  wider 
concept  of  mechatronics,  a  number  of  definitions  has  been  proposed  in  the  literature,  differing  in  the 
particular  characteristics  that  the  definition  is  intended  to  emphasize.  The  most  commonly  used  one 
emphasizes  synergy  and  is  as  follows:  Mechatronics  is  the  synergistic  integration  of  mechanical 
engineering  with  electronics  and  intelligent  computer  control  in  the  design  and  manufacture  of  products 
and  processes.  The  embedded  intelligence  may  vary  from  programmed  behaviour  to  self  organization 
and  learning. 

The  development  of  mechatronics  has  gone  through  three  stages.  The  first  stage  corresponds  to  the 
years  around  the  introduction  of  the  word.  During  this  stage,  technologies  used  in  mechatronic  systems 
developed  rather  independently  of  each  other  and  individually.  With  the  start  of  the  eighties,  a 
synergistic  integration  of  different  technologies  started  taking  place,  the  notable  example  being  in 
optoelectronics  (i.e.  an  integration  of  optics  and  electronics).  The  concept  of  hardware/software  co¬ 
design  also  started  in  these  years.  The  third  and  the  last  stage  starts  with  the  early  nineties.  The  most 
notable  aspect  of  this  stage  is  the  increased  use  computational  intelligence  in  mechatronic  products  and 
systems.  It  is  due  to  this  development  that  we  can  now  talk  about  Machine  Intelligence  Quotient 
(MIQ).  Another  important  development  in  the  third  stage  is  the  possibility  of  miniaturization  of  the 
components;  in  the  form  of  microactuators  and  microsensors  (i.e.  micromechatronics). 

The  field  of  mechatronics  is  now  widely  recognized  in  all  parts  of  world.  Various  undergraduate 
and  graduate  degree  programs  on  mechatronic  engineering  are  being  offered  at  different  universities. 
Journals  dedicated  to  the  field  of  mechatronics  are  being  published,  dedicated  conferences  are  being 
held.  One  such  conference  is  the  one  organized  in  Turkey  during  August  14-16,  1995,  with  the  title, 
"International  Conference  on  Recent  Advances  in  Mechatronics:  ICRAM'95,”  under  the  technical  co¬ 
operation  of  ASME  (American  Society  of  Mechanical  Engineers),  IEEE  (Institute  of  Electrical  and 
Electronics  Engineers)  Industrial  Electronics  Society,  IEEE  Robotics  and  Automation  Society,  IEEJ 
(Institute  of  Electrical  Engineers  of  Japan)  ,  IFAC  (International  Federation  of  Automatic  Control), 
IFToMM  (Int.  Fed.  for  the  Theory  of  Machines  and  Mechanisms),  JSME  (Japanese  Society  of 
Mechanical  Engineers),  RSJ  (Robotics  Society  of  Japan)  and  SICE  (Society  of  Instr.  and  Control 
Engineers  of  Japan).  The  conference  was  highly  successful,  it  had  more  than  200  participants  from  34 
different  countries.  Four  years  has  since  then  passed  and  in  order  to  discuss  the  most  recent  advances,  it 
has  been  decided  to  hold  another  similar  conference  during  24-26  May  1999,  again  in  Istanbul,  Turkey, 
under  the  title  2nd  International  Conference  on  Recent  Advances  in  Mechatronics:  ICRAM'99.  It  is 
organized  by  UNESCO  Chair  on  Mechatronics  and  Mechatronics  Research  and  Application  Center  of 
Bogazici  University,  Istanbul,  co-sponsored  by  IEEE  Industrial  Electronics  Society  and  IEEE  Robotics 
and  Automation  Society,  and  in  technical  co-operation  with  ASME  Dynamic  and  Control  Systems 
Division,  ASME  Design  Engineering  Division,  RSJ,  IEEJ ,  JSME  and  SICE. 

Two  kind  of  papers  have  been  solicited  for  this  conference,  “long  papers”  and  “regular  papers.”  The 
former  ones  have  been  collected  in  an  edited  book,  published  by  Springer- Verlag  (ISBN  981-4021-34- 
2),  under  the  name  Recent  Advances  in  Mechatronics  (Eds.  Okyay  Kaynak,  Sabri  Tosunoglu  and 
Marcelo  Ang  Jr.).  The  regular  papers  (90  of  them)  appear  in  this  volume. 

We  would  like  to  take  this  opportunity  to  thank  all  the  authors  for  their  valuable  contributions.  We 
are  confident  that  the  readers  will  find  the  contents  of  the  proceedings  interesting  and  beneficial. 
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ICRAM’99  is  being  organized  by  UNESCO  Chair  on  Mechatronics  and  Mechatronics  Research  and 
Application  Center  of  Bogazici  University,  Istanbul,  co-sponsored  by  IEEE  (Institute  of  Electrical  and 
Electronics  Engineers)  Industrial  Electronics  Society  and  IEEE  Robotics  and  Automation  Society,  and 
in  technical  co-operation  with  ASME  (American  Society  of  Mechanical  Engineers)  Dynamic  and 
Control  Systems  Division,  ASME  Design  Engineering  Division,  RSJ  (Robotics  Society  of  Japan),  IEEJ 
(Institute  of  Electrical  Engineers  of  Japan),  JSME  (Japanese  Society  of  Mechanical  Engineers)  and 
SICE  (Society  of  Instr.  and  Control  Engineers  of  Japan). 

The  support  of  the  prestigious  organizations  listed  above  has  resulted  in  a  wide  interest  in  the  meeting. 
We  have  about  130  high  quality  presentations  in  the  final  program  and  4  plenary  presentations  by  very 
well  known  experts  in  their  fields.  Additionally,  2  panel  discussions  are  planned  during  which  a  group 
of  American  and  Turkish  scientists  will  present  their  latest  research  endeavors  with  the  purpose  of 
identifying  possible  research  collaboration  areas. 

Many  colleagues  from  many  different  countries  of  the  world  have  given  their  support  to  ICRAM’99. 
We  have  been  trying  to  deserve  this  attention  by  doing  our  best  to  make  the  meeting  both  scientifically 
and  socially  rewarding  to  all  the  participants.  In  addition  to  the  scientific  program  of  the  meeting,  we 
are  offering  some  tours  of  Istanbul  to  acquaint  you  with  some  of  the  cultural  and  historical  riches  that 
this  city  has  to  offer.  Additionally,  a  post  conference  tour  into  Cappadocia  is  organized  which,  we 
hope,  will  promote  not  only  long  lasting  friendship  but  also  in-depth  technical  discussions  in  an  expert 
group  of  limited  size. 

I  would  like  to  take  this  opportunity  to  acknowledge  the  tremendous  effort  invested  in  this  meeting  by 
many  colleagues,  students  and  others  in  many  committees  of  the  meeting.  My  special  thanks  go  to  the 
Program  Chairs;  Professors  Tosunoglu  and  Ang,  Jr.,  and  to  the  members  of  the  Advisory  Committee 
and  International  Program  Committee.  I  would  also  like  to  thank  all  the  speakers,  authors  and 
participants  for  joining  us  and  making  ICRAM'99  a  success. 

Finally  I  wish  you  all  a  very  pleasant  stay  in  Turkey  and  a  safe  return  back  home.  I  hope  that 
ICRAM'99  will  be  a  meeting  you  will  pleasantly  remember  for  years  to  come. 


Okyay  Kaynak 
General  Chair 
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•  Bogazici  University  Foundation 

•  European  Office  of  Aerospace  Research  and  Development,  Air  Force  Office  of  Scientific 
Research,  United  States  Air  Force  Research  Laboratory 

•  European  Research  Office,  United  States  Army 
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The  term  "mechatronics"  was  introduced  in  the  late  1960s  by  the  Yaskawa  Electric  Company  (Japan)  based  on 
their  observation  of  the  synergy  achieved  through  the  integration  of  mechanical  and  electronic  technologies.  Yaskawa 
subsequently  released  trademark  rights  to  the  name  and  it  has  been  used  since  then  in  education  and  industry  to  describe 
systems  derived  from  this  heritage. 

Today,  that  heritage  has  come  to  include  a  broad  variety  of  physical  systems  operating  under  computer  and 
electronic  control.  The  term  "intelligence"  might  be  applied  to  these  systems,  but  its  anthropomorphic  implications  lead 
to  too  many  arguments  as  to  appropriate  definition.  What  we  can  observe,  however,  is  that  these  systems  employ  a 
complexity  of  decision-making  that  is  primarily  limited  by  engineering  economics  rather  than  technology.  This  is  a 
distinctive  break  from  the  past  and  serves  as  a  defining  characteristic  of  mechatronic  systems.  Using  this  perspective 
broadens  the  modem  view  of  mechatronics  to  include  any  kinds  of  decision-making  technologies  although  digital 
computers  dominate  today. 

A  major  driver  of  mechatronics  comes  from  the  needs  of  industry.  Masten  of  Texas  Instrument  summarizes  the 
general  trends  in  industry  by  listing  1)  knowledge  is  king,  2)  innovation  is  essential,  3)  cost  (what  is  "new"  today 
becomes  a  "commodity"  tomorrow,  4)  products  are  more  complex  and  system  based  with  higher  performance,  5)  short 
design  cycle,  6)  markets  are  global  and  more  competitive  and  7)  "design  teams"  are  a  preferred  approach.  Mechatronics 
offers  the  best  practice  to  meet  these  challenges. 

There  have  been  numerous  innovations  in  the  1980s  such  as  adding  intelligence  to  industrial  products.  We  may 
call  such  trends  as  infusion  of  "information  technology"  to  industrial  products.  Mechatronics  in  the  1980’s  was  no 
longer  a  simple  integration  of  mechanics  and  electronics.  The  availability  of  economically  affordable  and  reliable 
microprocessors  and  new  types  of  sensors  had  influenced  the  design  and  manufacturing  of  industrial  products  as  well  as 
the  products  themselves. 

These  developments  also  set  the  background  for  various  definitions  of  mechatronics  introduced  in  the  late  1 980's 
and  the  early  1990's.  The  International  Federation  for  the  Theory  of  Machines  and  Mechanism  (IFTMM),  provided  a 
definition,  "Mechatronics  is  the  synergistic  combination  of  precision  mechanical  engineering,  electronic  control,  and 
systems  thinking  in  design  of  products  and  manufacturing  processes."  The  IEEE/ASME  Transactions  on  Mechatronics, 
which  was  launched  in  1996,  adopted  a  similar  definition,  "Mechatronics  is  the  synergetic  integration  of  mechanical 
engineering  with  electronic  and  intelligent  computer  control  in  the  design  and  manufacture  of  industrial  products  and 
processes." 

It  was  mentioned  that  a  major  driver  of  mechatronics  is  the  needs  of  industry,  and  that  mechatronics  offers  the  best 
practice.  This  does  not  mean  that  mechatronics  refers  to  how  things  should  be  done  in  the  development  of  products  at 
industry.  In  fact,  mechatronics  provides  a  series  of  interesting  research  issues  to  university  researchers.  Mechatronics 
research  in  academia  should  be  at  the  forefront  and  yet  remain  relevant  to  needs  of  industry.  Mechatronics  research 
examples  appropriate  as  university  research  appear  in  many  areas  such  as  advanced  vehicle  control  and  safety'  systems 
(AVCSS),  design  and  control  of  hard  disc  drives  for  high  aerial  density,  biomedical  systems  and  MEMS  (micro-electro- 
mechanical  systems). 

Engineering  education  is  facing  challenges  in  the  midst  of  rapid  progress  of  IT.  IT  is  important  in  mechatronics, 
but  we  need  to  recognize  that  in  a  field  as  dynamic  as  ITs,  no  set  of  vocational  skills  has  any  significant  longevity. 
Having  said  above,  our  undergraduate  students  should  be  exposed  to  mechanical  design,  information  technology 
(software  and  hardware),  modeling  and  feedback  control,  integration  of  mechanical  devices  and  computers,  dealing 
with  complexity  and  team  work.  Furthermore,  students  should  be  trained  to  be  forward  looking  and  curious  and  good 
communicators.  Issues  surrounding  integration  as  well  as  working  in  team  cannot  be  taught  in  lecture  courses. 
Students  must  experience  them,  and  in  this  regard  laboratory  courses  are  essential  in  mechatronics  education.  Many 
high  school  students  have  a  simple  equation  Information  Technology  =  Computer  Science  Department  and  do  not 
realize  how  other  engineering  disciplines  are  changing  by  IT.  In  this  regard,  the  role  of  educators  must  go  beyond  how 
mechatronics  should  be  taught.  It  is  important  that  we  direct  the  attention  of  high  school  students  and  college  freshmen 
and  sophomores  to  the  exciting  field  of  mechatronics.  Mechatronics  is  the  field,  which  may  attract  a  number  of  talented 
undergraduate  students,  and  in  fact  mechatronics  thinking  should  be  exposed  to  all  engineering  students,  in  particular,  in 
mechanical  engineering  and  electrical  engineering.  Mechatronics  thinking  is  the  basics  of  the  modem  engineering 
education. 
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Computational  Intelligence  for  Robotic  Systems 
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This  paper  introduces  recent  topics  of  computational  intelligence.  The  intelligent  capabilities  will  be  required  to  the 
various  systems  to  adapt  a  system  to  dynamically  changing  environment.  First,  we  introduce  the  computational 
intel  igence  including  evolutionary  computing,  neural  computing  and  fuzzy  computing.  Next,  some  of  the  important 
pro  ems  including  the  system  architecture,  structured  intelligence,  emerging  system  and  implementation  methods  is 
discussed  in  this  paper  from  the  viewpoint  of  coevolution. 
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Rosalie  Zobel 

European  Commission,  DG  XIII,  Rue  de  la  Loi  200  -  NI05  5/25,  B-I049 
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Mechatronics,  as  an  intelligent  integration  of  different  engineering  disciplines,  can  also  play  a  pivotal  role  as  a 
model  for  inter-disciplinary  R&D  that  is  supported  by  the  European  Commission's  5th  Framework  Programme  for 
research  &  technological  development. 


The  presentation  will  offer  an  overview  of  the  R&D  priorities  relevant  to  mechatronics,  as  covered  by  the 
Information  Society  Technologies  programme,  (Key  Actions  11  and  IV),  Key  Action  I  of  the  Competitive  and 
Sustainable  Growth  programme,  as  well  as  the  Intelligent  Manufacturing  Systems  initiative 


Advances  in  Robotics 

Gerd  Hirzinger 

Director,  Institute  of  Robotics  and  System  Dynamics,  DLR-Oberpfaffenhofen 
P .  O.  Box  II  16,  82230  Wessling,  Germany 

Future  robots  presumably  are  the  most  challenging  mechatronic  systems.  The  talk  tries  to  outline  recent  advances  in 
the  field  of  industrial  robots  as  well  as  in  the  design  of  a  new  generation  of  lightweight  robots  and  articulated 
muitisensory  hands  for  service  applications  in  space  and  on  earth. 

Powerful  man-machine  interfaces  and  skill-transfer-techniques  combined  with  sensor-based  world  model 
generation  and  update  are  key  elements  for  new  programming  and  task-learning  concepts.  And  in  surgery 
mechatronic  and  robotic  systems  stand  for  the  final  break-through  in  minimally  invasive  operations  and  telesurgery.  ’ 
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Abstract 

An  efficient  simulation  platform  for  the  development 
of  anti-lock  brake  system  (ABS)  is  presented.  To  reduce 
computational  burden  due  to  the  rigidity  of  brake  hy¬ 
draulics,  the  actual  hydraulic  brake  assembly  is  taken 
from  the  target  vehicle  and  linked  to  a  virtual  18-dof 
vehicle  dynamics  model.  The  latter  is  used  to  describe 
dynamical  behavior  of  the  target  vehicle  under  extreme 
maneuvers.  The  hardware-in-the-loop  system  thus  built 
allows  the  investigation  of  not  only  the  dynamic  be¬ 
havior  but  also  hydraulic  responses  of  the  vehicle  with 
a  complete  assess  to  all  system  parameters  and  state 
variables . 


1  Introduction 

Simulation  analysis  has  become  a  indispensable  tool 
in  the  design  of  a  control  system,  where  parameters  to 
be  tuned  affect  dynamics  of  a  plant  in  motion.  This  is 
because  a  small  change  in  dynamics  of  the  moving  plant 
may  reduce  the  stability  margin  significantly,  thereby 
resulting  in  the  loss  of  stability.  The  typical  example 
is  an  electro-mechanical  chassis  control  system  which 
supports  a  driver  to  maintain  steerability  when  brak¬ 
ing  on  slippery  roadways,  such  as  Anti-Lock  Brake  Sys¬ 
tem  (ABS),  Traction  Control  System  (TCS)  and  Vehi¬ 
cle  Dynamics  Control  (VDC). 

Such  control  systems  provide  the  control  input  in 
the  form  of  braking  pressure  which  is,  in  turn,  result¬ 
ing  in  different  dynamical  behaviors  depending  on  the 
amount  of  braking  energy  absorbed  by  the  road  sur¬ 
face  conditions.  It  is  therefore  very  dangerous  to  tune 
the  underlying  control  systems  without  knowing  conse¬ 
quent  physics  of  the  vehicle  motion.  Moreover,  uncou¬ 
pling  dynamical  responses  to  rectify  the  source  of  un¬ 
desirable  maneuvers  hard  to  attain  due  to  difficulties 


M.  H.  Lee 

Department  of  Mechanical  Engineering 
Pusan  National  University 
San  30  Jang  jeon- dong,  Kumjung-gu 
Pusan,  609-735,  Korea 
mahlee@hyowon. pusan.  ac.kr 

in  defining  the  nominal  motion  among  a  set  of  road- 
test  results.  Furthermore,  the  growing  demand  for  the 
rapid  development  has  laid  a  tight  limit  on  the  time 
required  for  weather-dependent  road  tests. 

In  an  effort  to  reduce  the  number  of  road  tests, 
various  simulation  approaches  have  been  introduced. 
Among  them,  a  hardware-in-the-loop  approach  seems 
to  be  most  promising  in  the  sense  that  computation¬ 
ally  expensive  components  could  be  substituted  with 
the  actual  hardware  [1].  Consequently,  one  can  reduce 
the  great  deal  of  the  numerical  complexity  in  modeling, 
while  narrowing  down  the  gap  between  the  simulation 
and  experiment  results.  In  this  respect,  we  present  an 
efficient  simulation  platform  for  the  rapid  prototyping 
of  ABS. 

The  simulation  platform  thus  constructed  consists 
of  two  parts:  a  18-dof  virtual  vehicle  model  and  the 
actual  hydraulic  brake  assembly  of  the  target  vehicle. 
The  former  is  used  to  describe  dynamical  behavior  of 
the  target  vehicle  under  extreme  maneuvers,  while  the 
latter  to  reduce  computational  burden  due  to  the  rigid¬ 
ity  of  hydraulic  units  comprising  brake  system.  By 
establishing  a  link  between  the  actual  hardware  and 
the  virtual  vehicle  model,  one  can  artificially  generate 
the  road  surface  conditions  inducing  ABS  involvement. 
Once  tuned  properly,  the  resulting  simulation  platform 
exhibits  a  very  high  degree  of  reliability,  making  many 
road  tests  obsolete. 


2  Virtual  Model  of  the  Target  Vehicle 

The  target  vehicle  under  consideration  employs  a 
typical  power  steering  system  and  a  MacPherson  strut 
front  suspension,  along  with  a  semi- trailing  arm  rear 
suspension.  To  reduce  the  degree  of  complexity  in  mod¬ 
eling,  these  mechanical  systems  are  simplified;  namely, 
a  6-bar  steering  linkages  for  the  power  steering,  a  spring 
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and  damper  system  with  2  closed  kinematic  chains  for 
the  front  suspension,  a  simple  spring  and  damper  sys¬ 
tem  without  suspension  kinematics  for  the  rear.  Fi¬ 
nally,  the  governing  equations  of  motion  having  mini¬ 
mal  order,  while  taking  into  account  imposed  kinematic 
constraints,  are  derived  by  applying  multibody  formal¬ 
ism  based  on  Kane’s  approach  [2]. 

We  resort  to  AUTOSIM  to  drive  the  governing  equa¬ 
tions  of  motion.  Unlike  the  other  commercial  packages 
endowed  with  easy-to-use  graphical  interfaces,  such  as 
ADAMS  and  DADS,  AUTOSIM  requires  a  designer  to 
establish  interrelation  between  bodies  and  to  incorpo¬ 
rate  them  into  dynamics  model  while  taking  into  ac¬ 
count  kiematic  constraints,  thereby  allowing  the  model¬ 
ing  of  highly  coupled  multibody  systems.  Furthermore, 
AUTOSIM  automatically  generates  platform-free  nu¬ 
merical  codes  which  is  amenable  to  real-time  applica¬ 
tions. 


2.1  The  Six-Bar  Steering  Mechanism 


Considering  that  the  basic  principle  of  steering  ac¬ 
tion  can  be  explained  using  the  classical  steering  mech¬ 
anism  of  Ackermann  type,  a  6-bar  steering  linkages 
optimizing  Ackermann’s  turning  geomery,  as  shown  in 
Figure  1,  is  used  to  model  the  power  steering  system. 


Figure  1:  Ackerman  geometry 


The  condition  for  correct  steering  requires  the  cen¬ 
terlines  of  the  four  wheels  to  meet  at  a  single  point,  O, 
along  the  axis  of  the  rear  wheels.  The  steer  angles  ac¬ 
cording  to  the  said  correct  steering  condition  are  then 
given  by 


tan  (pi  = 


L 

x  4- 1/2 


(i) 


t“M  =  ^72  <2> 

where  x  denotes  the  distance  of  O  from  the  centerline 
L  of  the  vehicle.  Upon  eliminating  x  from  eqs.  (1  and 
2),  the  condition  for  correct  steering  can  be  derived  as 

f(<P u<P2)  =  sin(^2  ~  <pi)  “/?sin^isin^2  =  0  (3) 

where  p  =  L/t.  It  is,  however,  well  known  that  the 
ideal  steering  satisfying  the  foregoing  relation  is  diffi¬ 
cult  to  achieve.  Instead,  we  choose  the  six-bar  steering 
linkages  which  optimizing  the  condition  for  ideal  steer¬ 
ing  [3],  as  shown  in  Figure  2. 


Figure  2:  6-bar  steering  mechanism 


2.2  MacPherson  Strut  Suspension 

The  underlying  MacPherson  strut  suspension  con¬ 
sists  of  5  bodies  and  3  independent  kinematic  loops, 
as  shown  in  Figure  3.  Each  body  forming  the  loops 
is  then  analyzed  independently  to  assign  the  appropri¬ 
ate  degrees  of  freedom  depending  on  its  kinematic  role 
in  the  system.  For  simplicity,  a  kinematic  loop  associ¬ 
ated  with  the  tension  strut  of  the  front  suspension  is 
neglected  by  defining  a  ball  joint  connecting  the  lower 
control  arm  and  the  wheel  carrier  as  a  rotational  joint 
[4] .  The  idle  degrees  of  freedom  arising  from  the  rota¬ 
tion  of  the  rods  about  the  line  connecting  the  centers 
of  their  sperical  joints  are  also  neglected  [5] . 

Integrating  each  body  into  a  single  dynamical  sys¬ 
tem  gives  rise  to  a  set  of  highly  coupled  kinematic 
constraints.  These  constraints,  in  turn,  allows  us  to 
eliminate  dependent  generalized  speeds,  so  that  the  re¬ 
sulting  equations  of  motion  can  be  expressed  in  terms 
of  independent  generalized  speeds.  The  loop  closure 
conditions  obtained  by  disconnecting  the  loop  at  P,  as 
shown  in  Figure  4,  give  rise  to  the  following  constraints 
equations  [6] 

rB-c-  =  0  (4) 
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Table  1 :  Dynamical  properties  used  for  simulation 


Figure  3:  Architecture  of  a  MacPherson  strut 

vB’c>  =  0  (5) 

The  foregoing  equations  can  then  be  verified  the  vector 
equation  connecting  two  open  chains,  namely, 

a-j-b  =  c  +  d  (6) 

Bp 


2.3  Chassis  Dynamics  Model 

For  chassis  dynamics  model,  the  18-dof  vehicle 
model  provided  by  MSC  [7]  is  customized  such  a  way 
that  the  resulting  equations  of  motion  includes  the 
MacPherson  strut  front  suspension,  the  linearized  semi- 
trailing  arm  rear  suspension,  along  with  the  6-bar  steer¬ 
ing  linkages.  Figure  5  shows  the  characteristic  curves 
of  the  front  and  rear  suspensions  damping.  In  addition, 
the  dynamical  properties  used  in  simulation  are  listed 
in  Table  1 . 

3  Construction  of  the  HILS 

The  hardware  used  for  the  simulation  is  a  hydraulic 
circuit  comprising  the  braking  system.  The  actual 


Spring  coefficient  [N /m] 

23530 

of  the  front  coil  spring 

Spring  coefficient  [N/m] 

20286 

of  the  rear  coil  spring 

Spring  coefficient  [N/m] 

180000 

of  the  tire  (vertical) 

Weight  distribution  (front: rear) 

1.6:1 

Moment  of  inertia  (Rolling)  [Kg-m2] 

300 

Moment  of  inertia  (Pitching)  [Kg-m2] 

2000 

Moment  of  inertia  (Yawing)  [Kg-m2] 

2800 

Figure  5:  Damping  forces  (Front  and  Rear) 


brake  assembly  including  a  hydraulic  modulator  for 
ABS  is  integrated  into  the  simulation  platform.  The 
measured  brake  line  pressures  are  then  converted  into 
the  speed  signals  by  virtue  of  the  virtual  vehicle  model. 
Numerical  integration  of  the  governing  equations  of 
motion  gives  rises  to  the  speed  signals  in  discrete  form, 
where  the  resulting  wheel  speed  signals  are  determined 
by  the  amount  of  the  brake  line  pressures  and  the 
given  road  surface  conditions.  To  emulate  the  magnetic 
pick-up  type  speed  sensor,  the  sine  waves  are  gener¬ 
ated  through  the  built-in  4-channels  function  genera¬ 
tor.  Upon  receiving  the  emulated  speed  signals,  it  is 
possible  for  ECU  to  determine  the  road  surface  con¬ 
ditions  and  provide  appropriate  control  action  to  pre¬ 
vent  brakes  from  locking.  Hence,  a  loop  including  the 
brake  assembly  can  be  successfully  constructed:  Fig¬ 
ure  6  shows  the  complete  system  layout. 

3.1  Brake  Hydraulics 

Instead  of  the  brake  pedal  effort,  a  pneumatic  actua¬ 
tor  is  designed  to  initiate  the  brake  system,  as  shown  in 


Figure  6:  Front  view  of  the  HILS 
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Figure  7:  Design  of  the  pneumatic  actuator  for  braking 


Figure  7.  The  brake  pressure  distribution  correspond¬ 
ing  to  the  full  braking  condition  can  be  obtained  by 
controlling  the  stroke  of  the  actuator.  Figure  8  shows 
the  pressure  distribution  for  full  braking.  Moreover,  a 
speed  controller  is  attached  to  regulate  the  rise  time  of 
the  brake  pressure.  This  setup  allows  us  to  minimize 
the  possible  discrepancies  due  to  repeated  experiments, 
unlike  the  road-tests  where  the  the  nominal  motion  has 
been  chosen  collectively. 

The  hydraulic  brake  assembly  taken  from  the  target 
vehicle  is  integrated  into  the  simulation  platform.  This 
includes  the  vacuum  booster  master  cylinder  assembly, 
proportioning  valve  and  the  hydraulic  modulator  for 
ABS,  as  shown  in  Figure  9.  To  substitute  the  vacuum 
provided  by  the  engine  manifold,  an  additional  vacuum 
pump  is  used  to  assist  the  booster.  The  pressure  mod¬ 
ulation  in  the  wheel  brakes  can  be  achieved  by  the  use 
of  the  sol-sol  type  8-channel  hydraulic  modulator.  Fi¬ 
nally,  the  brake  line  pressures  are  measured  through 
pressure  transducers  installed  at  each  brake  line.  The 
underlying  brake  pressure  signals  are  then  transferred 


Figure  8:  Brake  pressure  distribution 


to  the  brake  forces  by  [8] 


Fb  =  (  Pt  - 

P0)AwcT1cBF(r/R)  [N] 

where 

Pi  = 

brake  line  pressure 

Po  = 

pushout  pressure 

Awc  = 

wheel  cylinder  area,  [cm2] 

Tjc  = 

wheel  cylinder  efficiency 

BF  = 

brake  factor 

r  = 

effective  disk  radius  [mm] 

R 

tire  radius  [mm] 

(7) 


3.2  Data  Acquisition  &  Processing 


The  sensed  brake  line  pressures  are  passing  through 
the  signal  conditioner  having  the  cutoff  frequency  of 
100  Hz  and,  then,  to  be  sent  to  ds2001  multi  I/O 
board  (dSPACE),  where  these  signals  are  sampled  at 


6 


Botxicr 


Figure  9:  Brake  hydraulic  circuit 


200  Hz.  The  latter  sampling  frequency  corresponds  to 
the  solenoid  firing  frequency  of  the  hydraulic  modula¬ 
tor.  Under  the  given  road  surface  conditions,  the  sys¬ 
tem  of  non-linear  equations  of  motion,  which  are  coded 
in  C  and  downloaded  in  dsl003  DSP  board  (dSPACE), 
takes  the  measured  brake  forces  as  the  system  input 
and  produces  the  corresponding  wheel  speed  signals 
after  carrying  out  numerical  integration  with  the  min¬ 
imum  achievable  sampling  rate  of  200  Hz.  It  should 
be  emphasized  that  the  loop  time  for  the  simulation 
has  to  be  bounded  within  the  specified  sampling  rate 
to  secure  successful  data  handshaking  between  the  ac¬ 
tual  hydraulic  module  and  the  virtual  vehicle  model. 
In  such  a  way,  a  link  between  two  different  systems  can 
be  established. 

The  discrete  wheel  speed  signals  thus  obtained  from 
numerical  integration  are  converted  into  sine  waves  ac¬ 
cording  to  the  characteristics  of  the  magnetic  pick-up 
type  wheel  speed  sensor,  namely, 

S(f,  t)  ~  A(f)  sin(2jr  /  t)  (8) 

while  satisfying 

\Mfi)\  >  \A(h)\ *+  fi  >  h,  (/min  <  /  <  /max)  (9) 

where  /  and  A(f)  denotes  the  wheel  speed  frequency 
and  its  corresponding  magnitude,  respectively.  As  dis¬ 
cussed  already,  the  vehicle  model  gives  rise  to  the  dis¬ 
crete  speed  values  at  every  5  [msec]  when  integrating 
with  the  200  Hz  sampling  frequency.  Note  that  the 
said  wheel  speed  signal  is  too  rough  to  emulate  the 
continuous  speed  wave,  which  is,  in  turn,  used  to  cal¬ 
culate  the  acceleration  of  the  rotating  wheel — the  most 
crucial  parameter  in  determining  ABS  engagement — . 
For  example,  the  ECU  might  fire  solenoid  valves  even 
in  the  normal  situation  due  to  surges  induced  by  nu¬ 
merical  noises  associated  the  underlying  procedure.  To 
resolve  the  problem  with  numerical  noises,  we  force¬ 
fully  introduce  one-step  delay,  i.e.,  save  the  previous 


two  values  in  the*  memory  and  use  the  numerical  in¬ 
terpolation  scheme  to  produce  smooth  sine  waves.  The 
sampling  rate  used  for  the  interpolation  is  1000  Hz,  five 
times  faster  than  the  sampling  frequency  for  the  system 
integration. 

4  System  Validation 

The  off-line  simulations  have  been  conducted  to  ver¬ 
ify  the  reliability  of  the  18-dof  vehicle  model.  With  the 
existing  road-test  results  at  hand,  the  brake  pressure 
signals  are  taken  as  the  input  for  the  system  of  nonlin¬ 
ear  equations  of  motion  and  the  resulting  wheel  speed 
signals  are  compared  with  the  measured  wheel  speed 
signals.  Then,  the  system  parameters  including  the 
road-surface  conditions  are  tuned  to  minimize  the  dif¬ 
ference  between  the  simulation  and  road-test  results  as 
illustrated  by  Figures  10  and  11. 


Brake  Pressure 


Figure  10:  Wheel  speed  and  corresponding  brake  pres¬ 
sure  when  ABS  engaged  (Front) 

Figure  12  shows  simulation  errors  after  such  tun¬ 
ings.  It  has  been  shown  that  the  speed  error  of  the 
front  wheel  is  larger  than  that  of  the  rear.  This  is 
because  the  front  wheel  has  been  affected  by  the  un¬ 
certainty  arising  from  the  inertia  variation  in  the  drive 
train  during  braking.  To  reduce  the  gap,  the  drive 
train  model  must  be  included  in  the  vehicle  model. 
However,  the  increased  complexity  in  modeling  might 
reduce  real-time  capability.  Instead,  the  wheel  speed 
signals  measured  during  braking  directly  are  sent  to 
the  vehicle  model  so  that  chassis  dynamics  can  be  im¬ 
plemented  using  the  measured  signals,  rather  than  the 
computed  values.  The  latter  approach  is  known  to  be  a 
Vehicle-In-the-Loop  System  (VILS).  The  VILS  is  cur¬ 
rently  under  construction  at  Ulsan  University. 
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Figure  11:  Wheel  speed  and  corresponding  brake  pres¬ 
sure  when  ABS  engaged  (Rear) 


Figure  12:  Simulation  errors 


5  Concluding  Remarks 

The  efficient  simulation  platform  for  the  rapid  pro¬ 
totyping  of  Anti-Lock  Brake  System  has  been  con¬ 
structed.  The  18-dof  vehicle  model  is  formulated  to 
describe  dynamics  of  the  target  vehicle  under  extreme 
maneuvers.  The  vehicle  model  consists  of  the  MacPher- 
son  strut  front  suspension,  linearized  semi-trailing  arm 
rear  suspension  and  6-bar  steering  linkages.  To  reduce 
computational  burden  arising  from  the  rigidity  of  brake 
hydraulics,  the  actual  brake  system  is  linked  to  the 
virtual  vehicle  model.  The  loop  including  the  hard¬ 
ware  and  the  virtual  vehicle  model  is  then  closed  by 
the  use  of  data  acquisition  and  processing  boards.  The 
hardware-in-the-loop  system  makes  it  possible  for  en¬ 
gineers  to  assess  the  capabilities  of  the  control  scheme 


long  before  the  actual  road-tests  are  carried  out,  boost¬ 
ing  the  grade  of  innovativeness  and  providing  more  flex¬ 
ibility  in  incorporating  various  control  techniques. 
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Abstract 

The  engagement  behaviour  of  wet  clutches  used  in  au¬ 
tomatic  transmissions  is  analysed.  A  comprising  mo¬ 
del  including  all  the  components  of  the  clutch  is  deve¬ 
loped.  The  friction  transmission  mechanism  consists 
of  three  phases:  clearance  phase ,  mixed  contact  phase 
and  steady  contact  phase.  The  clearance  phase  des¬ 
cribes  the  squeezing  process  of  the  oil  film  between  the 
discs  before  they  reach  contact  under  the  assumption 
that  the  lining  material  is  porous)  elastically  deforma¬ 
ble  and  has  a  rough  surface.  During  the  mixed  contact 
phase  the  discs  are  partly  in  contact  and  the  contact 
area  increases  smoothly  due  to  the  deformation  of  the 
surface  asperities  of  the  lined  discs.  The  steady  contact 
phase  occurs  when  the  disc  package  is  pressed  together 
and  a  static  equilibrium  in  the  axial  direction  has  been 
established.  The  engagement  behaviour  of  a  wet  clutch 
is  discussed  with  variations  of  the  main  parameters. 

1.  Introduction 

The  transmission  behaviour  of  wet  clutches  (figure  1) 
is  affected  by  friction  characteristic,  oil  viscosity,  se¬ 
paration  clearance,  manufacturing  tolerances,  friction 
between  the  housing  and  the  discs,  etc.  Therefore  sim¬ 
plified  models  neglecting  the  presence  of  the  oil  film 
between  the  discs  and  the  axial  motion  of  the  piston 
are  not  eligible  for  realistic  prediction  and  simulation 
of  the  engagement  behaviour  of  wet  clutches.  The  mo¬ 
del  needed  for  this  purpose  must  include  the  effects 
mentioned  above. 

In  [1]  and  [2]  the  engagement  behaviour  of  two  clutch 
plates  is  analysed.  The  model  used  in  [1]  takes  into  ac¬ 
count  the  clutch  lining  porosity.  The  transition  from 
hydrodynamic  to  boundary  lubrication  is  assumed  to 
occur  abrupt  at  a  prescribed  critical  film  thickness, 
which  leads  to  a  discontinuity  in  the  torque  curves.  In 
[2]  the  surface  roughness  of  the  lining  material  is  con¬ 
sidered.  A  squeeze  film  phase  and  a  consolidating  con- 
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1  piston  housing,  2  piston,  3  oil  passages, 

4  return  spring,  5  disc  package, 

6  compensating  plate,  7  lubrication  passage 


Figure  Is  Wet  clutch 


tact  phase  are  analysed  but  the  mixed  contact  phase  is 
not  investigated.  A  simple  and  physically  comprehen¬ 
sive  model  is  used  in  [3]  to  simulate  the  transition  from 
viscous  to  dry  friction. 

The  above  mentioned  models  do  not  include 
differences  between  each  pair  of  friction  discs,  manu¬ 
facturing  tolerances  and  friction  between  discs  and 
tangs.  These  effects  are  considered  in  detail  in  the 
model  presented  in  this  paper. 

2.  Description  of  the  mechanical  model 

Figure  2  shows  a  comprising  wet  clutch  model  inclu¬ 
ding  the  outer  and  inner  tang,  the  piston,  the  return 
spring  and  the  steel  (inner)  and  lined  (outer)  plates. 
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Furthermore  the  oil  film  between  each  pair  of  friction 
surfaces  is  considered.  At  the  beginning  of  a  clutch  en¬ 
gagement  the  control  pressure  is  applied  to  the  piston, 
which  starts  to  press  the  disc  package.  Consequently 
the  discs  start  approaching  and  the  oil  has  to  be  squee¬ 
zed  out  before  the  friction  surfaces  can  make  contact. 
The  increasing  squeeze  pressure  in  the  oil  film  prevents 
the  discs  from  making  instantaneous  contact  and  the 
torque  transmitted  during  this  phase  is  purely  due  to 
viscous  friction.  Once  the  discs  reach  contact  a  tran¬ 
sition  from  viscous  to  dry  friction  occurs.  In  order  to 
describe  this  transient  phase,  the  geometry  and  ela¬ 
sticity  of  the  lined  disc  asperities  must  be  taken  into 
account. 


Figure  2:  Wet  clutch  model 


3.  Equations  of  the  two-discs  model 
3.1  Clearance  phase  model 


Figure  3:  Clearance  phase  model 


Assuming  a  homogenous  and  isotropic  porous  lining 
material  and  an  incompressible  fluid,  the  squeeze  pres¬ 
sure  can  be  described  by  the  modified  Reynold’s  equa¬ 
tion  [2,  4] 


r  dr 


(H3  + 12  <j>hB)  r~^r  =  +  24<t>hBpwl 


PujT 
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3  +  4^+3[ 
Win  1 


f  UoutY 
\Win  ) 
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We  consider  the  squeeze  oil  film  between  a  rotating 
steel  disc  and  a  lined  disc  while  approaching  each  other 
(qj  coordinate  of  the  disc  j).  Taking  into  account  the 
lining’s  elastic  deformation  and  its  surface  roughness, 
the  oil  film  thickness  H  in  figure  3  may  be  written  as 

H  =  h  + 5  + ha  (1) 

with 

•  h  =  s  +  z  the  mean  unstressed  film  thickness. 


where  <f)  represents  the  permeability  of  the  lining  ma¬ 
terial  and  7]  and  p  are  the  oil  viscosity  and  density 
respectively. 

We  assume  a  uniform  and  isotropic  surface  roughness 
structure  and  a  disappearing  mean  stochastic  thickness 
h8.  Using  the  expectancy  operator 

oo 

E(X)  =  J  xf(x)  dx  (3) 

—  OO 


•  s  =  qj+1  —b—qj  the  minimal  unstressed  film  thick¬ 
ness. 

•  =  ^(1  -  the  elastic  de- 

formation  of  the  porous  lining. 

•  ka  the  stochastic  thickness  difference  from  the  no¬ 
minal  level  defined  by  (h  +  J)  due  to  the  asperities. 

Here  b  is  the  disc  thickness,  z  is  the  mean  height  of  the 
asperities,  and  Hb,Eb  and  ub  are  the  lining  thickness, 
Young’s  modulus  and  Poisson’s  ratio  respectively. 


where  AT  is  a  random  magnitude  with  the  density  /(x), 
the  expected  values  of  H  and  H 3  can  be  calculated  as 

E(H)  =  {h  +  6)  (4) 

oo 

E(H3)  =  (h  +  5)a  +  3{h  +  5)  J  h2sf(ha)dhe 

— OO 
OO 

+  J  h3f(ha)dha 
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E{H3)  =  (ft  +  <5)3  +  3(/H-<S)a2+e 


(5)  3.2  Mixed  contact  phase  model 


where  u  is  the  variance  of  the  film  thickness  H  and  e  is 
a  symmetry  factor  of  its  stochastic  distribution,  which 
equals  zero  for  a  symmetric  distribution.  Substituting 
H  and  H 3  in  eq.  (2)  by  eq.  (4)  and  eq.  (5)  and 
solving  the  modified  Reynolds  equation  with  respect 
to  the  pressure  p0u  leads  to  the  squeeze  pressure  in  the 
oil  film,  which  can  be  integrated  over  the  disc  surface 
to  [5] 


Once  the  discs  have  made  contact  (s  =  0),  we  have  to 
deal  with  mixed  friction  as  we  have  viscous  friction  in 
the  area  between  the  asperities  and  dry  friction  in  the 
contact  area  between  them  and  the  steel  disc  (figure  4). 
In  order  to  calculate  the  part  of  the  torque  due  to  dry 
friction,  we  have  to  determine  the  contact  force  Fcont 
between  the  asperities  and  the  steel  disc.  Using  the 
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Figure  4:  Mixed  contact  phase  model 


Hertzian  theory  the  contact  force  Fcont  and  the  ratio  f 
of  the  contact  area  Acont  to  the  total  friction  area  An 
can  be  expressed  in  terms  of  the  compliance  ze  (figure 
4)  [6,  7]  as 


ftjB.  _ 12rjhrf  _  240/iBpa)?nr? 


= *1?'  C:= 

PahB 


h3pa 


Fcont  =  \ E'Ann0R F3/2  g) 


7-7  +  C;  S:=^  i 


Ti 


with  the  disc’s  inner  and  outer  radii  r*  and  ra  and  the 
reference  ambient  pressure  pa . 

Assuming  a  Newtonian  fluid,  the  shear  stress  can  be 
written  as 


with  the  definitions 


/  \  Win  -  w0ut  Aw 

T(r)  =  V  l  ,  *  r  =  T)  T— ^  r 


h  +  6 


h  +  5 


(7) 


The  torque  due  to  viscous  shear  is  then  obtained  by 
integrating  the  shear  stress  over  the  disc  surface 


M7)-/(?-7)  '(7)<7 

0 

*(?)=/(  *-*)/(*K 


27 r  ra 

Mvi8  =  J  J  tt2  drd(p 
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o  n 
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(10) 

(11) 

(12) 

(13) 

(14) 

(15) 


where  za  =  2z,  N  is  the  number  of  asperity  summits 
within  a  defined  traverse  length  ls ,  ae  and  R  are  the 
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mean  basis  radius  and  the  mean  curvature  radius  of  the 
asperity  summits,  Es  and  vs  are  the  Young’s  modulus 
and  the  Poisson’s  ratio  of  the  steel  disc. 


Figure  5:  Profile  traces  of  a  steel 
and  a  paper- lined  clutch 
disc 


The  probability  density  distribution  /  (^-)  (figure  6) 
can  be  ascertained  from  a  profile  trace  of  the  surfaces 
of  lined  clutch  discs  as  shown  in  figure  d.  Figure  5 
shows  that  the  surface  roughness  variation  of  the  steel 
disc  is  negligible  compared  to  the  lined  disc.  Using 
the  polynomial  approximation  of  the  surface  roughness 
distribution 


shown  in  figure  6,  the  integral  functions  (12)  and  (13) 
can  be  evaluated  analytically. 

f® 


Figure  6:  Surface  roughness  distri¬ 
bution  of  a  lined  disc 

The  part  of  the  force  due  to  the  squeeze  pressure  in 


the  cavities  between  the  asperities  can  be  calculated 
using  eq.  (6)  after  redefining  the  thickness  h  as 

h  =  z  +  ±s  (17) 

The  total  force  between  the  two  discs  is  given  as 

Fd  =  (1  -  QFqU  4-  Fcont  (18) 

Analogously  we  obtain  for  the  torque  transmitted  bet¬ 
ween  the  two  discs 

Md  =  (1  -  i)Mvi8  4-  Mdry  (19) 

The  viscous  torque  Mvla  results  from  eq.  (9)  and  the 
redefined  thickness  h  in  eq.  (17).  To  determine  the  dry 
torque  Mdry ,  we  use  the  Coulombian  friction  law 

Mdry  =  rm  /iAw  Fcont  (20) 

where 

2  -  r? 

r  —  Z  -9l _ lx 

m“3r2-r? 

The  coefficient  of  friction  \i  is  assumed  to  be  a 
function  of  the  angular  sliding  velocity  (figure  7)  but 
constant  over  the  friction  surface. 


Figure  7:  Characteristic  of  the  friction 
coefficient 


3.3  Steady  contact  phase  model 

This  phase  begins  when  the  static  equilibrium  in  the 
axial  direction  has  been  achieved.  It  is  reasonable  to 
expect  the  film  thickness  to  not  disappear  complete¬ 
ly  because  the  asperities  cannot  be  perfectly  flattened. 
For  numerical  reasons,  it  is  also  expected  that  the  ve¬ 
locities  as  well  as  the  accelerations  of  the  discs  do  not 
become  zero  exactly.  Therefore,  when  a  minimal  film 
thickness  hmin  is  reached,  the  accelerations  qj  and  qj+\ 
and  the  velocities  qj  and  are  set  to  zero.  The  com¬ 
putation  of  the  contact  force  and  of  the  transmitted 
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torque  is  performed  according  to  section  3.2  as  long  as 
the  discs  slip. 

When  the  relative  angular  velocity  between  the  two 
discs  disappears,  the  sticking  state  occurs.  In  this  case 
the  clutch  torque  represents  a  constraint  torque 

Mp  <  fioTmFcont  (21) 

where  po  represents  the  coefficient  of  static  friction. 

4.  Equations  of  motions  of  the  multi- 
disc-clutch 

4.1.  Equations  of  translational  motions 

The  axial  motion  of  the  piston  is  described  by  the 
generalized  coordinate  go-  The  equation  of  motion  of 
the  piston  can  be  written  as 

mpq0  =  FP  -  Fpp  -  Fs  -  J Fb0  (22) 

Furthermore  we  obtain  an  equation  of  motion  for 
each  disc 

mjQj  =  ^Dj-i  -  FDj  -  Ffd.  .  (23) 

4.1.1.  The  actuating  force  Fp.  The  hydraulic  devi¬ 
ce  provides  the  clutch  with  the  necessary  control  pres¬ 
sure  Pcontroi •  The  oil  has  to  flow  through  the  orifice 
before  it  enters  the  piston  chamber.  The  oil  pressure 
in  the  piston  chamber  consists  of  a  static  and  a  dyna¬ 
mic  part.  The  static  pressure  psta  can  be  calculated 
using  the  classical  orifice  equation  given  by 

p  A^ 

Psta  =  Pcontroi  ~  Sgn(go)  «go 2~  ~  2  (2^) 

1  {Adad) 

where  Ap  is  the  piston  abutting  face,  Ad  and  are  the 
orifice  area  and  the  discharge  coefficient  respectively. 

The  dynamic  pressure  pdyn  results  from  the  rotation 
of  the  oil  mass  in  the  piston  chamber 

Pdynir)  =  ^pvlu(r)  =  ^piv^r2  .  (25) 

The  total  force  acting  on  the  piston  may  be  written  as 

Fp  =  J  (p8ta  +  Pdyn(r))  dAP 
AP 

=  PstaAp  +  jP0J2oit  (rpo  -  rp.)  (26) 

with  rp.  and  rpo  representing  the  inner  and  outer 
radii  of  the  piston.  The  angular  velocity  of  the  oil 
can  be  assumed  to  equal  the  piston  housing  velocity 


Von  =  &in-  In  the  case  of  a  dynamic  compensated 
piston,  for  example,  when  it  is  double-sided  pressurised 
or  in  the  case  of  a  brake  clutch  with  a  static  piston 
housing,  the  dynamic  part  of  the  pressure  has  to  be 
set  to  zero. 

4.1.2.  The  return  spring  force  F5.  The  spring 
force  is  responsible  for  moving  the  piston  back  whi¬ 
le  disengaging  the  clutch.  The  spring  usually  has  a 
degressive  characteristic,  which  can  found  from  mea¬ 
surements  (figure  8).  The  spring  force  depends  on  the 
piston  displacement  and  can  be  calculated  as 

Fs(q  0)  =  Fs0  +  c(qo)go  (27) 

where  Fs0  is  the  prestressing  force  and  c  is  the  spring 
stiffness. 


0  1  2  3  4  5 

q0  [mm] 


Figure  8:  Characteristic  of  the  return 
spring 

4.1.3.  The  friction  force  of  the  piston  seal  Fpp. 
The  piston  seals  cause  friction  forces  between  the  inner 
and  outer  sides  of  the  piston  and  the  housing  (friction 
coefficient  p,p).  For  a  given  inner  stress  force  Fp.  and 
outer  stress  force  Fpo ,  the  friction  force  of  the  piston 
seal  can  be  written  as 

Fpp  =  sgn(g0)  Pp  {Fp{  +  FpJ  (28) 

4.1.4.  The  friction  forces  between  tang  and  discs 
Ffd.  The  friction  force  between  a  disc  and  the  tang 
can  be  written  as 

FpDi  —  P>i  +  — —  for  inner  discs  (29) 

TTi 

and 

Fpo,  =  Po  — l  — —  for  outer  discs  (30) 

rT0 
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with  p,i  and  pQ  respectively  the  corresponding  friction 
coefficients,  and  respectively  the  operative  radii 
of  the  inner  and  outer  tangs. 

4.1.5.  The  disc  force  Fp.  The  disc  forces  can  be 
calculated  according  to  section  3. 

4.2.  Equations  of  rotational  motions 

Making  use  of  the  principle  of  the  angular  momen¬ 
tum  the  rotational  motions  of  the  inner  and  outer  tang 
with  the  mating  discs  are  written  as 

Jintyin  —  M< jn  —  Mq  (31) 

Jouttyout  —  Me  —  Mout  (32) 

where  Min  and  Mout  are  the  drive  and  load  torques 
respectively.  The  clutch  torque  is  given  as 

n— 1 

Mc  =  J2MDj  (33) 

i= i 

where  n  is  the  number  of  discs. 

4.3.  Equation  of  the  viscosity 

Since  the  engagement  process  of  a  wet  clutch  is  dis¬ 
sipative,  it  is  coupled  with  an  increase  of  the  oil  tem¬ 
perature.  This  has  a  great  influence  on  the  oil  viscosity 
and  consequently  on  the  squeeze  pressure  and  the  shear 
stress  of  the  oil  film.  Considering  only  the  temperature 
dependence,  the  viscosity  can  be  given  by  the  following 
empiric  equation  [8] 

V  =  Vo  e~^Tol‘  (34) 

The  change  in  the  temperature  results  from  the  ba¬ 
lance  between  the  thermal  load 

Qin  =  Me  (Uin  -  Wout)  (35) 

which  is  developed  during  the  engagement  process,  and 
the  thermal  power 

Qout  =  A  [T0ii  ~  Tg)  (36) 

which  is  conducted  by  the  steel  discs  to  the  housing. 
Thus  we  can  write 

Toil  ^ Qin  Qout'j  ~  *  (37) 

Here  A,  cd  and  Tq  represent  the  housing  heat  con¬ 
ductivity,  the  disc  heat  capacity  and  the  housing 
temperature  respectively. 


5.  Simulation  results 

Figure  9  shows  some  simulation  results  of  the  enga¬ 
gement  of  a  wet  clutch  with  11  discs.  The  curve  9  (a) 
represents  the  plots  of  the  viscous  and  the  dry  torques 
as  well  as  the  resulting  transmitted  torque  in  the  clutch 
for  the  reference  parameters  (a)  as  written  in  table  (1). 
At  the  beginning  only  the  viscous  torque  is  present.  It 
increases  until  the  discs  get  in  contact.  After  the  dry 
torque  is  developed,  the  viscous  torque  decreases  with 
increasing  contact  area  and  oil  temperature  due  to  the 
thermal  dissipation  in  the  clutch. 

A  simulation  with  a  smaller  initial  film  thickness 
Sj  =  0.02  mm  is  shown  in  figure  9(b).  It  is  obvious 
that  although  the  initial  film  thickness  does  effect  the 
drag  torque  during  the  clearance  phase,  it  does  not  in¬ 
fluence  the  time  at  which  contact  occurs. 

The  tolerance  of  the  disc  thickness  has  indeed  an 
influence  on  the  duration  of  the  engagement  process  as 
shown  in  figure  9(c).  The  higher  level  of  the  dry  torque 
results  from  the  dynamic  pressure  pdyn  because  of  the 
delayed  deceleration  of  the  clutch  drive  side. 

Compared  with  the  reference,  a  lower  initial  oil  tem¬ 
perature  leads  to  a  higher  drag  torque  and  a  longer 
engagement  process.  This  results  from  the  higher  va¬ 
lue  of  the  viscosity,  which  increases  with  decreasing  oil 
temperature. 

A  comparison  of  the  torque  curve  with  the  experi¬ 
mental  results  given  in  [9]  shows  a  good  qualitative 
agreement. 


Pcontrol  [heir] 

Sj  [mm] 

b  [mm] 

T0ii[°C] 

a) 

1.4 

0.9  mm 

1.25 

90 

b) 

1.4 

0.02  mm 

1.25 

90 

c) 

1.4 

0.9  mm 

1.23 

90 

d) 

1.4 

0.9  mm 

1.25 

0 

Table  1:  Parameter  variation 
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Torque  [Nm]  Torque  [Nm]  Torque  [Nm]  Torque  [Nm] 


a) 


6.  Conclusions 


Figure  9:  Simulation  results  of  an 

engagement  of  a  wet  clutch 
with  11  discs 


A  detailed  model  has  been  developed  to  simulate  the 
engagement  behaviour  of  wet  clutches.  Effects  like  li¬ 
ning  porosity,  surface  irregularities  as  well  as  clearance 
tolerances  have  been  considered.  The  model  equations 
have  been  deduced  from  the  porous  squeeze  film  and 
the  Hertzian  theory.  The  model  showed  that  the  drag 
torque  during  the  engagement  process  is  affected  by 
the  initial  thickness  of  the  oil  film  as  well  as  by  the 
initial  oil  temperature,  whereas  the  duration  of  the  en¬ 
gagement  is  influenced  by  the  clearance  tolerances  and 
the  initial  oil  temperature.  The  simulation  results  of 
the  presented  model  agree  qualitatively  with  the  expe¬ 
rimental  tests. 
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Abstract 

The  piezoelectric  actuator  is  a  common  device  for  re¬ 
alizing  extremely  small  displacements.  In  the  behavior 
of  a  piezo- actuated  mechanism  the  actuator  dynamics 
play  an  important  role.  Therefore,  in  order  to  be  able 
to  design  a  high  performance  model-based  controller  the 
actuator  dynamics  must  be  taken  into  account  in  the 
system  model.  Based  on  the  literature  a  physical  model 
is  derived.  In  this  model  the  hysteresis  effect  is  repre¬ 
sented  by  a  first-order  differential  equation.  The  model 
is  analyzed  in  great  detail  and  compared  to  another 
model  from  the  literature.  It  turns  out  that  the  derived 
physical  model  is  equally  accurate  for  a  number  of  char¬ 
acteristics.  Furthermore,  preliminary  experiments  have 
shown  that  the  physical  model  more  accurately  describes 
the  rate  of  convergence  of  hysteresis  loops. 

1  Introduction 

The  piezoelectric  actuator  (PEA)  is  a  well-known  com¬ 
mercially  available  device  for  realizing  extremely  small 
displacements  in  the  range  of  10  pm  (1  pm  =  10_12m) 
to  100  pm.  The  ratio  of  the  input  voltage  and  the 
output  elongation  is  very  favorable  for  this.  A  disad¬ 
vantage  however  is  its  highly  nonlinear  input/output 
behavior.  More  specifically,  a  PEA  shows  hysteresis  be¬ 
havior.  In  simple  terms  this  means  that  for  a  certain  in¬ 
put  there  is  no  unique  output.  Instead,  the  output  de¬ 
pends  on  the  input  history.  For  high  accuracy  position¬ 
ing  and  tracking  systems  the  piezo-actuated  position¬ 
ing  mechanism  should  therefore  be  equiped  with  a  con¬ 
troller.  Modern  controller  design  is  based  on  a  model 
of  the  system  to  be  controlled.  For  piezo-actuated  posi¬ 
tioning  mechanisms  the  dynamical  aspects  of  the  PEA 

*This  research  is  supported  by  the  Dutch  Technology  Foun¬ 
dation  (STW). 


play  a  dominant  role.  In  this  paper  we  focus  on  the 
modeling  of  the  dynamics  of  a  piezo-actuated  position¬ 
ing  mechanism. 

In  [8]  a  typical  servomotor  model  is  presented  that  is 
completely  based  on  physical  principles.  In  [7]  an  im¬ 
provement  to  this  model  has  been  described.  Instead 
of  a  combination  of  elementary  elements  the  hysteresis 
is  modeled  by  a  nonlinear  first-oder  differential  equa¬ 
tion,  which  has  been  proposed  in  [5].  In  [1]  a  second 
improvement  has  been  described.  Instead  of  a  mass¬ 
spring-damper  system  the  mechanical  behavior  is  mod¬ 
eled  by  a  partial  differential  equation.  In  [6],  [9],  [8], 
[7]  and  [1]  it  is  shown  that  in  case  of  charge  steering  no 
hysteresis  behavior  is  encountered  in  the  input-output 
behavior,  i.e.  inbetween  the  electrical  input  and  the 
displacement.  A  different  way  of  dealing  with  the  hys¬ 
teresis  effect  is  to  do  less  complicated  voltage  steering 
and  to  take  the  hysteresis  effect  into  account  in  the 
model  on  which  you  base  the  controller  design.  In  [3] 
the  series  connection  of  linear  time-invariant  second  or¬ 
der  dynamics  and  the  differential  equation  of  hysteresis 
has  been  analyzed  in  great  detail.  Based  on  this  model 
controllers  have  been  developed  and  succesfully  applied 
to  an  already  existing  set-up  in  [2]  and  [4]. 

In  this  contribution  we  will  derive  a  physical  model 
for  the  case  of  voltage  steering.  This  model  will  be 
analyzed  in  great  detail  and  will  be  compared  to  the 
series  connection.  In  section  2  the  physical  model  will 
be  derived  and  normalized.  The  series  connection  will 
be  considered  in  section  3.  In  section  4  we  will  show 
that  hysteresis  loops  converge.  The  model  characteris¬ 
tics  of  the  series  connection  are  related  to  the  ones  of 
the  normalized  physical  model  in  section  5.  In  section 
6  the  influence  of  the  two  feedback  loops,  which  are 
present  in  the  physical  model,  will  be  determined  an¬ 
alytically.  Finally,  in  section  7  we  will  summarize  the 
results  and  draw  some  conclusions. 
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2  The  physical  model 

In  this  section  we  will  present  the  physical  model  for 
the  case  of  voltage  steering.  The  model  will  be  based 
on  the  work  in  [8],  [7],  and  [1].  Contrary  to  the  case  of 
charge  steering,  in  the  case  of  voltage  steering  we  will 
have  to  deal  with  the  hysteresis  effect. 

For  a  piezo-actuated  mechanism  that  is  fixed  at  one 
side  the  electromechanical  model  is  presented  in  Fig¬ 
ure  1.  This  model  has  been  proposed  in  [8]  and  im¬ 
proved  in  [7]  and  [1]. 


Figure  1:  The  electromechanical  model  of  PEA  and 
positioning  mechanism  together. 

Here  the  hysteresis  and  piezo  effect  are  separated. 
H  represents  the  hysteresis  effect  and  Uh  is  the  voltage 
due  to  this  effect.  The  piezo  effect  is  represented  by 
C  -  T,  which  is  a  DC  permanent-magnet  motor  with 
electromechanical  transformer  ratio  T  and  with  paral¬ 
lel  capacitance  C.  The  voltage  up  is  the  back-electro¬ 
motive-force  (back-emf)  from  the  mechanical  side.  The 
total  voltage  over  the  PEA  is  u.  This  voltage  can  be 
considered  to  be  the  input  of  the  total  system.  The 
current  flowing  through  the  circuit  is  q.  The  force  Fp 
is  generated  by  the  PEA  and  drives  the  mechanics  of 
PEA  and  positioning  mechanism  together,  which  is  de¬ 
noted  by  D.  This  mechanism  is  designed  such  that  it 
is  accurately  modeled  by  a  linear  mass-spring-damper 
system.  The  resulting  displacement  y  is  the  output  of 
the  total  system. 

The  set  of  electromechanical  equations  is  given  by: 


u  =  Uh+Up  (1) 

q  =  a  \uh\(a  Uh  -  q) +  b  iih  (2) 

q  =  C  up  +  T  y  (3) 

Fp  =  T  up  (4) 

m  y  +  c  y  +  k  y  —  Fp  (5) 


The  second  equation  is  the  hysteresis  equation,  which 
is  proposed  in  [5]  in  a  little  more  general  form.  This 
specific  form  has  been  used  in  [3],  [7],  and  [1].  Accord¬ 
ing  to  [3],  for  physically  correct  hysteresis  it  is  required 


that  a  >  0  and  that  \a  <  b  <  a.  We  remark  that  q  may 
be  seen  as  the  total  charge  in  the  PEA.  The  mechanical 
properties  m,  c,  and  k  are  a  compound  of  the  mechani¬ 
cal  properties  of  both  PEA  and  positioning  mechanism 
together.  Namely,  in  [1]  it  has  been  shown  that  the 
mechanical  behavior  of  a  well-designed  piezo-actuated 
positioning  mechanism  can  be  accurately  modeled  by 
a  single  mass-spring-damper  system.  After  simple  ma¬ 
nipulations  the  following  set  of  equations  may  be  ob¬ 
tained: 

Uh  =  u  -  ^  q  +  0oy  (6) 

q  =  a\uh\(auh-q)  +  buh  (7) 
my  +  cy  +  k0y  =  (30q  (8) 


with  k0  and  0O  defined  as  in  [7]: 


ka 

0a 


rp2 

k+c 


(9) 

(10) 


Before  performing  model  analysis  we  will  simplify 
the  model  by  reducing  the  number  of  parameters.  To 
this  purpose  we  make  the  static  gain  of  Eq.  8  equal  to 
one.  To  do  so  we  substitue  q  =  jg  v,  with  v  a  new 
variable,  in  the  model.  After  simple  manipulations  the 
following  model  may  be  obtained: 


uh  -  u-PiV  +  /30y  (11) 

v  =  a  |i4|  (a'  Uh-v)  +  b'  it/,  (12) 
y  +  c'y  +  k'0y  =  k'0v  (13) 


where  the  new  parameters  are  defined  as: 

bT  (14) 

(15) 

In  this  set  of  equations  the  number  of  parameters  is 
maximally  reduced  while  all  dynamical  properties  are 
captured. 

In  Figure  2  the  normalized  physical  model  is  rep¬ 
resented  in  a  block-diagram  form.  Here  H'  represents 
the  normalized  hysteresis  of  Eq.  12  and  D '  represents 
the  normalized  linear  dynamics  of  Eq.  13.  Clearly,  two 
feedback  loops  are  present. 


/3  _  1  k°  t  _  P°  7  /  _ 

C(30  ’  “  ~  k0  ’  " 


3  The  series  connection 

In  this  section  we  will  consider  the  series  connection 
of  linear  dynamics  and  hysteresis  in  somewhat  more 
detail.  A  much  more  extensive  discussion  may  be  found 
in  [3]. 
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Figure  2:  Block-diagram  of  the  normalized  physical 
model. 

In  Fig.  3  the  series  connection  of  linear  time- 
invariant  second-order  dynamics  Ds  with  static  gain 
1  and  hysteresis  Hs  is  represented  in  a  block-diagram 
form. 


Figure  3:  Block-diagram  of  the  series  connection  of  lin¬ 
ear  dynamics  Ds  and  hysteresis  Hs. 

The  equations  describing  the  series  connection  are: 

w  +  d8w  +  k's  w  —  k'su  (16) 

y  =  aa\w\(asw  -y)  +  bsw  (17) 

where  an  index  s  is  added  to  the  parameters  to  indicate 
that  we  deal  with  the  series  connection  and  where  the 
accent  in  c'  and  kfs  refers  to  the  fact  that  the  parame¬ 
ters  are  divided  by  the  mass. 

A  hysteresis  loop  is  defined  as  the  stationary  loop 
in  the  input/output  plane,  i.e.  in  the  u/y  plane,  for 
a  quasi-static  monotone  oscilating  input,  e.g.  a  low 
frequency  sinusoid.  In  [3]  the  convergence  of  hysteresis 
loops  has  been  proved  for  the  series  connection.  Note 
that  in  the  quasi-static  case  the  series  connection  is 
similar  to  only  hysteresis  HSi  because  the  dynamics  Ds 
reduces  to  the  simple  transfer  of  1. 

In  [3]  expressions  have  been  derived  for  the  center 
point  and  the  average  slope  of  a  hysteresis  loop: 

Vmax  +  ymin 
2 

Umax  “  ymin 

where  umax  and  umin,  and  ymax  and  ymin  are  the  max¬ 
imum  and  minimum  inbetween  which  the  input  and 
output  oscilate. 

The  hysteresis  area  is  defined  as  the  area  enclosed 
by  a  hysteresis  loop  in  the  u/y  plane.  In  [3]  it  has  been 
derived  that  the  hysteresis  area  is  given  by: 


_  _  Umax  “I"  l^min  ^  Q\ 

—  as  -  (lo) 

—  bs  ( Umax  umin)  (19) 


In  Fig.  4  a  realistic  hysteresis  loop  is  shown  in  which 
the  characteristics  center  point,  average  slope,  and  hys¬ 
teresis  area  are  visualized. 


u  — *■ 

Figure  4:  A  realistic  hysteresis  loop  where  several  hys¬ 
teresis  characteristics  are  visualized. 


For  decreasing  deviations  from  a  center  point  it  fol¬ 
lows  from  the  previous  expression  that,  because  of  the 
power  three,  the  hysteresis  area  decreases  faster.  This 
has  first  been  noted  in  [7].  As  a  result,  for  small  de¬ 
viations  the  nonlinearity  is  well  approximated  by  its 
average  slope.  Therefore  the  total  model  of  the  series 
connection  is  well  approximated  by: 

Aw  4-  ds  Aw  +  k8  Aw  =  k8  Au  (21) 

Ay  =  bsAw  (22) 

where  A  is  used  to  indicate  that  we  deal  with  deviation 
variables.  The  above  model  can  be  simplified  to: 

Ay  -f  ds  Ay  +  k's  Ay  =  klsbsAu  (23) 


4  Convergence  of  hysteresis  loops 


In  this  section  we  will  show  that  the  hysteresis  loops  of 
the  physical  model  converge  to  stable  loops.  Further¬ 
more,  we  will  compare  the  speed  of  convergence  with 
that  of  the  series  connection. 

Consider  solely  the  hysteresis  equation  in  the  phys¬ 
ical  model,  i.e.  Eq.  12.  From  [3]  we  learn  that  the 
progress  of  the  so-called  left-  and  right-turning-points 
of  a  hysteresis  loop  in  the  Uh/v  plane  are  given  by: 


'*+i 
?i+l 


g  20f(Uh,ma*  Jj 

g  2  Cfc(tXh)maa.  Uh  ,min) 


(24) 

(25) 


with  L  and  R  constant  for  particular  Uh.min  and 
Uh  ,max  •  Clearly,  for  convergence  it  is  required  that 
a  >  0  and  Uh,max—v>h,min  >  0.  The  latter  is  the  differ¬ 
ence  of  the  maximum  and  minimum  inbetween  which 
Uh  oscilates. 

However,  our  goal  is  to  prove  convergence  in  the  u/y 
plane.  To  this  purpose  we  only  need  to  prove  conver¬ 
gence  in  the  u/v  plane,  because  inbetween  v  and  y  there 
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is  only  the  dynamics  D '  (Fig.  2)  which  will  only  cause 
a  transformation  of  the  hysteresis  loop  but  which  has 
no  influence  on  the  speed  of  convergence. 

A  change  in  u  will  cause  changes  in  Uh ,  v ,  and  y. 
According  to  Eq.  11  the  changes  satisfy: 

A uh  =  An  -  pi  Av  +  p0  A y  (26) 

where  A  is  used  to  indicate  that  we  deal  with  changes 
in  the  variables.  For  clearity  both  pi  and  po  are  positive 
constants.  Suppose  that  a  Au  >  0  gives  a  A  uh  <  0. 
Due  to  the  monotony  of  the  hysteresis  equation  (section 
2)  we  then  have  Av  <  0.  It  is  now  easy  to  derive 
that  Eq.  26  is  only  satisfied  for  a  Ay  <  0.  In  other 
words  Av  and  Ay  have  the  same  sign.  For  arbitrary 
Av  this  is  only  the  case  when  there  is  no  phase  shift 
between  Av  and  A y,  or  in  other  words  in  case  of  a 
quasi-static  experiment.  It  is  just  in  this  case  that  it 
is  easy  to  show  that  the  scenario  sketched  above  leads 
to  a  contradiction.  In  quasi-static  operation  we  have 
Av  =  Ay,  because  the  static  gain  of  the  dynamics  has 
been  normalized  to  1  in  section  2.  By  substituting  this 
in  Eq.  26  we  obtain: 


speeds  up  convergence  because  uh,max  -  Uh,min  be¬ 
comes  larger  due  to  this  positive  feedback.  The  inner 
feedback  loop  (pi)  slows  down  the  convergence  because 
'U'h,max  U h,min  becomes  smaller  due  to  this  negative 
feedback.  The  terms  positive  and  negative  feedback 
are  visualized  in  the  block-diagram  of  Fig.  2.  If  overall 
convergence  is  speeded  up  or  slowed  down  depends  on 
the  input  frequency,  i.e.  on  the  magnitude  and  phase 
shift  of  the  transfer  function  of  the  dynamics.  However 
the  three  examples  given  above  indicate  that  gener¬ 
ally  convergence  is  slowed  down.  In  practice  the  input 
frequency  will  be  well  below  the  eigenfrequency  of  the 
linear  dynamics,  i.e.  in  practice  we  deal  with  the  quasi¬ 
static  case.  In  quasi-static  operation  the  convergence 
behavior  is  slowed  down  according  to  Eq.  27.  Conclud¬ 
ing,  for  this  model  hysteresis  loops  will  converge,  but 
the  speed  of  convergence  differs  from  that  of  the  se¬ 
ries  connection.  In  general  the  speed  of  convergence  is 
slower  and  in  the  quasi-static  case  it  is  surely  slower. 

5  Relations  between  the  parameters  of 
the  two  models 


A uh  =  A u  -  (ft  -  ft)  A y  (27) 

In  terms  of  the  original  parameters  it  can  be  derived 
that  pi  —  Po  =  If  >  0.  Again  suppose  that  a  A u  >  0 
gives  a  A  Uh  <  0.  For  Eq.  27  to  be  satisfied  we  need 
Av  —  Ay  >  0.  However,  due  to  the  monotony  of  the 
hysteresis  equation  this  is  not  possible. 

Concluding,  in  the  general  case  a  A u  >  0  gives  a 
A  Uh  >  0  which  gives  a  Av  >  0.  Depending  on  the  fre¬ 
quency/  phase  shift  Ay  can  be  positive  or  negative.  Let 
us  consider  three  examples.  1)  In  quasi-static  operation 
we  have  no  phase  shift  and  therefore  Ay  —  Av.  2)  If 
the  system  we  deal  with  was  linear  we  would  have  that 
at  the  resonance  frequency  the  phase  shift  was  90  de¬ 
grees  and  as  a  result  that  Ay  =  0  for  Av  =  vmax  -vmin. 
However  due  to  the  hysteresis  the  90  degrees  phase  shift 
occurs  for  a  somewhat  smaller  frequency.  The  exact 
value  depends  on  the  amount  of  hysteresis.  3)  For  very 
high  frequencies  the  phase  shift  is  180  degrees.  Thus 
for  Av  >  0  we  have  Ay  <  0.  However  the  amplitude 
drops  to  zero  for  very  high  frequencies,  i.e.  Ay  =  0. 
These  three  examples  show  that  Eq.  26  is  satisfied  for 
all  frequencies.  This  equation  may  also  be  written  as 
follows: 


In  [3]  the  parameters  of  the  series  connection  have  been 
identified  from  experiments.  Therefore  in  this  section 
we  will  derive  relations  between  all  parameters  of  the 
series  connection  and  the  normalized  physical  model. 
To  this  purpose  we  will  derive  expressions  for  the  center 
point,  the  average  slope,  and  the  hysteresis  area  for  the 
normalized  physical  model. 

Consider  solely  the  hysteresis  equation  of  the  phys¬ 
ical  model,  i.e.  Eq.  12.  Analogous  to  Eq.  18  and  19 
the  expressions  for  center  point  and  average  slope  are 
given  by: 

Vmax  Vmin  _  f  Uh^max  4"  /on\ 

2  ~  °  2  ^ 
Vmax  Vmin  b  (u/i.maj  (30) 

To  determine  the  center  point  for  our  total  model 
we  replace  Hf  by  a '  in  Fig.  2  and  because  we  con¬ 
sider  quasi-static  operation  the  dynamics  D '  may  be 
replaced  by  1.  The  following  expression  may  now  be 
derived: 

ymax  4"  2/min _  0>  4"  foi\ 

2  *  1  +  a' (ft  —  ft)  2  1  ' 


uh,max 


V'h,min 


—  umax  umin  (28) 

Pi  (Vmax  ~  ^min)  H"  Po  Ay 


We  now  have  an  expression  for  Uh,max  —  Uhymin 
which  is  an  important  term  in  the  exponent  of  Eq.  24. 
Note  that  if  Ay  >  0  the  outer  feedback  loop  (P0) 


To  determine  the  average  slope,  H*  is  replaced  by  bf 
and  D  again  by  1.  The  following  expression  may  now 
be  derived: 


Vmax  ymin  — 


1  4-  b'(pi  —  po) 


("Umax  Umin)  (32) 
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Again  consider  solely  the  hysteresis  equation,  i.e. 
Eq.  12.  Analogous  to  Eq.  20  the  expression  for  the 
hysteresis  area  in  the  Uh/y  plane  is  given  by: 

4  (  /  wv  ( Uh  max  ~  uh,min\  /oo\ 
**»/»  =  3  (o  ~b)  a  ( — - - - ^ - I  (33) 

Because  we  are  interested  in  an  expression  for  the 
hysteresis  area  in  the  u/y  plane  we  have  to  find  an 
expression  for  Uh,max  Ufo^min  a  function  of  Umax 
umin.  Here  Eq.  28  may  serve  as  a  start.  Because  we 
consider  quasi-static  operation  we  have  A y  =  ymax  — 
ymin  =  Umax  “  u-min •  After  substituting  Eq.  32  the 
following  relation  may  be  derived: 

Uh,max  ~Ufi,min  ~  j|  _|_  ~f3~)  ^'max  ~  ^m*n)  (34) 

Substitution  of  Eq.  34  in  Eq.  33  and  scaling  by  a 
factor  1  +  b\fii  -f30)  gives  the  relation  for  the  hysteresis 
area  in  the  u/y  plane: 

_ 4  {a  —b)  CX.  /  Umax  ~~  Umin\  forA 

_  3  a + m  -  Po))2  v  2  )  (J5j 


Comparing  Eq.  18-20  and  23  with  Eq.  31,  32,  35, 
and  39  after  some  manipulations  leads  to  the  following 


relations  between  the 

parameters: 

a '  = 

O/g 

(44) 

1  -aM-Po) 

b'  = 

bs 

(45) 

1 

$£ 

1 

1 — 1 

a  = 

1  as  {Pi  Po) 
s  1  -  bs(pi  -  p0) 

(46) 

K  - 

K{  1  +  bspo) 

(47) 

c'  = 

cf 

(48) 

All  parameters  of  the  normalized  physical  model  have 
now  been  related  to  the  ones  of  the  series  connection. 
The  relations  are  convenient  when  we  want  to  deter¬ 
mine  the  parameters  of  the  normalized  physical  model 
from  experiments,  because  the  parameters  of  the  se¬ 
ries  connection  are  directly  related  to  the  experimental 
characteristics  center  point,  average  slope,  and  hystere¬ 
sis  area.  From  the  relations  above  we  can  conclude  that 
these  characteristics  are  equally  accurately  described 
by  both  models. 


Analogous  to  section  3  we  note  that  for  small  de¬ 
viations  from  a  center  point  the  nonlinearity  is  well 
approximated  by  its  average  slope.  The  total  physical 
model  is  therefore  well  approximated  by: 

A Uh  —  A u-  /3iAv  4-  po Ay  (36) 
Av  =  b'Auh  (37) 

Ay  4-  c'Ay  4-  k'0Ay  -  k'Q Av  /  (38) 

where  A  again  indicates  that  we  deal  with  deviation 
variables.  The  above  model  can  be  simplified  to  the 
following  equation: 

Ay  4-  c'Ay  4-  k'cAy  =  /3'CA u  (39) 


6  The  feedback  parameters 

In  section  4  it  is  shown  that  the  speed  of  convergence 
of  hysteresis  loops  is  different  for  the  physical  model  as 
for  the  series  connection.  The  feedback  parameters  po 
and  pi  determine  the  difference  in  convergence.  These 
parameters  are  still  unknown  in  the  expressions  derived 
in  the  previous  section.  Therefore,  in  this  section  we 
will  present  a  way  to  identify  the  feedback  parameters 
from  experiments. 

The  progress  of  left-  and  right-turning-points,  i.e. 
the  convergence  behavior,  is  given  by  Eq.  24  and  25. 
In  quasi-static  operation  Ay  =  Av.  Substituting  Ay  = 
Umax  Umin  in  Eq.  28  and  the  resulting  equation  in  the 
exponent  of  Eq.  24  gives: 


where 


K 

Pc 


ht  1  4-  bf(Pj  -  p0) 

0  1  +  b'Pi 

W 

1  +  b'Pi 


(40) 

(41) 


In  terms  of  the  original  physical  parameters  k'c  and  p'c 
are,  as  in  [7],  derived  to  be  given  by: 


K 


Pc  = 


1  bT 
m  6  +  C 


(42) 

(43) 


2ot(Uhimax  Uhtmin)  —  2q:(' Umax  —  Umin  —  (49) 

{Pi  —  Po)  {ymax  2/min)) 

Substitution  of  Eq.  19  and  46  in  the  just  obtained  equa¬ 
tion  gives: 

2 °(.{Uh,max  ~  Uh^min)  =  2c*5(l  —  Us{Pi  —  pQ )) 

{Umax  Umin  )  (50) 

The  value  of  the  exponent  of  convergence  can  be  ex¬ 
perimentally  determined,  enabling  us  to  determine  the 
value  of  Pi  -  p0  because  as  and  as  are  already  known. 

In  section  4  we  noted  that  for  a  frequency  somewhat 
smaller  than  the  resonance  frequency  we  have  a  phase 
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shift  of  90  degrees  and  as  a  result  we  have  Ay  =  0  for 
Av  =  Vraax  —  vmin.  In  this  case  Eq.  28  simplifies  to: 

'U>h,max  ~~Uh,min  =  "■  A (%oi  “ ^min )  (51) 


For  not  too  large  amplitudes  we  have  vmax  —  vmin  = 
b'iu^max  -  uh,min)-  Substituting  this  gives: 

Uh.min  —  Z  ;  n  (jt max  ^min)  (52) 

1  +  V  Pi 

Substituting  this  expression  and  Eq.  46  in  the  exponent 
of  Eq.  24  gives: 


2  OL^Uh^ax  uh,min )  —  2tt, 


1  as{Pi  fio) 


1  +  bs/30 
{j^max  ~~  lAmin)  (53) 


The  only  unknown  in  the  above  expression  is  /30  which 
can  thus  be  determined  if  this  exponent  is  also  experi¬ 
mentally  determined. 

For  the  series  connection  the  exponent  is  2as(umax~ 
Umin )•  For  low  frequency  (If)  and  frequencies  some¬ 
what  smaller  than  the  resonance  frequency  =  high  fre¬ 
quency  (hf)  the  ratio  of  the  rate  of  convergence  of  the 
physical  model  and  the  series  connection,  denoted  by 
r,  may  be  derived  to  be: 


Tu 

Thf 


1  -  a8(Pi  -  po) 
1  -  as(Pi  —  ftp) 
1  4-  bsfio 


(54) 

(55) 


Preliminary  experiments  have  shown  that  the  con¬ 
vergence  behavior  is  more  accurately  described  by  the 
physical  model. 


7  Conclusion 

Based  on  the  literature  we  derived  a  physical  model 
for  a  voltage-driven  piezo-actuated  positioning  mecha¬ 
nism.  The  hysteresis  nonlinearity  that  plays  an  impor¬ 
tant  role  here  has  been  modeled  by  a  nonlinear  first- 
order  differential  equation.  The  total  model  consists 
of  the  series  connection  of  the  differential  equation  of 
hysteresis  and  a  linear  time-invariant  second  order  dif¬ 
ferential  equation  and  two  feedback  loops  with  constant 
gains. 

A  second,  less  complicated,  model  consists  of  the  se¬ 
ries  connection  of  a  linear  time-invariant  second-order 
differential  equation  and  the  differential  equation  of 
hysteresis.  For  this  model  convergence  of  hysteresis 
loops  has  been  proven  in  the  literature.  In  this  contri¬ 
bution  we  have  proved  convergence  of  hysteresis  loops 
for  the  physical  model.  Furthermore  we  showed  that 
convergence  is  slower  for  the  physical  model  than  for 
the  series  connection. 


In  the  literature  the  model  parameters  of  the  se¬ 
ries  connection  have  been  identified  from  experiments. 
This  model  describes  very  well  the  center  point,  aver¬ 
age  slope,  and  area  of  a  stationary  hysteresis  loop.  The 
physical  model  describes  these  characteristics  equally 
well.  The  difference  between  the  two  models  is  the 
speed  of  convergence  of  hysteresis  loops.  Analytical 
expressions  have  been  derived  relating  the  feedback  pa¬ 
rameters  to  the  exponent  of  the  speed  of  convergence. 
Preliminary  experiments  have  shown  that  the  physical 
model  describes  the  convergence  behavior  more  accu¬ 
rately. 

Based  on  the  series  connection  we  have  already  de¬ 
veloped  two  controllers  and  succesfully  applied  them 
to  an  existing  set-up  in  [2]  and  [4].  In  the  near  future 
we  will  investigate  whether  the  physical  model  can  be 
used  as  a  basis  for  controller  design. 
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Abstract 

This  paper  addresses  robotics,  humans  and  virtual  or 
simulated  characters,  multi-body  systems'  which  have  im¬ 
portant  similarities  and  differences.  In  robotics,  speed  and 
precision  are  often  of  primary  concern,  while  in  virtual 
character  simulation,  realistic  performance  may  be  most 
important.  In  all  cases,  however,  the  goals  of  classifying 
and  retrieving  motion  information  from  example  situations 
and  modifying  that  motion  to  meet  the  goals  of  new  situa¬ 
tions  are  common.  We  present  preliminary  results  for  prob¬ 
lems  in  motion  classification  and  control  using  the  simple 
but  powerful  analogy  that  motion  sequences,  just  like  visu¬ 
ally  observed  objects,  have  characteristic  shapes. 


1.  Introduction 

Many  studies  have  been  directed  toward  understanding 
and  modeling  the  way  biological  systems,  particularly  hu¬ 
mans  and  animals,  perform  motion  tasks  such  as  walking, 
running,  typing  and  lifting.  Such  research  is  fundamental 
to  the  development  of  procedures  for  improving  the  way 
humans  and  future  robots  will  be  able  to  perform  com¬ 
plex  tasks.  This  paper  addresses  robotics,  humans  and  vir¬ 
tual  or  simulated  characters,  multi-body  systems  which  have 
important  similarities  and  differences.  In  robotics,  speed 
and  precision  are  often  of  primary  concern,  while  in  virtual 
character  simulation,  realistic  performance  may  be  most  im¬ 
portant.  In  all  cases,  however,  the  goals  of  classifying  and 
retrieving  motion  information  from  example  situations  and 

JA  multi-body  system  is  one  that  is  composed  of  several  connected 
parts  or  bodies  [15]. 


modifying  that  motion  to  meet  the  goals  of  new  situations 
are  common. 

We  anticipate  an  explosion  in  the  availability  of  mea¬ 
sured  motion  data  over  the  next  ten  years,  which  is  an  ex¬ 
citing  trend.  Motion  databases  may  be  useful  in  themselves 
for  purposes  of  studying  human  motion.  For  example,  re¬ 
searchers  in  medicine  may  use  databases  of  normal  ver¬ 
sus  abnormal  motion  to  develop  diagnostic  procedures  and 
investigate  treatment  strategies.  Ergonomic  investigators 
could  study  the  finger  motions  in  repetitive  typing  tasks, 
for  different  individuals  and/or  keyboards,  to  quantify  the 
muscle  strain  that  produces  carpal  tunnel  syndrome. 

Motion  databases  also  may  be  useful  in  learning  how  to 
recognize  motion.  For  example,  one  might  want  to  distin¬ 
guish  running  from  walking  or  performing  some  other  ac¬ 
tivity.  There  is  a  considerable  amount  of  research  being 
performed  in  recognizing  gestures  within  the  context  of  spe¬ 
cific  applications  (see  [5]  for  an  overview).  The  ability  to 
construct  large  motion  databases  with  relatively  little  effort 
will  provide  additional  information  for  these  efforts.  The 
results  have  the  potential  to  dramatically  change  the  way 
we  interact  with  the  computer. 

To  classify  and  modify  motion  sequences,  we  employ 
the  simple  but  powerful  analogy  that  motion  sequences, 
just  like  visually  observed  objects,  have  characteristic 
shapes.  The  specific  idea  is  that  motion  sequences  can  be 
classified  based  on  “signature  curves”  formed  by  plotting 
important  parameters  of  the  motion  against  one  another. 
For  example,  Figure  1  (left)  shows  a  plot  of  the  knee  an¬ 
gle  vs.  the  hip  angle  of  a  simulated  male  running  character 
described  in  [6]2.  This  particular  type  of  phase  space  plot 


2  The  runner  data  is  being  used  with  the  permission  of  Jessica  Hodgins 
at  Georgia  Tech. 
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Figure  1.  Right  leg  knee  vs.  hip  plots  for 
man,  half-man/woman  and  woman  with  su¬ 
perimposed  algebraic  curves 


has  been  used  for  diagnostic  purposes  by  the  biomechan¬ 
ics  community  (e.g.  [2]).  Section  2  outlines  our  general 
approach  to  motion  classification  and  retrieval  using  plots 
such  as  these. 

To  allow  users  to  modify  motion  sequences,  we  employ 
primitive  signature  curve  shape  alterations  and  motion  sur¬ 
face  interpolation,  as  detailed  in  Section  3.  The  key  idea  is 
that  motion  signature  curves  that  have  good  discriminatory 
power  may  also  be  the  most  important  aspects  of  the  motion 
to  get  right  when  creating  a  new  motion  sequence. 

Motion  signature  curves  depict  a  static  2D  picture  of  dy¬ 
namic  movement.  Therefore,  to  obtain  a  more  complete 
description  of  motion,  we  introduce  time  as  an  additional 
parameter,  which  is  the  focus  Section  4.  We  conclude  with 
some  summary  observations  and  an  outline  of  our  proposed 
future  efforts  in  Section  5. 

2.  Motion  Signature  Curves  as  Implicit  Alge¬ 
braic  Models 

In  this  section  we  show  how  to  describe  motion  se¬ 
quences  using  two-dimensional  (2D)  phase  plots ,  which  are 
plots  of  one  motion  parameter  (e.g.  a  joint  angle  value)  ver¬ 
sus  another.  We  will  focus  on  motions  that  are  cyclic  or 
repetitive  in  nature,  which  will  give  us  plots  with  charac¬ 
teristic  2D  shapes.  Once  appropriate  sets  of  2D  phase  plots 
have  been  determined,  we  quantify  them  using  algebraic 
curves,  as  in  [16],  which  will  define  our  new  motion  signa¬ 
ture  curves. 

.  An  algebraic  curve  of  degree  n  can  be  defined  in  the 
{o',  0}  phase  plane  by  the  implicit  equation 

Fn(oc,  0)  =  ano<yn  +  an-i,iOtn  1/3  - aon0n 

+an-l,OOn  1  +Cfrw2,lOn  2/?+  .  .  .  +  GOl/?  +  ^00  =  0  (1) 


Figure  2.  A  typical  signature  curve 

Since  ( 1)  can  be  multiplied  by  any  non-zero  scalar  without 
changing  the  zero  set,  we  often  will  find  it  convenient  to 
deal  with  monic  cioves ,  which  are  defined  by  the  condition 
that  an o  =  1.  Figure  2  depicts  a  typical  signature  curve 
defined  by  a  quartic  (4th  degree)  algebraic  curve. 

We  formally  prove  in  [11,  17]  that  any  signature  cwye 
defined  by  (  1 )  can  be  expressed  as  a  finite  sum  of  geometric 
primitive  products ,  namely 

Fn(a,0)  = 

/3)-t-Qn_2(a,  0)-\-Qn-fia,  /?)+. .  .4- Q\  or  2(0, 0), 

(2) 

where  each 

Qk(a,  ff)  =  n  C«(Q,  0)  n  Lkj(a.  0)  (3) 

is  a  unique  conic-line  product.  Moreover,  each  (primitive) 
conic  factor  Cki(a,  0)  =  a2+qkiad+rkiP2+8kiOt+tki0+ 
Vki  of  (  3)  has  a  unique  center  pki  =  (q/c,  0ic),  which  is 
easily  determined  by  the  relationsf  10]  2aiC+qki0iC-\-ski  = 
0  and  qkiotic  +  2rki0ic  +  tki  =  0.  The  intersections  of  the 
real  (primitive)  line factors  Lkj (a,  0)  =  a- f  lkj 0  +  kkj  are 
also  easily  obtained. 

In  the  special  case  of  closed-bounded  quartics,  it  can  be 
shown  that[12] 

F4(a,  0)  -  (a2  +  qiQ0  +  rt02  +  s±a  +  ti0  +  Vi) 

C\  (a,  0) 

(a2  +  q2a0  +  r20 2  +  s2a  +  t20  +  v2) 

C2(a,0) 

+  tz(a2  +  fl2  +  s3a  +  h0  +  o3)  =  0,  (4) 

C3(a,0) 

the  product  of conics  C\  (a,  0)  andC2(a,0)  (ellipses  in  this 
case),  plus  a  third  conic  nCz(a,  0)  (a  circle  in  this  case). 
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Figure  3.  E2C  Representation  of  the  same  sig-  Figure  4.  Relative  positions  of  three  charac- 

nature  curve  ters  in  2D  invariant  space 


Figure  3  displays  this  unique  E2C  representation  for  the 
signature  curve  depicted  in  Figure  2. 

Quartic  curves  and  their  E2C  representations  define  10 
geometric  invariants ,  namely  the  4  major/minor  half-axes 
lengths  of  the  two  ellipses,  the  radius  of  the  circle,  the  3 
distances  between  the  centers  of  the  ellipses  and  the  cir¬ 
cle,  and  the  two  angles  that  define  the  orientation  of  the 
ellipses  relative  to  the  circle.  One  can  also  define  two  al¬ 
ternative  invariants  as  the  ratio  of  the  major  and  minor  axes 
of  the  ellipses.  These  two  elliptical  ratio  invariants  were 
used  in  [12]  to  identify  various  objects  by  the  relative  posi¬ 
tions  of  their  ratio  invariants  in  a  two-dimensional  invariant 
space ,  since  these  invariants  change  only  slightly  with  small 
changes  in  the  signature  curve. 

Our  investigations  have  shown  that  it  is  possible  to  model 
some  motion  signature  curves  by  quartic  equations,  where 
elliptical  ratio  invariants  can  be  used  to  quantify  corre¬ 
sponding  motions.  In  other  cases,  higher  degree  algebraic 
curves  will  have  to  be  employed  to  accurately  model  mo¬ 
tion  data.  In  these  cases,  alternative  ratio  invariants  can  be 
defined  and  employed.  In  particular,  the  ratios  of  Fn  (a,  fS) 
at  any  two  conic  factor  centers  or  line  factor  intersections, 
namely 


def  Fn(pki) 

"  Fn(PkjY 


(5) 


are  related-point  ratio  invariants[  17]  which  also  can  be  de¬ 
fined  and  used  to  quantify  and  uniquely  identify  motion  sig¬ 
nature  curves  of  any  degree  n[14]. 

We  next  note  that  it  is  possible  to  separate  and  uniquely 
identify  a  number  of  different  motions  by  an  automatic, 
quantitative  analysis  of  our  signature  curves.  For  example, 
Figure  1  depicts  the  data  points  and  the  corresponding  sig¬ 
nature  curves,  using  the  fitting  algrorithm  [8],  defined  for 
the  male  and  female  runners,  as  well  as  a  half-man/woman 
character  created  as  part  of  a  geometric  morphing  operation 


described  in  [7].  In  this  case,  6th  degree  curves  were  neces¬ 
sary  to  obtain  the  depicted  accuracy  of  fit.  Moreover,  in  this 
particular  case,  Qe(a,3)  =  C^o.  3)C2(aJ3)C3(a  .  j)  in 
(2),  the  product  of  three  conics  whose  centers  were  sub¬ 
stituted  into  (5)  to  define  two  related-point  ratio  invariants 
for  each  character.  Note  that  the  relative  positions  of  these 
invariants  in  the  2D  invariant  space  depicted  in  Figure  4 
clearly  and  (somewhat)  uniformly  differentiate  these  3  vir¬ 
tual  characters. 

3.  User  Interaction:  Modifying  Existing  Mo¬ 
tion 

Not  only  are  signature  curves  useful  for  discriminating 
between  character  motions,  they  also  are  useful  for  modi¬ 
fying  motion.  The  following  subsections  describe  two  ap¬ 
proaches  for  allowing  a  user  to  modify  a  motion  sequence: 
curve  shape  alterations  based  on  primitive  changes  and  an 
automation  interpolation  of  motion  signature  surfaces.  Our 
approach  differs  from  previous  work  in  that  we  use  motion 
signature  curves,  rather  than  joint  angles  or  joint  torques  as 
a  basis  for  modifying  motion  and/or  interpolating  between 
two  motion  sequences. 

3.1.  Curve  and  Pose  Modification 

Many  desired  motions  will  not  be  attainable  through  in¬ 
terpolation  or  blending  of  other  motions;  the  user  will  need 
to  modify  an  existing  motion  sequence.  One  approach  is  to 
allow  a  user  to  change  the  phase  plots  by  modifying  the  de¬ 
scriptive  parameters  of  these  phase  plots.  Recall  the  E2C 
representation  of  the  signature  curve  depicted  in  Figure  2. 
As  illustrated  in  [13],  the  base  ellipses  can  be  transformed 
to  their  canonical  form ,  with  their  centers  at  the  origin  and 
their  major  and  minor  axes  coincident  with  the  horizontal 
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Figure  5.  Signature  curve  shape  modifica¬ 
tions  via  primitive  elliptical  axis  changes 


Figure  6.  Signature  curve  shape  modifica¬ 
tions  via  primitive  elliptical  axis  changes 


Figure  7.  Original(solid)  and  modified 
(dashed)  signature  curves  for  a  simulated 
runner 


Figure  8.  A  user-modified  running  motion 
is  shown.  The  original  running  motion  is 
“ghosted”  behind  the  new  motion 


and  vertical  axes,  respectively.  Multiplicative  factors  can 
then  be  used  to  scale  either  or  both  eigenvalues  of  their 
canonical  matrices  to  produce  appropriate  shape  changes 
along  their  major/minor  axes.  An  inverse  transformation  to 
the  original  ellipses  produces  corresponding  changes  to  the 
original  signature  curve.  Using  such  a  procedure,  a  designer 
could  produce  significant  changes  to  initial  signature  curves 
by  varying  such  parameters  in  an  interactive  environment. 
Figure  5  depicts  the  effect  that  such  modifications  along  the 
minor  axis  of  Ci(a  ,  /3)  has  on  the  shape  of  the  original  sig¬ 
nature  curve,  while  Figure  6  depicts  the  shape  changes  pro¬ 
duced  by  analogous  modifications  along  the  major  axis  of 
C2(a,  /3).  Figures  7  and  8  illustrate  how  such  curve  mod¬ 
ification  techniques  can  be  used.  In  Figure  7,  the  plot  of 
knee  vs.  hip  angle  has  been  modified  to  produce  a  slightly 
larger  range  of  motion  for  both  the  hip  and  the  knee.  Figure 
8  shows  a  filmstrip  of  the  results.  The  new  motion  is  shown 
in  white  and  the  original  motion  is  “ghosted”  in  grey.  The 
rendering  technique  is  taken  from  [9];  it  is  fast  enough  to 
play  back  the  running  motion  in  real-time  (see  Section  4)  so 
that  the  user  can  immediately  see  the  effects  of  a  modifica¬ 
tion. 

3.2.  Automatic  Interpolation  of  Motion  Surfaces 

The  motion  signature  curves  associated  with  various 
phase  plots  will  vary  by  the  type  of  motion,  by  the  style 
or  speed  of  the  motion,  and  by  the  individual  performing 
the  motion.  Controlled  variations  in  a  motion,  such  as  run¬ 
ning  at  different  speeds,  lifting  various  objects,  or  the  grad¬ 
ual  transition  from  (say)  walking  to  running,  will  create 
families  of  signature  curves.  Once  we  have  established  a 
database  of  2D  signature  curves  for  different  character  mo¬ 
tions,  we  propose  the  automatic  interpolation  of  3D  motion 
signature  suifaces  defined  between  two  or  more  2D  signa¬ 
ture  curves,  as  in  [13].  Different  interpolation  algorithms 
will  produce  different  interpolated  motions  that  “exist”  be¬ 
tween  the  base  motions,  as  our  preliminary  investigations 
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have  shown. 

For  example,  the  middle  plot  in  Figure  1  represents  a 
character  geometrically  halfway  between  the  character  rep¬ 
resented  in  the  left  and  right  plots,  but  the  shape  of  the 
curve  is  not  what  we  would  expect  from  a  straightforward 
interpolation  of  joint  angles  from  one  character  to  the  next. 
However,  if  we  plot  the  same  information  in  one  invariant 
space  (Figure  4),  we  see  that  a  logical  interpolation  function 
within  this  space  can  easily  be  defined. 

Generally  speaking,  there  can  be  problems  with  interpo¬ 
lating  general  planar  curves  or  profiles: ;  e.g.  how  can  one 
interpolate  a  (three-sided)  triangle  and  a  (four-sided)  rect¬ 
angle?  As  noted  in  [1],  there  is  no  guarantee  that  a  non -self- 
intersecting  polyhedral  surface  can  be  found  which  interpo¬ 
lates  between  polygon  profiles.  A  measure  of  the  “good¬ 
ness”  of  any  particular  interpolation  is  also  lacking,  as  are 
general  techniques  for  including  more  than  two  consecutive 
profiles  in  a  single  interpolation  strategy.  For  such  reasons, 
we  have  developed  new  strategies  for  generating  3D  signa¬ 
ture  surfaces  which  interpolate  the  individual  (second  de¬ 
gree)  conic  factors  of  our  signature  curves,  rather  than  the 
curves  themselves[13]. 

For  example,  a  single  implicit  equation  can  be  used  to 
model  an  entire  3D  signature  surface  defined  by  (say)  three 
quartic  signature  curves  that  are  perpendicular  to  a “f  ”  axis 
of  orientation,  where  t  could  denote  particular  “times”  when 
a  given  signature  curve  would  be  dynamically  active.  If 
Ca(ct,  (3)  =  YuaijaiPj  -  0  at  t  =  tF,  Cb(a,/3)  - 
Y2  bijGl8i  =  0  at  t  =  t g,  and  Cc(o,  (3)  —  Y^cija%&*  —  0 
at  t  —  ip  define  any  three  corresponding3  conics  of  these 
signature  curves,  a  conic  interpolated  implicit  signature 
surface  through  these  three  conic  profiles  could  be  de¬ 
fined  by  setting  the  coefficients  a.\j  of  Ca{a,j3)  equal  to 
aij(t)  —  dij  -I-  €ijt  4-  fijt2,  where  each 


[djj  €jj  fij ]  —  [04 
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because  ay(f  =  tF)  =>  Ca(a,/3)  -  0,  a,j(t  ~  tG)  => 
Cb(a,0)  =  Oand  -  tK)  =»  Cc{a,j3)  =  0. 

We  expect  that  interpolation  of  motion  signature  curves 
will  give  more  accurate  results  than  a  simple,  linear  inter¬ 
polation  between  joint  angle  curves.  One  advantage  of  the 
proposed  interpolation  scheme  is  that  it  continuously  alters 
global  shape  parameters  rather  than  locally  defined  state 
vectors.  One  area  of  future  work  is  to  explore  the  utility 
of  this  interpolation  scheme  for  allowing  a  user  to  blend  a 
number  of  different  motion  sequences  to  create  new  motion. 


3Note  that  changes  in  the  correspondence  among  the  conics  would  pro¬ 
duce  different  3D  surfaces. 


Figure  9.  Knee  angle  vs  time  for  the  origi- 
nal(solid)  and  modified(dashed)  runner 


Figure  10.  Hip  angle  vs  time  for  the  origi 
nal(solid)  and  modified(dashed)  runner 


4.  Simulated  Motion 

Clearly,  motion  signature  curves  depict  a  static  2D  pic¬ 
ture  of  dynamic  movement.  Therefore,  to  obtain  a  more 
complete  description  of  motion,  we  must  introduce  time  as 
an  additional  parameter,  which  is  the  focus  of  this  section. 
To  do  this,  we  first  sequentially  order  all  of  the  boundary 
points  defined  around  a  motion  signature  curve.  Time  can 
then  be  defined  to  be  proportional  to  the  distance  between 
any  base  point  (starting  at  t  —  0)  and  the  increasing  distance 
to  all  subsequent  ordered  points.  Using  this  procedure,  we 
have  defined  a  periodic  time-varying  function  for  both  the 
knee  angle  and  the  hip  angle  shown  in  Figures  9  and  10  for 
both  signature  curves  depicted  in  Figure  7. 

However,  such  plots  may  not  imply  realistic,  human-like 
motion.  Therefore,  to  produce  more  realistic  motion,  we 
could  use  the  time  intervals  defined  between  actual  motion 
data  points  to  specify  the  relative  velocity  at  any  point  on  a 
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closed  signature  curve  as  a  function  of  that  particular  posi¬ 
tion  on  the  signature  curve,  thereby  producing  a  data-based 
time-scaling  curve.  When  we  then  modify  motion  by  al¬ 
tering  the  shape  of  a  (static)  motion  signature  curve,  we 
can  employ  this  same  time-scaling  curve  to  define  a  veloc¬ 
ity  function  that  would  more  realistically  reproduce  the  new 
motion. 

5.  Concluding  Remarks 

We  have  outlined  some  new,  quantitative  procedures 
for  modeling  and  analyzing  various  motion  tasks  involv¬ 
ing  temporal  and  spatial  relationships  in  multi-body  sys¬ 
tems.  Motion  databases,  which  play  an  important  role  in 
these  ongoing  investigations,  have  been  employed  in  the 
creation  of  2D  phase  plots  for  depicting  various  types  of 
human  and  human-like  motion.  Motion  signature  curves, 
or  implicit  algebraic  models  of  these  phase  plots,  have  been 
defined  and  used  for  the  first  time  to  classify  and  identify 
the  shapes  of  various  motion  tasks.  Primitive  decomposi¬ 
tions  of  our  signature  curves  have  been  used  to  both  modify 
existing  curves  and  to  interpolate  signature  surfaces  for  dif¬ 
ferent  tasks  and/or  different  individuals.  These  static  signa¬ 
ture  curves  are  then  parameterized  in  various  ways  to  pro¬ 
duce  dynamic,  time-varying  motion.  Future  investigations 
will  focus  on  expanding  many  of  our  investigations  using 
other  motion  parameters  and  interpolating  them  in  higher 
dimensions. 
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Abstract 

In  this  paper  we  present  a  new  methodology  for  the 
residual  vibration  reduction  in  point-to-point  motions 
of  mechanical  systems  endowed  with  elastic  transmis¬ 
sions.  The  approach  consists  of  selecting  a  smooth 
polynomial  motion  function  for  the  load,  which  assures 
the  absence  of  oscillation  during  and  at  the  end  of  the 
motion ,  and  then  determining  the  corresponding  input 
function  by  means  of  a  noncausal  dynamic  system  in¬ 
version.  An  optimization  procedure  can  be  applied  in 
this  context  in  order  to  minimize  the  motion  time  sub¬ 
ject  to  actuator  constraints  and  to  select  the  system 
inversion  point  that  minimizes  the  worst-case  residual 
vibration  in  presence  of  parameter  uncertainties.  Sim¬ 
ulation  results  show  the  significant  robustness  proper¬ 
ties  of  this  methodology,  in  spite  of  the  simplicity  of 
the  overall  open-loop  control  scheme. 

1  Introduction 

High-performances  positioning  servosystems  such  as 
robot  manipulators,  can  be  significantly  limited  by 
the  presence  of  elasticities  in  the  transmissions,  which 
cause  vibrations  at  the  end  of  a  point-to-point  motion. 
In  fact,  if  the  elasticity  is  neglected  in  the  motion  plan¬ 
ning  phase,  to  accurately  achieve  the  final  position,  a 
delay  time  has  to  be  added  at  the  end  of  the  motion, 
in  order  for  the  residual  oscillation  to  vanish.  Hence, 
the  total  motion  time  necessarily  increases. 

In  order  to  cope  with  this  problem  two  different  strate¬ 
gies  can  be  applied:  a  feedback  control,  which  may  re¬ 
quire  the  knowledge  of  the  state  of  the  system  at  any 
time,  or  a  feedforward  control,  which  implies  that  the 
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motion  law  is  determined  in  advance,  taking  into  ac¬ 
count  the  parameters  of  the  elastic  transmission  [1]. 
In  the  latter  case,  a  further  closed-loop  can  be  added 
to  the  system  in  order  to  cope  with  system  uncertain¬ 
ties  and  parameter  variations.  Regarding  the  motion 
planning  aspect,  many  solutions  have  been  provided  in 
the  literature.  The  most  well-known  technique  is  the 
input-shaping  one,  developed  in  the  last  decade  (e.g. 
see  [2]-[10]),  which  consists  of  convolving  a  sequence 
of  impulses  (the  input  shaper)  with  the  desired  system 
command  in  order  to  generate  the  system  command 
that  is  then  actually  employed  to  drive  the  system. 
The  input  shaper  is  determined  upon  the  natural  fre¬ 
quency  and  the  damping  of  the  vibratory  system  and 
further  developments  have  been  devised  to  increase  ro¬ 
bustness. 

In  a  previous  paper  [11],  we  proposed  an  alternative 
approach  based  on  system  inversion.  Specifically,  an 
arbitrarily  smooth  closed-form  motion  law  for  the  load 
of  the  system  has  to  be  defined  so  that  there  are  no  os¬ 
cillations  during  and  at  the  end  of  the  motion.  Then, 
by  means  of  a  noncausal  system  inversion,  the  corre¬ 
sponding  actuator  input  is  determined.  Polynomial 
functions  are  suitable  to  be  adopted  as  output  func¬ 
tions  for  this  method  since  monotonicity  can  be  easily 
obtained  and  they  assure  the  smoothness  of  the  input 
function  and  its  derivatives  until  an  arbitrarily  pre¬ 
fixed  order.  Furthermore,  in  this  context,  the  motion 
time  can  be  effectively  minimized,  taking  into  account 
actuator  constraints,  by  means  of  a  simple  bisection 
algorithm. 

In  this  paper  we  significantly  improve  the  above 
methodology  by  optimizing,  in  addition  to  the  motion 
time,  also  the  choice  of  the  system  inversion  point  so 
that  it  is  minimized  the  worst-case  amplitude  of  the 
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residual  vibration  in  presence  of  significant  uncertain¬ 
ties  on  the  values  of  the  stiffness  constant  and  damp¬ 
ing  ratio  of  the  system.  In  other  words,  by  means  of 
a  simple  iterative  procedure,  the  worst-case  amplitude 
of  the  residual  vibration  is  reduced  in  a  point-to-point 
motion  whose  duration  is  minimized  subject  to  actua¬ 
tor  constraints. 

The  paper  is  organized  as  follows.  In  Section  2  the  sys¬ 
tem  inversion  based  motion  planning  is  explained.  In 
Section  3  the  optmization  algorithm  is  described  and 
in  Section  4  simulation  results  are  presented.  Conclu¬ 
sions  are  drawn  in  the  last  section. 


2  System  inversion  based 
methodology 

An  elastic  transimission  can  be  simple  represented  by 
the  mass-spring-damper  model  depicted  in  Figure  1, 
where  y  is  the  motor  shaft  displacement,  x  is  the  load 
displacement,  m  is  the  load  mass,  k  the  stiffness  con¬ 
stant  and  c  the  damping  of  the  transmission  [12].  The 
model  is  described  by  the  following  simple  differential 
equation  which  links  x  and  y: 

mx  +  cx  +  kx  =  cy  +  ky  (1) 

which  can  be  re-written  as: 


z  +  2  £unz  +  u)2nz  =  -  y 

where  z  =  x-y,un  =  yfk/m  rad/s  is  the  frequency  of 
the  oscillatory  mode  and  £  =  c/2mun  is  the  damping 
ratio. 

Laplace  transform  operator  can  be  applied  to  equation 
(1),  resulting  in  the  following  transfer  function: 


G(s)  = 


cs  4-  k 

ms 2  -\-cs-\-k 


(2) 


When  the  elasticity  of  the  transmission  is  neglected, 
the  motion  planning  is  accomplished  on  y(£),  with¬ 
out  taking  into  account  the  actual  motion  x(t)  of  the 
load.  This  might  represent  a  major  drawback  when 
the  servosystem  is  required  to  have  high  performances. 
The  basic  idea  proposed  in  this  paper  is  to  adopt  a 
system  inversion  based  methodology  to  calculate  the 
input  function  y(t)  that  determines  a  desired  output 
function  x(t)  without  oscillations.  In  this  context,  it 
appears  that  the  choice  of  the  load  motion  law  x(t)  is  a 
crucial  issue.  Polynomial  functions  present  important 
properties  that  render  them  particularly  suitable  for 
this  purpose.  Hence,  we  define 


x(t).—  antn  +  an_i£n  1  +  . . .  +  ait  +  do  (3) 


[11]),  regarding  the  differentiability  of  the  input  of  the 
system.  Denote  by  the  class  of  functions  which 
have  a  to-th  continuous  derivative.  Let  also  the  i- th 
derivatives  y^(t)  and  x^(t)  be  equal  to  zero  for  all 
t  <  0  and  alH  E  N  (at  time  t  =  0“  all  initial  conditions 
are  equal  to  zero). 

Property  1  Assume  that  h  >  1.  For  the  system  de¬ 
scribed  by  the  differential  equation  ( 1 ),  if  x(t)  E 
then  it  is  functional  reproducible  by  a  unique  y(t )  E 


Hence,  the  degree  h  of  the  polynomial  output  function 
has  to  be  selected  to  guarantee  that  y(t)  E  C^h~^. 
The  n  =  2h  +  2  coefficients  an, . . .  ,oo  can  be  deter¬ 
mined  by  considering  the  point-to-point  motion  from 
position  zero  to  position  q  that  has  to  be  completed  in 
the  time  interval  [0,  r]  with  r  being  the  free  parameter 
that  defines  the  motion  duration  time.  Imposing  that 
x(t ;  r)  E  means  that  the  following  linear  algebraic 

system  has  to  be  solved: 

z(0;  r)  =  0  x(t ;  r)  =  q 

(0;  r)  =  0  x^(t;  t)  =  0 

<  :  :  (4) 

k  z^(0;r)=0  a?^)(r;r)  =  0 

It  comes  out  that  the  general  closed-form  expression 
of  x(t\r)  with  t  E  [0 ,r]  that  results  from  (4)  is  given 
by:  ^ 

x(t;r)  =  a(h,r)q  f  vh(r  —  v)hdv  (5) 

Jo 

with  positive  coefficient  a(/i,r)  =  (/Qr  u/l(r-‘y)/ldt»)_1. 
The  above  formula  (5)  can  be  easily  proven  taking  into 
account  that 


x(t;  t)  =  a(h,  r)qth(r  -  t)h.  (6) 

From  the  above  expression  (6)  it  is  also  apparent  that 
x{t\  t)  is  monotonically  increasing,  since  x(t ;  r)  is  posi¬ 
tive  all  over  (0,  r),  so  that  no  oscillations  occurs  during 
the  whole  motion  of  m. 

At  this  point,  the  input  function  y{t\ r)  can  be  calcu¬ 
lated  by  means  of  the  noncausal  system  inversion,  that 
is: 


v(f ;  r)  =  C-1  r Y(S ;  r)]  =  C~1  [G-1  («)*(«;  r)]  (7) 

where  X (s;  r)  is  the  Laplace  transform  of  x(t ;  r).  After 
a  few  passages,  the  closed-form  expression  of  y(t;  r)  for 
t  >  0  can  be  derived  (obviously  y(t;r)  =  0  if  t  <  0): 


The  choice  of  the  order  n  is  a  direct  consequence  of 
the  following  property  (whose  proof  can  be  found  in 


y{t;r)=  fx(t-,T)  +  (l-*g)x(t-,T)+ 

zg-e-V'WfZeW'  >x(v,r)dv.  W 
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where 


Considering  that  x(t\r)  =  q  and  x(t;r)  =  0  for  t  >  r, 
expression  (8)  can  be  significantly  simplified  for  t  <  r, 
obtaining: 

y{t]  t)  =  q  +  [j/(r,r)  -  9]e_(Vc)(t-T)  (9) 

Prom  (9)  it  appears  how  y(t;r)  is  all  over  bounded 
because  the  excited  zero  mode  is  stable. 

3  Optimization  procedure 

In  the  application  of  the  methodology  described  in  Sec¬ 
tion  2,  it  is  sensible  to  consider  constraints  on  the  ac¬ 
tuators  and  therefore  on  the  input  function  and  its 
time  derivatives,  and  that  the  linkage  parameters  are 
not  exactly  known.  Hence,  we  propose  to  minimize 
the  motion  time,  subject  to  actuator  constraints,  and 
the  worst-case  residual  vibration  amplitude.  Specif¬ 
ically,  we  assume  that  the  values  of  the  linkage  pa¬ 
rameters  c  and  k  are  uncertain,  but  it  is  known  that 
c  G  [c-  c+]  and  k  €  [A;~,fc4'],  with  known  interval 
endpoints.  Using  another  notation,  we  consider  the 
uncertain  parameter  vector  p  =  [c,  k]T  belonging  to 
V  :=  [c“,c+]  x  [fc“,  fc+].  In  this  context,  we  define  the 
input  function  as 

V(t\rd9pd)  =r-1[G-1(S;P(i)X(5;rd)] 

where  rd  G  M+  and  pd  G  V  are  design  parameters.  The 
problem  is  to  optimally  choose  rd  and  pj  according  to: 

1 

r*d  :=  min  rd 
rde  R+ 

subject  to: 

Td,  Pd)|  <  y$ax,  Vi  >  0 

Vpde7>  i  =  l,...,l  (l  <h). 

2-  Pd  =  [cjj  ArJ]  is  the  minimizer  of  the  following 
problem: 

>  Pd-  P)  ~  «l)  (10) 


*(tiT5,Pd>p)  =  -C-1[G(s;p)F(S;rd*,pd)]. 

It  comes  out  that  the  optimal  input  motion  function 
is: 

2/(^j  rd  Jpd)- 

Define  the  residual  vibration  amplitude  as: 

V(Td> Pd.P)  :=max|a;(<;rd*,pd,p)  -g|  (11) 

The  proposed  minimization  of  the  worst-case  resid¬ 
ual  vibration  with  minimum  time  motion  planning  is 
a  problem  that  is  very  difficult  to  solve.  Hence,  we 
look  for  a  suboptimal  solution  that  can  however  be 
very  useful  in  practical  cases.  This  can  be  obtained  by 
relaxing  the  worst-case  search  in  the  dense  V  with  a 
discrete  finite  search  over  the  vertexes  of  V  plus  the 
nominal  midpoint  of  V  given  by  ( c0yk0 ).  In  this  case, 
expression  (11)  is  substituted  by: 

^(T  iPrfjPi)  =  maxi>r  \x(t;  Td,pd,pj)  —  q\ 

3  =  (12) 

where  pi  [c~  k~)T,  p2  :=  [c~  k+]T,  p3  :=  [c+  k~]T, 
p4  :=  [c+  A;+]T,  and  p5  :=  [c0  k0]T.  Defining 

Mv(rd,pd)  :=max{V(r^pd,Pj)  :  j  =  1,...,5} 

(13) 

permits  to  rewrite  expression  10  as  follows: 

min  Mv(rd*,Pd).  (14) 

The  value  of  the  minimum  time  and  the  minimizers 
of  problem  (14)  can  then  be  found  by  means  of  the 
following  optimal  system  inversion  (OSI)  algorithm, 
which  can  logically  be  divided  in  two  parts,  as  well  as 
the  problem.  The  first  is  a  typical  bisection  algorithm 
which  is  devoted  to  the  search  of  rd  which  guarantees 
that  limits  on  the  input  derivatives  are  satisfied  for 
any  pd  G  V,  that  is  for  pd  =  Pj,  j  =  1, . . . ,  5,  since 
we  consider  again  the  relaxed  problem.  Then,  a  local 
search  procedure  is  applied  to  find  c*d  and  k%  that  solve 
problem  (14). 

OSI  Algorithm 

First  part 

1.  Set  Tmjn  =  0. 

2.  Determine  an  initial  value  for  rdrnax  such  as 

maxt>0  |y«  (t;  rdmai ,  Pj-)|  <  y$ax,  i  =  1, . . . ,  l  and 
J  =  1, ,  5. 
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3.  Set  rd  =  (rdmin  +r(JmoJ/2. 

4.  If  maxt>0  |y(i)(f;Td,Pj)|  <  Vmax,  i  =  1,. ..,1  and 
j  =  !,•••,  5  then  set  rdmax  -  rd  else  set  rdmj„  = 
Td- 

5.  If  (rdmax  -  Tdmin)  >  £  then  goto  3. 

6.  Set  T*d  =Tdmai.  ! 

Second  part 

7.  Set  k*d  =  k0,  c*d  =  c0,  flag=0. 

8.  Set  I  :=  Mv(rd,pd)  and  set  I+  =  I. 

9.  Set  k*  =  (1  +  7 )k*d.  If  k*d  >  k+  then  set  k*d  =  k+. 

10.  Set  I  ~  Mv(rd,p*d). 

11.  If  I  <  I+  then  set  flag=l  and  go  to  9. 

12.  If  flag=l  go  to  16. 

13.  Set  kd  =  (1  -  ^)k*d.  If  kd  <  k~  then  set  kd  = 

14.  Set  I  :=  Mv(t%,  p*j). 

15.  If  I  <  I+  then  go  to  13. 

16.  Set  flag=0. 

17.  Set  cd  —  (1  +  7 )cd.  If  cd  >  c+  then  set  cd  =  c+. 

18.  Set  /  :=  Mv(rd,pd). 

19.  If  I  <  I+  then  set  flag=l  and  go  to  17. 

20.  If  flag=l  go  to  18. 

21.  Set  c*  =  (1  —  7)c^.  If  cd  <  c~  then  set  cd  =  c“. 

22.  Set  I  :=  Mv(r*,  pj). 

23.  If  I  <  7+  then  go  to  21. 

24  End. 

It  has  to  be  noted  that  parameter  e  is  a  precision  pa¬ 
rameter  that  can  arbitrarily  fixed  and  7  determines 
the  velocity  of  the  descent  to  the  minimum  and  the 
precision  in  determining  cd  and  kd.  It  is  easy  to  make 
it  adaptive  in  order  to  decrease  the  overall  computa¬ 
tional  time  of  the  algorithm. 

Finally,  it  has  to  be  stressed  that  in  practical  cases, 
limits  on  the  first  and  on  the  second  derivative  of  the 
input  function,  which  correspond  to  limits  on  the  max¬ 
imum  velocity  and  acceleration  (torque)  provided  by 
the  actuator,  are  generally  considered.  In  some  cases, 
also  the  limit  on  the  third  derivative  (jerk)  is  worthy 
to  be  taken  into  account. 


4  An  illustrative  example 

As  an  illustrative  example,  we  consider  an  elastic 
transmission  with  c0  =  8[kg-s_1],  k0  =  2000[N-m-1] 
and  a  load  mass  m  =  3 [kg],  which  corresponds  to 
a  nominal  frequency  of  25.82[rad-s“1]  and  a  nominal 
damping  ratio  of  0.05.  The  parameter  range  of  uncer¬ 
tainty  is  ±50%,  i.e,  c  €  [4, 12]  and  k  e  [1000, 3000]. 
The  motion  is  from  0[m]  to  l[m].  We  fixed  h  =  2,  so 
that  the  output  function  results  to  be  ( t  €  [0,t]): 


15. 


10. 


We  also  fixed  l  =  2  and  y$ax  =  1.2,  y& L  =  4  and 
Umax  =  5.  The  resulting  values  of  the  OSI  algorithm 
(setting  e  =  0.01  and  7  =  0.001)  are  r*d  =  1.65s, 
cd  =  12[kg*s-1]  and  kd  =  1389.3[N-m_1],  which  corre¬ 
sponds  to  a  maximum  vibration  amplitude  of  5.6-10-4 
for  the  case  where  c  =  c+  =  12[kg-s-1]  and  k  — 
k ”  =  1000[N*m-1].  The  corresponding  input  func¬ 
tion  y(t;T%,  p5)  is  plotted  in  Figure  2  and  its  first  and 
second  time  derivatives  are  plotted  in  Figures  3  and  4 
respectively.  Not  that  for  the  sake  of  clarity  the  input 
function  starts  at  t  =  0.1s.  It  can  be  seen  how  the  con¬ 
straints  are  not  exceeded.  Figure  5  shows  the  motion 
of  the  load  corresponding  to  the  maximum  vibration 
amplitude  over  the  relaxed  uncertain  domain,  i.e.  it 
is  plotted  pj,p3).  Figure  6  reports  the  value 

of  the  maximum  amplitude  vibration  for  different  val¬ 
ues  of  the  system  parameters.  In  order  to  verify  the 
effectiveness  of  the  proposed  approach,  we  also  evalu¬ 
ated  the  amplitudes  of  the  residual  vibration,  for  dif¬ 
ferent  values  of  system  parameters,  in  case  the  input 
function  is  determined  using  a  system  inversion  per¬ 
formed  on  the  nominal  model,  i.e.  y(t;rd,  pJJ)  is  the 
input  with  pd  :=  [ c0,k0]T .  These  results  are  reported 
in  Figure  7.  It  appears  how  the  maximum  value  of 
the  amplitude  of  the  residual  vibration,  attained  when 
c  =  c+  =  12[kg*s-1]  and  k  =  Ar  =  1000[N-m-1]  is 
1.1  •  10-3.  The  output  motion  corresponding  to  the 
maximum  amplitude  of  the  residual  vibration  is  plot¬ 
ted  in  Figure  8.  From  the  above  results  it  appears 
a  neat  improvement  (reduction  of  about  49%  in  the 
worst-case  amplitude  vibration)  of  the  optimal  system 
inversion  based  motion  planning  over  the  nominal  sys¬ 
tem  inversion  based  one. 


5  Conclusions 

In  this  paper  we  proposed  a  new  optimization  proce¬ 
dure  in  the  system  inversion  based  point-to-point  mo¬ 
tion  planning  for  servosystems  endowed  with  elastic 
transmissions.  Once  the  motion  time  has  been  mini¬ 
mized  subject  to  actuator  constraints,  the  system  in¬ 
version  point  is  optimally  chosen  in  order  to  minimize 
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Figure  2:  Optimal  input  function. 


Figure  4:  Second  derivative  of  the  optimal  input  func¬ 
tion. 


Figure  3:  First  derivative  of  the  optimal  input  func¬ 
tion. 


Figure  5:  Worst-case  output  with  the  optimal  system 
inversion. 


the  worst-case  residual  vibration  in  presence  of  para¬ 
metric  uncertainties.  Results  have  shown  the  effective¬ 
ness  of  the  overall  method,  despite  its  simplicity. 
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Abstract 

This  paper  addresses  the  tracking  control  problem  of 
a  robotic  manipulator  with  unknown  and  changing 
dynamics .  A  neuro-control  architecture  including  a 
nonlinear  compensator  using  an  adaptive  neural 
network  is  presented.  The  proposed  scheme  is 
simulated  and  successfully  tested  for  the  path 
tracking  problem  of  a  two-link  planar  arm .  The  fast 
adaptation  capability  of  the  proposed  neuro¬ 
controller  to  changes  in  manipulator  dynamics  is 
satisfactory . 

Keywords:  Robotic  Manipulators,  Trajectory  Control, 
Artificial  Neural  Networks,  Neuro-Adaptive  Control 


1.  INTRODUCTION 

Trajectory  control  problem  of  the  robotic 
manipulators  have  become  increasingly  important 
because  of  their  essential  role  in  today’s  flexible 
automation  and  manufacturing  strategies  [1]. 
However,  manipulators  are  subject  to  structured 
and/or  unstructured  uncertainties  in  industrial 
applications.  Structured  uncertainty  involves 
variations  in  the  manipulator  link  properties, 
unknown  loads  and  inaccuracies  in  the  torque 
constants  of  the  actuators.  Typical  unstructured 
uncertainties  are  unmodeled  dynamics  caused  by  the 
presence  of  high-frequency  modes,  neglected  time- 
delays  and  nonlinear  friction  [2].  Highly  nonlinear, 
complex  and  coupled  dynamics  of  a  robotic  structure 
hinder  the  efficient  use  of  the  well-known  linear 
control  techniques.  Although  an  acceptable  path 
tracking  performance  can  be  achieved  using  the  linear 
controllers,  nonlinear  model  based  controllers  offer  a 
better  efficient  control  technique.  It  has  become 


widely  recognized  that  the  tracking  performance  of 
the  conventional  control  methods  in  high-speed 
operations  is  severely  affected  by  the  uncertainties 
[2].  Thus,  the  need  of  new  control  strategies  capable 
of  dealing  with  highly  complex  and  nonlinear  systems 
even  under  uncertainties  has  appeared.  Towards  this 
end  the  use  of  artificial  neural  networks  appear  very 
attractive. 

Due  to  the  natural  properties  of  the  artificial 
neural  networks,  they  are  ideally  suited  for  path 
tracking  control  process.  They  are  able  to  learn,  to 
approximate  any  function  to  a  high  degree  of 
accuracy  and  to  classify  patterns  [3].  Moreover,  a 
neuro-controller  needs  significantly  less  priori 
knowledge  about  the  system.  Using  these  attractive 
features,  artificial  neural  networks  have  been  shown 
to  be  very  efficient  in  identification  and  control  of  a 
two-link  planar  robot  arm. 

This  paper  presents  a  nonlinear  compensator 
using  an  inverse  neuro-model  which  incorporates  the 
computed  torque  method  implemented  to  a  two-link 
planar  arm.  While  realizing  the  computed  torque 
strategy,  instead  of  using  an  inverse  mathematical 
model  of  the  robot  dynamics,  a  neuro-model  is 
employed.  Before  adopting  the  neuro-model  in  the 
control  loop,  its  weights  are  adjusted  off-line.  Pre¬ 
training  process  of  the  neuro-model  needs  recording 
data  of  actual  robot  arm  operation.  The  pre-trained 
neuro-model  which  is  substituted  in  the  control  loop 
has  obviously  faster  error  convergence  in  responding 
to  an  untrained  trajectory  including  strong 
uncertainties.  Consequently  the  control  architecture 
has  better  trajectory  following  performance.  Thus,  on¬ 
line  adaptation  of  the  network  parameters  enables  the 
control  system  to  react  properly  and  effectively  under 
changing  or  unknown  dynamics. 
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2.  ROBOT  DYNAMICS  MODEL 


The  two-link  planar  manipulator  is  shown  in 
Fig.  1.  The  rigid  body  links  are  jointed  together  with 
two  revolute  joints  and  actuated  by  two  separate 
motors  at  each  joint. 


Fig.  1.  Two-link  planar  arm. 


The  dynamic  equation  of  the  manipulator  is 
given  by: 

t =M(e).e +v(o,d)+G(0)  a> 

where  T  is  the  2x1  vector  of  joint  torques  supplied  by 
actuators,  M  is  the  2x2  positive  definite  manipulator 
inertia  matrix,  6  is  the  2x1  vector  of  joint  angular 
positions,#  is  the  2x1  vector  of  joint  angular 
velocities,  V  is  the  2x1  vector  function  representing 
centrifugal  and  Coriolis  effects  and  G  is  the  2x1 
vector  function  representing  torques  due  to  the 
gravity. 

3.  COMPUTED  TORQUE  METHOD 


Computed  torque  method  is  commonly  and 
efficiently  used  for  the  path  tracking  control  process 
of  a  robotic  manipulator  [4]. 
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Fig.  2.  Computed  Torque  Method 

The  control  law  consists  of  two  parts;  a  linear 
feedback  term  and  a  feedforward  term  concerning  the 
robot  inverse  dynamics.  The  feedback  controller, 
which  is  an  independent-joint  PD  controller  with 
velocity  reference,  plus  the  desired  acceleration, 


sends  its  output  through  the  dynamic  model  (see  Fig. 
2).  This  leads  to  a  corrected  acceleration  which  is 
used  as  an  input  to  the  inverse  dynamics  model  [4]: 

T=R^{d\e,G)  (2) 

where, 

d'  =  9d+Kp(ed-o)+Kd(ed-d)( 3) 

Computed  torque  control  is  a  form  of  control 
method  called  non-linearity  cancellation,  because  if 
the  dynamic  model  is  exact  (Rm=R\  the  nonlinear 
dynamic  perturbations  are  exactly  cancelled.  The  rest 
of  the  system  is  a  decoupled  linear  system  that  can  be 
controlled  according  to  standard  techniques  [4]. 
Fortunately,  while  implementing  the  computed  torque 
method  high  control  gains  are  not  needed  for  a  secure 
stability. 

The  computed  torque  method  has  been  shown  to 
make  the  overall  system  stable  with  a  very  low 
tracking  error  level  if  an  exact  model  of  the  robot 
dynamics  is  available.  However  if  such  a  model  is  not 
available,  the  system  dynamics  have  to  be  adaptively 
identified  in  order  to  achieve  a  feedforward  non-linear 
compensation.  In  such  a  case,  artificial  neural 
networks  (ANNs)  which  provide  the  learning  ability 
can  be  used  to  generate  the  necessary  torque 
commands.  However,  its  performance  especially  in 
high-speed  operations  is  severely  affected  by 
uncertainties.  Adaptive  approaches  may  be  proposed 
to  obtain  more  robust  control  architecture  in  presence 
of  structured  uncertainties. 

4.  NEURO-CONTROL 

This  section  presents  a  nonlinear  compensator 
using  neural  networks  incorporating  the  computed 
torque  method  for  trajectory  control  of  the  robotic 
manipulator.  The  proposed  control  architecture  is 
shown  in  Fig.  3. 

Basically,  the  control  architecture  has  the  same 
structure  as  that  of  the  computed  torque  method 
described  in  section  3.  Instead  of  using  an  inverse 
mathematical  model  of  the  plant  an  inverse 

dynamics  neuro-model  (NN1nv)  is  used  to  compensate 
the  system  non-linearities  as  shown  in  Fig.  3. 


e'd 


Fig.3.  Neuro-controller  based  on  computed  torque 
method. 
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The  ANN  model  which  is  used  to  model  the  real 
inverse  dynamics  structure  of  the  robotic  system  for 
non-linear  compensation  has  a  simple  structure.  It  is  a 
three-layer  fully  connected  feedforward  neural 
network. 

Indeed,  even  with  neural  network  modeling 
identification,  robot  inverse  dynamics  can  not  be 
modeled  including  structured  uncertainties  and 
unexpected  changes  in  properties  of  the  robot  and  its 
environment.  To  make  the  overall  control  system  be 
more  robust,  an  adaptive  control  scheme  must  be 
associated  with  the  present  control  architecture.  An 
on-line  adaptation  of  the  neuro-controller  structure  is 
realized  through  an  on-line  learning  process. 

As  a  result  of  analyzing  feedback  control 
techniques  and  integrating  them  with  the  neuro 
modeling  method,  the  “Feedback  Error  Learning” 
scheme  is  introduced  in  order  to  provide  the 
necessary  information  to  perform  the  on-line  training 
of  the  neuro-controller  [5],  The  “Feedback  Error 
Learning”  scheme  does  not  require  a  teaching  signal 
the  feedback  torque  is  used  as  the  error  signal  for  the 
on-line  training  process  of  the  network.  On-line 
training  process  of  the  proposed  neuro-control 
architecture  uses  basically  the  same  idea  with  an 
additional  but  necessary  scaling  factor,  as  shown  in 
Fig.  3.  The  connection  strength  between  node  from  i 
and  to  nodey,  Wfi  is  changed  by  an  amount  given  by 

AWji  =  TJ.dj.fXOjli,  (4) 

where  tj  is  the  learning  rate,  /  is  the  derivative  of  the 
activation  function,  is  the  weighted  sum  of  the 
outputs  of  the  previous  layer  and  Oj  is  the  output 
signal  of  the  node  j.  Besides  <5,  is  given  by 

dj=K.{Kp(ed-e)+Kd{ed-o)){5) 


robot  motion  [2],  This  permits  the  construction  of  an 
approximate  inverse  neuro-model  of  the  system 
which  is  then  substituted  into  the  control  architecture 
where  the  training  process  will  continue  on-line 
making  use  of  the  feedback  error  signal.  Hence  faster 
adaptation  of  the  neuro-controller  becomes  possible 
and  more  robust  control  strategies  may  be  achieved. 

5.  SIMULATION  RESULTS 

To  analyze  the  various  features  of  this  neuro¬ 
control  system,  the  two-link  robotic  manipulator 
shown  in  Fig.  1  is  simulated.  The  governing  dynamic 
equations  of  the  system  ignoring  the  friction  term  is 
available  in  a  number  of  references  (see  e.g.  1). 

The  two-link  planar  arm  has  been  modeled  as 
two  rigid  links  of  lengths  lj=l  m  and  l 2=0.8  m  with 
lumped  masses  of  rrti=10  kg  and  m2=5  kg.  The 
simulation  has  been  carried  out  using  fourth  order 
Runge-Kutta  algorithm  with  a  sampling  interval  of 
T= 0.005  sec,  while  the  control  time  interval  is  chosen 
as  0.01s. 

The  robotic  arm  is  assumed  to  have  the  physical 
constraints  of  maximum  and  minimum  joint  angular 
positions  values  of, 

e^=no-  &2oljn  =io" 

It  is  assumed  that  the  maximum  and  minimum  torque 
values  provided  by  each  actuator  located  on  each 
revolute  joint  are  as, 

7lmax  =  T2max  =  500kgm2  Is 2 

(8) 

rimin  =  ^”2 min  =  -500%7»2  I S* 


where  K  is  the  matching  gain  for  adjusting  the 
network  output  to  the  real  range  of  actuator  torques 
[1].  The  activation  function  /  is  a  bipolar  sigmoidal 
function  expressed  by 


In  all  the  simulations  the  desired  trajectories  of 
the  joints  are  specified  in  the  joint  space  as  the 
following  sinusoidal  motions  (see  Fig.  4.) : 

01=1.34sin(0. 77 t)  and  62=0.78.cos(0.91t)+1.13 


/(*)= 


2 

1+e-* 


-1 


(6) 


where  A>0  and  determines  steepness  of  the  continuos 
function  fix)  near  x=  ft 

Note  that  the  control  and  learning  are  performed 
simultaneously.  Furthermore,  as  the  on-line  learning 
proceeds,  convergence  of  the  feedback  signal  towards 
zero  is  expected  [2], 

The  neuro-controller  which  identifies  the  robot 
inverse  dynamics  is  trained  first  off-line  using  the 
history  of  input  and  output  recorded  from  the  actual 


The  angular  velocities  and  angular  accelerations 
are  simply  the  derivatives . 

The  end-effector  trajectory  that  corresponds  to 
the  above  desired  angular  trajectories  in  the 
operational  space  is  a  curvilinear  motion  enduring 
about  six  seconds  (see  Fig.  5). 
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Fig.  4.  Desired  angular  trajectories. 


Fig.  5.  Desired  trajectory  of  the  end-effector. 

There  are  two  simulation  types  carried  out.  The 
simulations  type  A  correspond  to  the  well-known 
desired  trajectory  tracking  objective  without  any 
disturbances  and  changing  dynamics.  In  simulations 
type  B,  the  objective  is  the  same  but  the  controller  has 
to  overcome  changing  plant  dynamics  by  adapting 
itself. 

There  is  not  yet  a  well-defined  procedure  for  the 
determination  of  the  optimum  network  architecture 
for  a  given  control  problem.  The  common  approach  is 
the  trial  and  error  method  where  the  goal  is  to  have 
good  learning  with  the  simplest  network  architecture. 
The  neural  network  used  in  the  simulations  consisted 
of  one  hidden  layer  with  25  nodes,  2  nodes  in  the 
input  layers  and  6  nodes  in  the  output  layer  (2-25-6). 

The  simulation  type  A  is  implemented  with  the 
feedback  gains  set  to  kp=200  and  kd=40  for  both 
joints.  Since  the  robot  dynamics  is  nonlinear,  the 
Ziegler-Nichols  tuning  method  is  not  implemented  to 
determine  the  optimum  kp  and  kd  parameters.  Instead, 
the  "best"  kp  and  kd  values  are  determined  by  trial  and 
error  such  that  the  error  is  minimum,  the  response  is 


fast  and  possibly  well  damped.  Large  feedback  gains 
caused  instability  in  the  adaptation  phase  of  the 
controller. 

In  Fig.6.  and  Fig. 7,  the  desired  trajectories,  the 
actual  trajectories  and  the  position  errors  of  joints 
0i  and  02  are  displayed  respectively.  It  is  seen  that  the 
proposed  controller  is  very  effective. 


JOINT  1 


Fig.  6.  Angular  position  0!  for  simulation  type  A. 


JOINT  2 


Fig.  7.  Angular  position  02  for  simulation  type  A. 

Since  the  positioning  errors  are  very  small  they 
are  plotted  separately  in  Fig.  8  and  Fig.  9,  respectively. 

Since  the  neuro-model  is  pre-trained  off-line 
with  the  data  collected  during  the  actual  motion  of  the 
robot  arm,  the  path  tracking  performance  is  achieved 
even  at  the  very  beginning  of  the  simulation.  Figure  8 
and  Figure  9  show  that  the  negligible  positioning 
errors  occur  during  the  whole  duration  of  the  robot 
motion.  It  is  observed  that  the  relatively  big  errors 
occurring  at  the  start  of  the  motion  converge  rapidly 
towards  zero. 
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Fig.  8.  Position  error  0!  for  simulation  type  A 
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Fig  .9.  Position  error  02  for  simulation  type  A 

In  simulation  type  B,  the  robot  arm  picks  up  a 
load  of  5  kg  at  t=2  sec.  In  other  words,  the  point  mass 
at  the  end  of  robot  arm,  m2  is  increased  by  100%.  The 
resulting  plots  of  the  trajectory  tracking  performance 
of  the  joint  positions  0i  and  02  are  displayed  in  Fig.  10 
and  Fig.  11,  respectively.  It  is  observed  that  the 
desired  and  actual  trajectories  agree  extremely  well. 

Since  the  position  errors  0]  and  02  are  very 
small  they  are  plotted  separately  in  Fig.  12  and  Fig.  13. 
It  is  observed  that  overshoots  in  positioning  errors 
occur  at  t=2.  However,  the  on-line  adaptation  feature 
of  the  proposed  controller  become  effective  in 
decreasing  the  positioning  errors  0i  and  02  which 
converge  towards  zero.  This  demonstrates  the 
capability  of  the  controller  to  adapt  to  sudden  changes 
in  dynamics  while  providing  necessary  motor 
commands  to  reach  the  desired  final  position 
following  a  path  close  to  the  desired  one. 


TIME  [SEC] 

Fig.  10.  Angular  position  0!  for  simulation  type  B 
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Fig.  11.  Angular  position  02  for  simulation  type  B. 


The  adaptation  process  of  the  controller  is  fast 
enough  to  give  a  satisfactory  path  tracking 
performance  for  both  simulation  types  and  even  under 
strong  uncertainties.  The  learning  rate  used  during  the 
on-line  training  process  of  the  network  was  set  to  ft.  02 
value.  The  network  weights  are  adjusted  using 
backpropagation  algorithm. 

Note  that  the  significant  success  in  the  tracking 
performance  is  mainly  due  to  the  close  approximation 
of  the  inverse  dynamics  by  the  neuro-model  using 
both  off-line  and  on-line  training  procedures. 
Simulation  results  show  that  the  proposed  architecture 
is  capable  of  tracking  the  desired  path  and  of 
compensating  unstructured  uncertainties. 

6.  CONCLUSIONS 

This  work  presented  a  nonlinear  compensator 
using  an  inverse  neuro-model  for  trajectory  control  of 
a  two-link  planar  robot  arm.  Using  a  two  stage 
training  process,  off-line  and  on-line  learning,  the 
neural  network  is  capable  of  approximating  nearly  the 
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exact  inverse  dynamics  of  the  manipulator  and  of 
adapting  itself  to  the  structured  effects. 


JOINT  1 


Fig.  12.  Position  error  0]  for  simulation  type  B 
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Fig.  13.  Position  error  of  02  for  simulation  type  B. 


Simulation  results  indicate  that  the  neuro¬ 
controller  is  able  to  track  the  sinusoidal  trajectory 
with  a  high  degree  of  accuracy.  The  simulations 
showed  that  better  the  approximation  of  the  inverse 
dynamics,  the  lower  are  the  tracking  errors  since  the 
bounds  on  the  tracking  errors  are  directly  proportional 
to  die  ones  of  die  neuro-model  approximation  errors. 

Another  significant  result  is  the  effective 
adaptation  capability  of  the  proposed  scheme.  It  is 
shown  that  a  sudden  change  in  the  manipulator 
dynamics  can  be  overcome  by  using  die  neuro- 
adaptivc  scheme  which  make  use  of  on-line  NN 
training.  This  is  interesting  for  real-time  applications 
where  manipulator  dynamics  is  often  faced  with 
sudden  changes  due  to  die  parameter  variations,  load 
changes  and  possible  external  disturbances.  As  an 
example  in  this  paper  the  end-effector  mass  was 
doubled  during  the  robotic  motion.  The  controller 


acted  immediately  to  recover  the  sudden  jumps  in 
tracking  errors.  Moreover,  the  error  recovery  time  is 
satisfactory  since  an  off-line  training  of  the  network 
has  been  realized  first  by  using  the  data  collected 
form  the  real-time  motion  of  the  robot  arm.  The 
efficiency  and  the  adaptive  capability  of  the  artificial 
neural  network  controller  based  on  the  computed 
torque  method  has  been  proved.  The  fast  adaptation 
capability  of  the  proposed  neuro-controller  to  changes 
in  manipulator  dynamics  is  also  remarkable. 

The  advent  of  neuro-adaptive  techniques,  with 
optimization  and  adaptive  critics,  might  pose  the  next 
step  in  the  area  of  the  neuro-control  in  robotics 
applications.  On  the  other  hand,  theoretical  studies 
concerning  stability  and  convergence  are  still  in  the 
initial  stages  and  numerous  questions  have  yet  to  be 
answered.  These  titles  are  all  prospective  popular 
research  topics  of  the  future  concerning  neural 
network  controllers  and  their  implementations  to 
robotics. 
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Abstract  -  The  constraints  of  non-holonomic  mobile  robots  are  expressed  as  a  non-integrable 
equations.  For  solving  of  trajectory  generation  problem,  we  have  developed  computer-aided  technique 
by  means  of  a  somposite  analytical  and  heuristic  method  which  is  based  on  geometric  reasoning  and 
provides  deterministic  trajectories  for  all  pairs  of  initial  and  final  configurations  and  positions.  In  the 
cases  of  obstacled  environment  we  have  extended  this  work  by  adding  some  intelligence  to 
automatically  guided  vehicle  /  mobile  robot  via  fuzzy-logic  supervisory  controller.  A  number  of 
simulation  experiments  on  various  PCs  using  this  technique  have  been  done,  and  run  time  takes  only  a 
few  miliseconds.  It  has  become  apparent  that  this  approach  is  quite  attractive  for  application  to  real- 
time  motion  planning  of  factory-floor  AGV/MR. 

Keywords:  Automatically  Guided  Vehicles,  Mobile  robots,  Non-holonomic  Motion  Planning, 
Trajectory  Generation,  Obstacle  Avoidance,  Fuzzy-logic  control. 


I.  INTRODUCTION 

A  number  of  both  scientific,  and 
engineering  and  technological  results  in  the  area 
of  analysis,  simulation  modelling,  design  and 
implementation  of  AGVs/MRs  have  been 
achieved  during  the  last  decade  (e.g.,  see 
references  in  this  paper  and  references  therein). 
However,  non-holoninmic  considerations,  due  to 
their  theoretic  complexity,  only  recently  have 
been  subject  to  both  theoretic  and  applied 
research  (e.g.,  see  [1],  [2],  [7],  [9],  [10],  [12], 
[14]).  In  general,  taking  into  consideration  the 
aspect  of  steering  non-holonomic  systems  by 
definition  and  right  from  the  basic  modelling 
stage  is  of  rather  recent  date  [11],  [10],  [7]. 

The  complexity  of  AGVs/MRs  control 
and  navigation  problems  has  stimulated  various 
approaches.  One  way  of  overcomming  the 
difficulties  encountered  and  of  ammeliorating 
practical,  though  non-analytical,  solutions  to  these 
problems  is  via  the  application  of  fuzzy  systems 
modelling  and  control  as  in  [3]-[6],  [13].  One  of 
analytical  approaches  is  the  use  of  Pontryagin’s 


maximum  principle  optimal  control  to  solve  these 
problems  as  in  [10],  which  appears  to  be  of  more 
academic  importance.  It  turned  out,  however,  the 
analytical  approach,  which  is  based  on 
differential-geometric  formalism  and  on  non¬ 
linear  control  theory  [1],  [2],  [7],  [10],  [14],  to  be 
promissing.  For  it  is  the  solely  one  to  provide  for 
proper  inclusion  of  non-holonimic  aspects. 
Following  our  previous  studies  [8],  [9]  along 
these  lines,  in  this  we  present  a  geometric  based, 
analytic-simulation  technique  resolving  the 
problem  of  concern.  In  the  presence  of  obstacles 
fuzzy  control  becomes  an  attractive  proposition, 
because  such  algorithms  are  flexible,  robust  and 
effective  due  to  machine  intelligence  so  that  can 
adapt  to  almost  environments  within  the  motion 
horison.  Again  following  our  previous  studies 
reported  in  [3],  [4],  we  have  adopted  fuzzy  logic 
control  approach  to  AGV/MR  navigation  for 
obstacle  avoidance  in  a  2D-world  with  predefined 
horison. 


*)  Author  responsible  for  all  correspondence. 
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D.  ON  NON-HOLONOMIC  MOTION 
PLANNING 

Traditional  robot  motion  planning 
considers  equality  or  inequality  constraints  over 
the  configuration  variables.  These  constraints  are 
holonomic,  i.e.  topological  properties  of  the 
configuration  space  are  altered,  or  the  robot  is 
constrained  to  a  lower  dimensional  manifold  of 
the  configuration  space.  •  Non-holonomic 
constraints,  on  the  other  hand,  are  imposed  on  the 
velocities  instead  of  on  the  position  variables  of  a 
system.  A  non-holonomic  constraint  is  expressed 
as  a  non-integrable  equation  involving  derivatives 
of  the  configuration  parameters  and  it  can  not  be 
reduced  to  equality  constraint  on  the  position 
parameters.  Such  constraints  are  expressed  in  the 
tangent  space  at  each  configuration  defining 
allowable  velocities  of  the  system.  For  example 
the  equation  w(x)x  '= 0  expresses  a  linear  velocity 
constraint,  where  x(t)  eFP  represents  configuration 
of  the  system  and  x '  belongs  to  the  tangent  space 
of  the  configuration  space  at  x. 

Consider  a  problem  where  we  wish  to 
move  the  system  from  one  configuration  to 
another.  There  are  many  systems  that  are  not 
capable  to  move  between  two  arbitrary 
configurations  for  the  kinematics  constraints  limit 
the  set  of  reachable  configurations  to  a  space  of 
lower  dimension.  Unlike  such  systems,  we  are 
interested  in  the  cases  in  which  the  kinematics 
constraints  do  not  restrict  the  reachable 
configurations.  (An  example  of  such  a  system  is 
an  automobile  -  front  wheel  drive  cart).  Its 
kinematics  is  constrained  because  the  front  wheels 
can  only  roll  and  spin,  but  not  slide  sideways. 
Despite  this,  we  can  park  a  car  at  any  position  and 
orientation.  It  is  convinent  to  convert  the  problem 
of  finding  a  path  between  two  given 
cohfiguratiohs  subject  to  k  constraints  of  form 
w(x)xf-0  into  the  problem  in  control  theory. 
Namely,  we  are  interested  in  describing  the 
directions  in  which,  by  means  of  control,  we  can 
move  our  AGV/MR  instead  of  those  in  which  we 
cannot. 

There  are  two  different  kinds  of 
techniques  to  the  problem  of  generating 
trajectories  for  AG Vs/MRs  with  non-holonomic 
constraints.  Following  the  first  kind  of 
methodological  techniques,  a  geometric  reasoning 
is  utilizied  to  construct  feasible  trajectories  by 
assembling  arcs  of  simple  curves  (see  Pin  and 
Vasseur,  [14]).  In  contrast,  the  philosophy  within 
the  second  type  relies  on  generating  paths  by 
searching  the  configuration  space  of  the  robot  via 


applying  the  non-holonomic  constraint  as  an 
additional  heuristic  at  every  step  of  the  search. 
Barraquand  and  Latombe  [2],  have  proposed  an 
ellegant  method  along  these  lines  based  on 
hierarchial  bit-map  discretization  and  potential 
field  functions.  Their  method  may  be  generalized 
to  robots  with  large  number  of  degrees  of 
freedom,  however,  its  significan  drawback  is  in 
large  memory  and  computation  time  required,  and 
is  limited  to  bounded  domains  because  search  in 
configuration  space  is  necessary.  Our  analytic- 
simulation  technique  belongs  to  the  first  type  of 
techniques,  and  it  is  extended  with  fuzzy-logic 
control  for  the  case  of  obstacled  environment. 

ffl.  ON  KINEMATICS  OF  A  MOBILE  ROBOT 

“Car-like  mobile  robot”  is  a  front-wheel- 
drive  four-wheel  vehicle  (Figure  1).  The  fact  that 
the  two  rear  wheels  are  not  steerable  and  roll 
without  slipping  on  the  ground  introduces  a 
constraint  on  the  vehicle  motion  expressing  that 
the  velocity  of  the  center  of  the  rear  axle  is 
colinear  with  the  orientation  axis  of  the  vehicle 
base.  This  constraint  is  of  a  non-holonomic  type, 
i.e.  it  is  not  integrable,  and  does  not  allow  a 
closed  form  analytical  resolution  for  the  vehicle 
.  trajectory. 

We  denote  by  L  the  distance  between  the 
2  axes  of  the  wheels,  by  0  the  angle  between  the 
major  axis  of  the  robot  and  the  x  axis  of  the 
absolute  reference  frame,  by  <f>  the  steering  angle 
(i.e.  the  orientation  of  the  front  wheels  with 
respect  to  the  major  axis  of  the  robot),  by  xM,  yM , 
and  VM  the  coordinates  and  the  velocity  of  M,  the 
middle  of  the  axle  of  the  rear  wheels.  We  assume 
that  there  is  no  slipping  of  the  rear  wheels, 
therefore  the  velocity  vector  of  M  is  always 
colinear  with  the  major  axis  of  the  robot: 

(dxfj  dt)  sin  6  -  (dy^f  dt)  cos  6=  0  (1) 

This  equation  is  not  integrable.  It  is  a  constraint 
on  the  velocity  of  the  robot  but  does  not  affect  the 
dimension  of  the  space  of  configurations.  On  the 
other  hand,  for  a  given  configuration,  the  space  of 
achievable  velocities  has  a  dimension  of  only  two. 
These  are  therefore  a  non-holonomic  constraint. 
Another  characteristic  of  the  car-like  robots  is  that 
the  steering  angle  is  constrained: 

\<f>\  <(t>ma,<n/2  (2) 

Let  us  denote  by  N  the  middle  of  the  front  axle. 
Let  Xu  and  yN  be  the  coordinates,  and  VN  the 
velocity  of  N  (see  Figure  1). 
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Fig.l.  Shematic  of  car-like  mobile  robot. 

Since  we  assumed  no  slipping  of  the  front  wheels, 

Kn  =  PvNcos((h-<f>)] 

L.VNsin((fr<j>)  J  (3) 

and  since 

QN  =  OM+MN  (4) 

we  have: 

Vn=  Vm  +  dMN/dt  (5) 

Hence,  we  obtain: 

VN  cos(6+<f>)  =  x  ’M-  0'  *  L  *  sin  0 
VN  sin ( 9+  <j>)  —  y  ’M -  0‘  *  L  *  cos  6  (6) 

Combining  these  two  equations  with  the  non- 


holonomic  constraint,  we  obtain: 

L6‘  =  VNsin  (j> 

(V 

This  equation  can  be  rewritten  as: 

ds/dO  =  L/  sin  (f) 

(8) 

In  here,  s  is  the  curvilinear  distance  along  the 
trajectory.  The  previous  equation  provides  for  R, 
the  radius  of  curvature  of  the  trajectory,  according 
to  the  steering  angle.  Given  the  constraint  on  the 
steering  angle  <f>max,  we  have  an  experssion  for  the 
minimum  radius  of  turn  of  the  vehicle: 

|J?|  —  Rfnin  ~  L/sin  (f>max  (9) 


IV.  GEOMETRIC  TRAJECTORY 
GENERATION 

A  mobile  robot  moving  in  a  2D-world 
with  given  horison,  such  as  a  factory-floor,  has  3 
degrees  of  freedom:  the  coordinates  of  a  given 
point  of  the  robot,  and  the  orientation  of  the  robot 
itself.  A  path  is  a  curve  joining  2  points  and 
providing  a  change  in  the  robot  orientation. 


Changing  the  orientation  of  the  vehicle  can  be 
done  only  when  the  vehicle  is  moving.  To  change 
the  orientation  0  of  the  vehicle  with  a  minimum 
length  of  trajectory,  we  need  to  maximize  the 
curvature  of  the  trajectory.  This  is  achieved  when 
the  vehicle  is  moving  along  a  circle  the  radius  of 
which  is  the  minimum  radius  of  curvature.  The 
problem  is  that  circles  take  a  heavy  toll  on  the 
length  of  the  trajectory:  for  a  long  trajectory,  a 
straight  line  segment  is  really  needed.  Hence  the 
idea  of  combining  arcs  of  circles  and  straight  line 
segments.  Therefore  the  trajectories  are  designed 
the  following  way:  the  robot  moves  along  one  of 
the  circles  related  to  the  initial  configuration  in 
order  to  be  oriented  towards  the  goal,  no  matter 
what  the  final  orientation  must  be.  Then,  it  moves 
along  a  straight  line  in  the  direction  of  the  goal. 


Fig.2.  Example  of  trajectory  joining  two 
configurations  and  positions. 

The  last  part  of  the  trajectory  is  meant  to 
provide  the  desired  final  orientation:  the  robot 
moves  along  one  of  the  circles  erlated  to  the  final 
configuration.  To  be  physically  achievable,  the 
trajectory  must  provide  a  continuous  orientation 
to  the  robot.  Therefore,  the  straight  line  segment 
must  be  tangential  to  the  circles  (Figure  2). 

If  the  distance  between  the  center  of  2 
circles  is  greater  than  the  sum  of  their  2  radii, 
there  are  4  common  tangents  to  the  circles.  Since 
there  are  4  possible  couples  of  circles,  there  are 
16  possible  paths.  The  16  paths  provide  the  right 
orientation  modulo  n.  Only  8  provide  the  desired 
orientation,  the  others  lead  to  the  opposite. 
Among  the  remaining  paths,  the  preffered 
trajectory  is  selected  using  a  criterion,  e.g. 
shortest  length  of  the  path.  If  the  distance 
between  the  2  chosen  circles  is  smaller  than 
2*Rmm,  then  the  circles  have  only  2  common 
tangents  but  the  principle  remains  the  same.  When 
the  initial  and  final  positions  of  the  robot  are  very 
near  each  other,  backing  control  and  maneuvering 
are  often  involved. 
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The  geometric  method  provides  a 
continuous  orientation  to  the  robot  because  the 
circles  and  the  segment  are  tangent.  However,  the 
curvature  of  the  obtained  curve  is  discontinues:  it 
is  a  non-zero  constant  on  the  circles,  and  zero  on 
the  straight  line  segments. 

As  an  example,  we  consider  a  mission  in 
a  2D  world  of  a  non-holonomic  MR  that  has  its 
starting  position  at  point  (0.2,  0.9)  (in  relative 
spatial  units)  and  its  target  position  at  (0.9,  0.2). 
In  addition,  it  is  known  that  MR  has  an  initial 
orientation  of  30  deg.,  and  at  the  end  point  it 
should  take  a  parking  orientation  of  180  deg.  Note 
that,  no  matter  what  is  the  aimed  trajectory  of  MR 
motion,  due  to  non-holonomic  constraints,  velo¬ 
city  coordinate  has  to  change  according  to  a  cont¬ 
rol  law  similar  to  time-optimal  control,  in  order  to 
realize  the  required  orientation  coordinate. 

Our  simulation  software  technique  for 
trajectory  generation  is  written  in  C++  language 
for  a  PC  platform.  The  graphic  device  is  mapped 
into  frame  x=[0,l],  y~[0,l]  by  the  header  file 
64 s  tar  base,  h ”,  and  the  orientation  can  be  given  in 
degrees  [0-360],  The  coordinates  of  position  are 
(de)incrementing  by  0.04,  and  the  orientation  by 
15  degrees  at  each  moving  step. 

V.  OBSTACLE  AVOIDANCE  USING  FUZZY 
CONTROL 

In  the  case  of  obstacled  environment, 
mobile  robot  will  be  treated  as  an  automatically 
guided  vehicle  (AGV)  which  can  operate  in 
camera  mode  and  trajectory  generation  will  be 
based  on  fuzzy  logic  control.  The  work  covers 
situations  in  which  sensor  information  is  either 
incomplete  or  unreliable.  The  predefined  fuzzy 
regions  have 


Speed 


Fig.3.  Example  of  fuzzy  regions  for  AGV  speed 

triangular  form,  and  they  define  three  files 
covering  distance  from  an  object,  orientation  and 
speed.  Clearly,  the  first  two  files  are  directly 
linked  with  obstacle  detection  whereas  the  last 


one  determine  the  strategy  for  steering.  The 
location  of  a  vehicle  may  be  specified  as  a  set 
A={near  distance,  middle  distance,  far  distance, 
no  obstacles}. 

To  obtain  the  required  orientation,  this  is 
concatenated  with  one  of  the  following  set 
B={more  left,  slightly  left,  straight,  slightly  right, 
more  right}.  A  third  set  C  is  defined  to  cover  three 
speeds  C  =  {slow,  middle,  fast}.  The  the 
movement  of  the  vehicle  is  defined  as: 

depending  on  A  move  B  at  speed  C 

One  simple  control  strategy  can  be  defined  as 
follows: 

•  if  no  obstacles,  go  to  the  end  position  fast; 

•  if  there  is  an  obstacle  at  far  distance  more  to 
the  left,  then  go  slightly  right  with  medium 
speed; 

•  if  there  is  an  obstacle  at  middle  distance  more 
to  the  left,  then  go  slightly  right  with  slow 
speed; 

•  if  there  is  an  obstacle  at  far  distance  slightly 
left,  go  more  to  the  right  with  medium  speed; 

•  if  there  is  an  obstacle  at  middle  distance 
slightly  left,  go  more  to  the  right  with  slow 
speed; 

•  if  there  is  an  obstacle  at  far  distance  more  to 
the  right,  go  slightly  left  with  medium  speed; 

•  if  there  is  an  obstacle  at  middle  distance  more 
to  the  right,  go  slightly  left  with  slow  speed; 

•  if  there  is  an  obstacle  at  middle  slightly  right, 
go  more  to  the  left  with  slow  speed. 

It  should  be  noted  that  for  such  predefined 
fuzzy  system  the  following  holds: 

N=(Io . PCf)k  (10) 

where  N  is  number  of  different  situations,  p  is 
number  of  obstacle  angle  fuzzy  regions  and  k  is 
number  of  obstacle  distance  fuzzy  regions.  For 
the  present  work,  N=32768  different  situations 
may  be  accomodated.  However,  it  is  clear  that 
such  fuzzy  rule  base  is  not  efficient  for  control 
strategies.  This  leads  to  the  development  of  a  tree- 
structured  fuzzy  rule  base  with  no  more  than  32 
‘if-than-else’  rules. 


VI.  CONCLUSIONS 

The  use  of  conventional  path  planners  for 
generating  trajectories  for  AGVs/MRs,  and  any 
non-holonomic  system,  easily  cause  incorrect 
results.  Though  the  standard  approach  in  path 
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planning  is  to  use  a  planner  for  finding  a 
trajectory  without  taking  into  account  non- 
holonomic  constraints,  and  than  steering  the 
system  as  close  by  as  possible  to  this  trajectory 
along  an  physically  admissible  trajectory. 

A  more  appropriate  approach  is  via 
simulataneous  consideration  of  the  integrability 
and  the  control-lability  apects  of  non-holonomic 
systems.  The  main  advantage  of  the  geometric 
method  is  in  being  a  deterministic  methodology, 
and  providing  trajectories  joining  all  pairs  of 
configurations  of  AGVs/MRs  in  motion.  The 
method  also  takes  advantage  of  the  forward  and 
reverse  mode  of  motion  of  the  vehicle  as  well  as 
of  the  combination  of  these  on  trajectory  requiring 
maneuvering.  The  purpose  of  application  of  fuzzy 
control  in  obstacled  environment  is  to  realize  a 
control  performance  using  the  experience  of 
expert  (control  rules)  with  ‘if...  then ...  ‘  rules. 

Our  algorithmic  software  structure  can  be 
easily  used  by  an  operator:  simply,  he  has  only  to 
specify  the  desired  final  configuration  of  the 
vehicle.  This  point  along  with  the  fact  that  only  a 
few  miliseconds  of  run  time  on  a  PC  are  necessary 
to  generate  the  trajectories  makes  this 
methodology  quite  attractive  for  real-time 
navigation  and  motion  planning  in  an  obstacled 
and/or  a  priori  unknown  environment. 
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Abstract 

Simple  linear  joint  controllers  are  still  used  in  typical 
industrial  robotic  systems ,  although  there  exists  a  mul¬ 
titude  of  sophisticated  non-linear  control  algorithms. 
The  use  of  these  controllers  leads  to  non-negligible  path 
deviations  for  applications  that  require  high  speed  mo¬ 
tions. 

In  this  paper  a  method  is  proposed  which  reduces 
path  deviations  by  precorrection  of  the  desired  trajec¬ 
tory.  The  corrections  are  calculated  with  respect  to  a 
non-linear  dynamic  model  of  the  robot  and  transferred 
to  the  robot.  The  method  is  applicable  to  typical  in¬ 
dustrial  robots  on  condition  that  an  interface  for  path 
corrections  exists  and  that  joint  angles  and  torques  are 
measurable. 

Experimental  investigations  for  the  6-dof  industrial 
robot  Siemens  manutec-rl5  show  reductions  of  path  er- 
rofs  that  are  much  larger  than  for  linear  precorrection 
which  was  proposed  in  [1,  2].  The  results  exhibit  the 
robustness  of  the  method  with  respect  to  different  in¬ 
terpolation  cycle  times. 

1.  Introduction 

Nowadays  there  is  a  large  gap  between  the  sophisti¬ 
cated  control  algorithms  developed  in  robotics  research 
and  the  simple  control  techniques  used  in  industrial 
robotic  applications.  Typical  industrial  controllers  dis¬ 
regard  the  non-linearities  of  robot  dynamics,  like  fric¬ 
tion  and  couplings  of  the  different  joints.  Therefore, 
robust  independent  joint  controllers  are  implemented 
which  are  sufficient  for  applications  that  require  only 
high  positioning  accuracy. 

Although  such  applications  are  still  the  main  field 
of  industrial  robots’  tasks,  applications  requiring  high 
path  accuracy  —  e.g.  laser-cutting  —  are  becoming 
more  and  more  important.  Many  powerful  control  al¬ 
gorithms,  like  computed-torque  [3]  and  adaptive  meth¬ 
ods  [4,  5],  have  been  developed  in  robotics  research.  By 
these  algorithms  the  tolerances  of  such  demanding  ap¬ 
plications  can  be  complied.  But  they  usually  require 


a  centralized  control  architecture  which  is  not  existent 
in  typical  industrial  controllers  due  to  its  complexity 
and  the  minor  share  of  demanding  applications. 

However,  common  industrial  robots  have  an  inter¬ 
face  which  is  designed  for  handing  over  path  correc¬ 
tions  from  external  sensor  signals.  An  algorithm  is 
proposed  in  [1,  2]  which  uses  this  interface  for  the  re¬ 
duction  of  path  deviations.  Corrections  of  the  desired 
trajectory  are  calculated  with  respect  to  a  linear  model 
of  the  closed  loop  system  and  ’wrong’  trajectories  are 
fed  into  the  controller.  Due  to  the  robot  dynamics 
the  resulting  trajectory  is  closer  to  the  original  desired 
path  than  it  would  have  been  otherwise.  The  snag  with 
this  method  is  that  only  linear  models  of  the  over-all 
system  are  used. 

In  this  paper  another  approach  is  presented  which 
bases  on  a  complete  non-linear  model  of  the  robot. 
It  combines  the  main  ideas  and  advantages  of  the 
computed-torque  method  and  the  mentioned  pre¬ 
correction  algorithm.  Like  in  the  computed-torque 
method  the  inverse  dynamics  of  the  robot  is  used  to 
calculate  the  expected  torques  from  the  desired  joint 
motions.  Since  no  torque  interface  is  available  for  feed¬ 
forward  control,  the  resulting  values  are  transferred 
into  trajectory  corrections  by  an  inverted  controller 
model.  The  robot  model  is  divided  into  two  parts 
which  are  seperately  identified.  For  the  identification 
it  has  only  to  be  assumed  that  joint  angles  and  torques 
are  measurable. 

The  points  in  time  for  the  transfer  of  path  correc¬ 
tions  are  usually  set  by  the  interpolation  cycle.  Experi¬ 
mental  investigations  with  different  cycle  times  exhibit 
the  robustness  and  the  wide  applicability  of  the  pro¬ 
posed  method.  The  achieved  reduction  of  path  devia¬ 
tions  is  impressing.  The  results  are  much  better  than 
for  the  linear  precorrection  scheme. 

2.  Inverse  robot  dynamics 

The  inverse  dynamic  model  of  robots  is  usually  divided 
into  a  gear  and  a  rigid  body  model. 
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2.1.  Gear  model 

Concerning  the  modelling  of  gears  in  robot  dynamics, 
normally  only  the  losses  are  taken  into  account  by  ve¬ 
locity  dependent  friction  torques.  Other  effects  like 
elasticity  and  backlash  are  neglected. 

Often  a  physically  motivated  model  is  used  combin¬ 
ing  dry  friction  and  viscous  damping  [6]: 

t/m  =  «i9i  +  a2sign(gi).  (1) 

Although  this  model  is  sufficient  for  many  applica¬ 
tions,  a  more  precise  model  should  cover  the  degressive 
characteristic  of  friction  for  higher  velocities,  which  is 
shown  in  figure  1  where  the  normalized  over-all  friction 
chacteristics  for  different  axes  of  the  industrial  robot 
Siemens  manutec-rl5  are  plotted.  To  regard  this  the 
inclusion  of  an  additional  term  is  suggested  in  [7]: 

Tf2,i  =  +  fl2sign(^)  -I-  03  arctan(a4^;).  (2) 

A  great  difficulty  in  friction  modelling  for  industrial 
robots  is  the  dependence  on  the  time-varying  operat¬ 
ing  conditions,  especially  on  the  gear  temperature  [7]. 
Therefore  ’warm-up’  motions  are  often  used  to  reach 
stationary  conditions.  Figure  2,  however,  shows  that 
this  is  not  sufficient.  During  a  continous  ’back- an  d- 
forth ’-motion  of  one  gear  the  friction  is  measured  ev¬ 
ery  12  s  at  the  same  angle.  Even  after  a  long  period 
no  stationary  condition  is  reached  and  after  short  in¬ 
terruptions  friction  increases  significantly.  This  shows 
that  not  only  the  temperature  but  also  the  distribution 
of  the  lubricants  has  a  large  influence.  Therefore,  an 
exact  prediction  of  friction  is  almost  impossible.  This 
can  lead  to  errors  not  only  for  correction  but  also  for 
rigid  body  identification. 


Figure  1.  Normalized  friction  characteristics  for  differ¬ 
ent  axes  of  the  Siemens  manutec-r!5. 


2.2.  Rigid  body  model 

The  dynamic  equation  of  the  robot’s  rigid  body  model 
can  be  written  as 

r  =  M(q)q  +  c(q,q)+g(q)  (3) 

t  =  A(q,q,q)p.  (4) 

Equation  (3)  represents  the  usual  form  of  the  dynamic 
equation  with  the  mass  matrix  M(q)  as  well  as  the 
vectors  of  centrifugal  and  Coriolis  forces  c(g,  g),  grav¬ 
itational  g(q)  and  joint  torques  r.  Equation  (4)  is  the 
corresponding  parameter  linear  form.  The  base  pa¬ 
rameter  vector  p  consists  of  the  inertial  and  gravita¬ 
tional  parameters  of  the  links,  e.g.  masses  and  mo¬ 
ments  of  inertia,  and  linear  combinations  of  them.  It 
has  minimal  order  to  guarantee  identifiability  of  all  el¬ 
ements  pi  and  can  be  derived  automatically  for  any 
serial  robot  [8].  For  typical  industrial  robots  dim(p) 
is  relatively  small  because  of  the  symmetric  link  struc¬ 
ture. 

3.  Identification 

The  friction  characteristics  are  assembled  by  measur¬ 
ing  the  friction  torques  at  different  constant  velocities. 
Subsequently,  the  friction  model  2  is  adapted  by  a  non¬ 
linear  optimization. 

There  exists  a  vast  amount  of  literature  on  rigid 
body  identification.  However,  most  of  the  methods 
are  variations  of  the  same  identification  scheme.  The 
robot  is  moved  along  a  trajectory  which  is  optimized 
to  guarantee  maximum  ’excitation’  of  the  parameters. 


Figure  2.  Variation  of  friction  with  time  during  conti¬ 
nous  operation  and  brakes. 
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Joint  motion  and  torque  are  measured  and  friction  is 
compensated  by  a  model  determined  in  advance. 

Here,  a  different  method  is  used.  The  identifica¬ 
tion  is  done  by  a  two-step  algorithm.  In  the  first 
step  the  elements  of  g  and  M  are  determined  for  a 
significant  number  of  different  configurations  of  the 
robot.  In  the  second  step  the  parameters  are  estimated 
by  a  weighted  LS-method.  A  more  detailed  descrip¬ 
tion  of  the  procedure  and  the  application  to  the  robot 
Siemens  manutec-rl5  is  given  in  [9]. 

This  identification  procedure  has  two  main  advan¬ 
tages.  On  the  one  hand  the  implementation  is  easy 
since  no  optimized  trajectories  are  needed.  On  the 
other  hand  no  compensation  of  friction  is  required.  So, 
the  method  is  robust  with  respect  to  lack  of  knowledge 
about  and  variations  of  friction  conditions. 

4.  Trajectory  precorrection 

For  trajectory  precorrection  a  model  of  the  robot  is 
inverted  to  compute  the  position  commands  that  are 
necessary  to  track  the  desired  trajectory.  In  the  next 
section,  the  linear  precorrection  scheme  [1,  2]  is  pre¬ 
sented  for  comparison.  Subsequently,  the  precorrec¬ 
tion  scheme  is  depicted  which  bases  on  the  non-linear 
model  presented  in  the  sections  before. 

4.1.  Linear  precorrection  scheme 

In  [1,  2]  Lange  and  Hirzinger  proposed  the  use  of  lin¬ 
ear  decoupled  impulse  response  models  of  each  joint’s 
closed  loop  for  reduction  of  path  deviations.  The 
model  predicts  the  actual  joint  positions  qi(k)  based 
on  the  position  commands  Ui{k  - 1)  for  /  =  md)*,  ..,mj 

rm 

?<(*)  =  9i(l)ui(k  -  l).  (5) 

l=mdti 

Here  &(Z)  denotes  the  estimate  of  one  impulse  response 
coefficient  and  mdti  the  relative  degree  of  the  system. 
Theoretically,  mi  must  be  infinite  but  for  stable  sys¬ 
tems  the  first  few  gi{l)  are  sufficient.  The  joint  index 
i  is  dropped  for  simplicity  in  the  following. 

The  impulse  response  can  be  estimated  using  the 
LS-method  applied  to  one  short  (500  sampling  points) 
trajectory.  No  additional  knowledge  about  the  struc¬ 
ture  of  the  system  or  the  controller  is  used  during  the 
identification  process.  This  is  an  advantage  on  the 
one  hand  because  the  procedure  can  be  implemented 
on  unknown  systems.  On  the  other  hand  it  can  result 
in  inferior  model  quality  since  available  information 
about  the  controller  and  the  physics  of  the  system  is 
not  taken  into  consideration.  Due  to  its  linearity,  the 


model  of  the  robot  is  incapable  of  incorporating  the 
non-linear  effects  such  as  friction  and  the  couplings 
between  different  joints. 

Given  the  desired  trajectory  qd(k)  for  k  = 
and  letting  qd(N  +  j)  =  qd(N)  for  j  >  1,  it  is  possible 
to  formulate  a  system  of  equations  to  estimate  the  nec¬ 
essary  position  commands  un(k)  for  k  =  1  -  md, ..,  N : 


9{md)  0  \ 

/  Wn(l  ~md)  \ 

g(m) 

V  0  a(m)  ■■■■  g(md)  ^ 

\  V>n(N)  J 

(  ?«(1) 

\ 

(6) 


\  qd{N  +  md)  ) 

Gun  =  qd .  (7) 

It  is  not  possible  to  simply  invert  G  to  calculate 
un  because  the  linear  model  is  not  guaranteed  to  be 
minimum-phase.  That  means  the  zeros  of  the  system 
can  be  outside  of  the  unit  circle  of  the  Z- plane.  In 
that  case,  the  inverted  model  is  unstable  and  the  es¬ 
timated  necessary  commands  un  are  likely  to  contain 
large  variations.  Therefore,  un  is  computed  using  an 
Inverse-Covariance  Kalman  Filter  [10]  for  the  system 


Un(fc  +  1)  =  Eun(k) 

qd(k)  =  Gkti..N+mdUn(k)  +  v(k).  (8) 

Given  the  covariances  of  the  first  estimate  ttn>0  =  qd 
and  of  the  measurement  noise  v(k ),  the  ICKF  com¬ 
putes  the  position  commands.  Small  deviations  of  the 
desired  robot  path  qd  =  qd  +  v  are  accepted  in  order 
to  keep  the  modifications  of  the  position  commands 
Attcor  =  un  —  qd  small  and  thus  to  produce  a  smooth 
corrected  trajectory. 

To  reduce  the  computional  effort,  it  is  desirable  to 
keep  the  number  of  sampling  points  as  low  as  possible. 
Experiments  with  the  testbed  Siemens  manutec-rl5 
have  shown  that  the  method  works  satisfactorily  with 
sampling  times  up  to  AT  =  20  ms,  the  resulting  cor¬ 
rected  trajectory  can  be  transformed  to  the  necessary 
sampling  interval  using  spline  interpolation.  There¬ 
fore,  the  method  is  suitable  for  different  interpola¬ 
tion  cycle  times.  For  the  results  shown  in  section  4.3 
AT  =  10  ms  was  used.  ... 
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Figure  3.  Structure  of  the  complete  robot  system  with  non-linear  trajectory  precorrection. 


4.2.  Non-linear  precorrection  scheme 

Instead  of  using  a  simple  linear  decoupled  model  of  the 
robot  and  the  controller  to  calculate  a  corrected  tra¬ 
jectory,  it  is  possible  to  use  a  model  of  the  closed  loop 
system  based  on  the  inverse  robot  dynamics  and  the 
known  controller  parameters.  The  necessary  torques 
for  all  joints  f  „  for  a  given  trajectory  qa  can  be  esti¬ 
mated  using  equations  (2)  and  (4): 

f„  =  M(qd)qd  +  c(qd,qd)+g(qd)  +  tf(qd) 

=  Mqd>4<h9d)P  +  *‘f(4d)-  (9) 

The  inverted  controller  can  convert  the  desired 
torques  into  deviations  Aucor  F^nirfn  that  have 
to  be  added  to  the  desired  trajectory  to  achieve  max¬ 
imum  path  accuracy.  The  structure  of  the  complete 
system  can  be  observed  in  figure  3. 

The  over-all  structure  of  the  trajectory  precorrec¬ 
tion  resembles  the  well  known  computed  torque  con¬ 
trol  [3]: 

Tact  —  Fcontr(Aucor  +  qd  —  Qact)  (10) 
=  +  F contriQd  ~~  Qact) 

+Fcontr(qd  -  qact). 

The  results  of  the  trajectory  precorrection  are  not 
expected  to  be  as  good  as  the  ones  achieved  with  com¬ 
puted  torque  control,  where  the  controller  can  be  op¬ 
timized  to  reduce  the  remaining  path  errors  that  are 
caused  by  model  errors  and  outside  disturbances.  The 
advantage  of  trajectory  precorrection  is  that  one  only 


Table  1.  Results  of  precorrection  for  trajectory  1,  ver¬ 
tical  circle 


needs  to  convey  path  information  to  the  robotic  sys¬ 
tem  and  no  additional  torque  information.  So,  only  a 
path  interface  is  necessary. 

The  non-linear  model  is  much  more  accurate  than 
the  linear  model  presented  in  the  previous  section  and 
the  resulting  path  accuracy  is  expected  to  be  better. 

4.3.  Results 

The  results  for  the  two  previously  introduced  ap¬ 
proaches  are  presented  for  three  different  trajectories: 

•  Trajectory  1:  vertical  circle,  gripper  vertical,  di¬ 
ameter  40  cm,  0.6  y. 

t  Trajectory  2:  horizontal  circle,  gripper  down,  di¬ 
ameter  40  cm,  0.6  y , 

•  Trajectory  3:  90  °  edge  in  x-y-plane. 

In  order  to  quantify  the  results  of  the  precorrection 
five  measures  are  given:  the  absolute  maximum  carte¬ 
sian  path  error,  the  maximum  cartesian  path  error  for 
the  x,  y  and  ^-direction,  and  the  results  of  the  follow¬ 
ing  criterion  evaluating  the  performance  for  the  whole 
trajectory: 


where  m  is  the  number  of  sampling  points. 

The  tables  1  to  3  show  that  the  path  accuracy  mea¬ 
sured  by  E  is  decreasing  due  to  the  precorrection  for 
all  tested  trajectories.  As  expected  the  results  are  bet¬ 
ter  for  the  non-linear  precorrection  scheme  (reduction 


Table  2.  Results  of  precorrection  for  trajectory  2,  hor¬ 
izontal  circle 
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Table  3.  Results  of  precorrection  for  trajectory  3,  hor¬ 
izontal  edge 


[ 

uncorrected 

linear 

non-linear 

E  [1(T* 

1.43 

0.97 

0.60 

Aabs,max 

mm 

0.99 

0.68 

0.36 

Ax, max 

mm 

0.21 

0.27 

0.22 

Ay)rnax 

mm 

0.98 

0.68 

0.15 

A  z,max 

mm] 

0.49 

0.50 

0.24 

of  at  least  57%)  than  for  the  linear  (reduction  of  at 
least  31%). 

Experiments  have  shown  that  the  main  reason  for 
the  performance  difference  is  the  inability  of  the  lin¬ 
ear  model  to  incorporate  friction.  In  figure  4  mea¬ 
surements  of  joint  2  for  trajectory  1  are  shown.  It  is 
clearly  visible  that  the  linear  model  does  not  compen¬ 
sate  the  changing  sign  of  friction  torque  at  the  maxi¬ 
mum.  Therefore,  its  corrections  are  much  worse  than 
the  ones  of  the  non-linear  model.  Nevertheless,  the  re¬ 
sulting  path  of  the  manipulator  is  closer  to  the  desired 
trajectory  than  without  precorrection. 

In  figure  5  the  break-away  behaviour  of  the  robot 
is  presented  for  trajectory  1  in  cartesian  coordinates. 
Both  models  reduce  the  initial  deviation  impressively 
compared  to  the  uncorrected  path.  Tables  1  to  3  show 
that  the  decrease  of  the  maximum  absolute  error  in 
cartesian  coordinates  for  the  linear  model  is  consider¬ 
able  (at  least  31%  reduction)  but  inferior  to  the  de¬ 
crease  for  the  non-linear  model  (at  least  63%  reduc¬ 
tion). 


xIO"3 


Figure  5.  Results  of  precorrection  for  trajectory  1  in 
cartesian  coordinates,  break-away  behaviour. 


Figure  6  shows  the  absolute  cartesian  error  for  tra¬ 
jectory  2.  It  is  reduced  for  almost  the  whole  length  of 
the  trajectory  except  for  two  points  (0.5  s  and  1.5  s) 
where  the  accuracy  for  the  uncorrected  trajectory  is 
high.  Here  also  can  be  seen  again  (especially  at  0.7  s 
and  1.4  s)  that  the  non-linear  model  precorrection 
is  much  better  than  the  correction  using  the  linear 
model. 

The  tracking  improvements  for  trajectory  3  are  the 
least  significant  for  all  three  trajectories  according  to 
the  3  tables  -  the  maximum  error  in  x-direction  even 


Figure  6.  Results  of  precorrection  for  trajectory  2, 
absolute  error  in  cartesian  coordinates. 
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5.  Conclusion 
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Figure  7.  Results  of  precorrection  for  trajectory  3  in 
cartesian  coordinates. 

increases  slightly  due  to  precorrection.  Nevertheless, 
the  path-accuracy  is  impressive,  which  can  be  seen  in 
figure  7  for  trajectory  3  .  The  linear  model  reduces  the 
deviations  to  a  certain  degree,  while  the  more  sophis¬ 
ticated  model  improves  drastically  the  path  accuracy. 

For  most  industrial  robot  controllers  the  interpola¬ 
tion  cycle  time  and  the  controller  cycle  time  differ  from 
each  other.  The  controller  of  the  Siemens  manutec- 
rl5  generates  path  data  with  ATjpo  =  32  ms  and 
then  interpolates  for  the  6  individual  joint  controllers 
using  A Tcontr  —  2  ms.  The  new  non-linear  precorrec¬ 
tion  scheme  was  tested  for  robustness  with  ATjpo  = 
10  ms,  ATipo  =  20  ms  and  ATipo  =  32  ms.  Even 
though  the  interpolation  leads  to  a  loss  of  path- 
information  especially  at  the  points  of  speed-reverse, 
the  results  presented  in  table  4  show  that  the  method 
is  robust  with  respect  to  the  interpolation  for  all  three 
test  trajectories.  The  path  accuracy  achieved  by  the 
precorrection  only  changes  marginally. 


Table  4.  Results  of  non-linear  precorrection  for  differ¬ 
ent  interpolation  cycle  times 


E  [10“3] 

traj.  1 

traj.  2 

traj.  3 

corrected,  non-linear 

0.37 

0.42 

0.60 

AT  =  10  ms,  uncor. 

1.52 

0.77 

1.44 

AT  =  10  ms,  cor. 

0.36 

0.48 

0.60 

AT  =  20  ms,  uncor. 

1.55 

1.78 

1.43 

AT  =  20  ms,  cor. 

0.39 

0.47 

0.62 

AT  =  32  ms,  uncor. 

1.54 

1.78 

1.44 

AT  =  32  ms,  cor. 

0.40 

0.49 

0.62 

Trajectory  precorrection  is  suitable  for  manipulators 
that  provide  no  access  to  control  algorithms  but  have 
an  interface  for  path  corrections.  In  this  paper  a  tra¬ 
jectory  precorrection  method  is  presented  which  leads 
to  impressive  reductions  of  path  deviations  for  typ¬ 
ical  industrial  robots.  The  method  is  based  on  an 
over-all,  non-linear  model  of  the  robot  which  can  be 
automatically  formulated  and  identified  by  means  of 
the  presented  modelling  and  identification  approach. 
An  experimental  comparison  to  a  linear  precorrec¬ 
tion  scheme  for  the  6-dof  robot  Siemens  manutec-rl5 
shows  that  the  consideration  of  non-linearities  leads  to 
better  results.  However,  the  linear  scheme  combines 
also  remarkable  improvements  with  simplicity.  Fur¬ 
thermore,  it  provides  the  possibility  of  learning  which 
is  not  presented  in  this  paper  so  that  it  is  an  alterna¬ 
tive  to  non-linear  modelling. 
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Abstract 

In  this  paper,  a  new  neural  networks  system  modeling  and 
sliding  mode  suspension  controller  are  proposed.  The 
new  neural  networks  are  an  error  self-recurrent  neural 
networks  which  use  a  recursive  least  squares  method  for 
the  fast  on-line  learning.  The  proposed  neural  networks 
converge  considerably  faster  than  the  back-propagation 
algorithm  and  have  advantages  of  being  less  affected  by 
the  poor  initial  weights  and  learning  rate.  The  controller 
for  suspension  system  is  designed  according  to  sliding 
mode  technique  based  on  new  proposed  neural  networks. 
In  order  to  adapt  sliding  mode  control  method,  each 
frame  distance  between  ground  and  vehicle  body  is 
estimated  and  controller  is  designed  according  to 
estimated  neural  model. 


1.  Introduction 

Neural  networks  have  been  applied  to  various  areas 
including  the  estimation  of  nonlinear  systems  and  control 
application  because  of  being  strong  on  nonlinear 
modeling  [1].  However,  there  are  many  disadvantages 
such  as  local  minima,  long  time  of  learning  and  the  slow 
convergence  rate  in  the  neural  networks  and  back- 
propagation  algorithm [2]  used  in  the  estimation  of 
systems  and  controls.  In  order  to  improve  these 
disadvantages,  many  researches  has  been  reported. 
Representatively  Singhal  and  Wu  incorporated  extended 
Kalman  filter  in  the  BP  algorithm.  However,  the 
computational  complexity  of  this  algorithm  becomes 
intractable  as  the  size  of  multilayer  neural  networks 
increases.  Recently,  Scalero  and  Tepedelenlioglu,  Lou 
and  Perez  derived  another  modified  algorithm[3].  This 
algorithm  divided  neural  networks  into  linear  and 
nonlinear  portion  and  then  apply  Kalman  filter  to  improve 
the  rate  of  convergence  and  also  not  sensitive  to  initial 
weights.  However,  it  is  still  sensitive  to  learning  rates 
because  it  computes  the  desired  values  of  hidden  layer 
based  on  the  BP  algorithm[4]. 


The  key  point  of  this  work  is  to  develop  a  new 
algorithm  which  does  not  require  any  pre-information 
about  the  system  and  architecture.  In  order  to  accomplish 
this  goal,  the  desired  values  of  hidden  layers  are 
computed  by  optimal  method  directly  and  each  weight  is 
updated  by  RLS(recursive  least  square).  Specially  in 
architecture,  self-recurrent  time  delayed  error  as  a  bias  is 
offered.  This  neural  networks  improve  the  disadvantages 
mentioned  above. 

We  also  estimate  7  degree  of  freedom  vehicle  model  in 
realtime;  in  fact  estimate  each  frame  distance  between 
ground  and  car  body  which  is  mentioned  later  and 
develop  an  adaptive  sliding  mode  suspension  controller 
based  on  this  neural  networks. 

2.  Error  self-recurrent  neural  networks 

The  proposed  neural  networks  have  inputs  as 
following;  the  time  delayed  output  of  plant  and  error 
between  plant  and  output  of  neural  networks.  Generally,  a 
bias  or  threshold  is  fixed  to  +1  or  -1  which  just  increases 
or  decreases  the  output  of  each  neuron.  But  there  is  no 
exact  theory  available  yet  how  to  select  this  value 
according  to  system. 

In  this  paper,  in  order  to  solve  this  vagueness  and 
estimate  a  fast  variable  system,  we  suggest  new  model 
like  Figure  lb.  in  which  one-step  previous  error  is 
feedback  into  input,  where  the  weights  have  a  positive 
value 


bias  input  -  +1,  -1,0 


Figure  la.  Conventional  Neural  Network  model 
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Figure  lb.  Error  Self-Recurrent  Neural  Network 


dpjk  ~  f  \xpjk) 

Kk=Pkx/(A  +  xrPkx) 

Pk=(I-KkxT)PkU  (2) 

wl.k  =  wl.k  +  kK  (d*pLk  ~ypLk  +/'0V*(/_I)) 

Wjk  =  Wjk  +  kj  (d*pjk  -  ypIk  +  /J0epjk  ( t  - 1)) 

2.2.  The  estimation  of  a  fast  varying  nonlinear 
system 


Figure  la.  and  Figure  lb.  represent  a  conventional 
neural  networks  and  the  proposed  one,  respectively. 
Because  the  new  neural  networks  described  by  Figure  lb. 
have  error  as  bias  which  is  different  from  Figure  la.  This 
solves  vagueness  and  also  estimates  a  fast  variable  system. 

2.1.  Learning  algorithm 

First,  divide  neural  networks  with  linear  portion  and 
nonlinear  one  and  then  compute  desired  values  of  hidden 
layers  in  the  linear  portion  with  an  optimal  method. 


Linear  portion  1  Nonlinear 

portion 

•  v  >'  ■  ! 

1  1  Linear  portion 

1  |  -wu» 

!  ! 

!  j 
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.  -  * - 

!  1  "uv 

j 

▼  i 
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error  1 
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1  1  error 

l  1 
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1  l 
j  L 

j 

Hidden  Layrcr 

OmpiK  Layrcr 

Figure  2.  Division  of  Neural 
The  learning  algorithm  is  as  follows; 


In  order  to  evaluate  the  performance  of  the  error  self¬ 
recurrent  neural  networks  and  the  learning  algorithm,  the 
test  is  executed  like  shown  in  Figure  3. 


Figure  3.  System  modeling  block  diagram 


The  complex  nonlinear  plant  model  and  input  signal 
are  selected  as  equation  (3) 

w(/)  =  0.5  sin(2;r  /  50)  +  0.5  sin(2  nt  / 1 20) 

y(t)  =  (0.8 5y(t  - 1  )y(t  -  2)  +  0. 1 6u(t  - 1)  +  (3) 

0.25w(/-2))/(l  +  .y2(f-l)) 


Z  A=0 

dplk  ~  f  ](xp!.k  ) 
dE  K 

—  =  0  =>  ^dpu't'u  =  'WijcXj  0) 

°Xj  k~\ 

r  .*  r 

"•i.dpi.  =  «’/.  "'l*j 

:wi. l)~' dpi. ..  if  A'  ^  K 

x)  =(w[«-t)-'w[rft>  if  N<K 

In  the  every  node  of  each  layer,  compute  the  gain 
vector  kfc  and  then  update  weight  vectors  as  follows; 


The  randomized  weights  were  between  0  and  0.5  and 
the  learning  rate  was  selected  as  0.05  for  BP  and  Scalero 
algorithm.  Figure  4.  and  Figure  5.  show  the  result  of  BP 
and  Scalero  algorithm,  respectively.  Figure  4.  and  Figure 
5.  show  that  Scalero  algorithm  converges  faster  than  BP 
algorithm.  Figure  6.  is  the  output  response  of  neural 
networks  trained  by  suggested  algorithm.  It  converges 
faster  than  Scalero  algorithm. 

3.  Real-time  Identification  of  7-DOF  Vehicle 
Suspension  System 

A  vehicle  model  for  simulation  is  shown  in  Figure  7. 
which  has  7  degree  of  freedom.  The  state  of  system 
considered  here  is  heave,  pitch,  roll  motion  and  4  wheels' 
suspension  strokes  due  to  vibration  of  4  wheel  by  road 
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disturbance.  If  we  consider  each  motion,  we  assume  that 
the  order  of  this  system  is  second. 


Figure  7.  7-degree  of  freedom  vehicle  model 

Figure  8.  show  the  block  diagram  for  identifying 
vehicle  suspension  system. 


Figure  4.  Output  by  BP  algorithm  (Bias=+1) 


Figure  5.  Output  by  Scalero’s  algorithm  (Bias=+1) 


Figure  6.  Output  of  proposed  algorithm  (Bias=e(t- 1 )) 


Road  Disturbance 


Figure  8.  The  method  for  Estimating  a  Full  car 

Inputs  of  neural  networks  are  composed  of  1  and  2  step 
time-delayed  plant  output  and  1  step  time-delayed  road 
disturbance.  y(t)  is  the  frame  distances  which  mean 
distances  between  ground  and  car  body,  <?,  +  jt,-  in  Figure 
7. 


Figure  9.  Road  condition(left:0.2,  right:0.1) 
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Frame  distances  should  be  selected  to  estimate  and 
control  suspensions,  because  sliding, mode  control  method 
consider  single  input  and  multi  output  system. 

The  initial  weights  were  selected  between  0  and  0.5 
and  weights  to  error  were  fixed  to  0.05.  Figure  10.,  Figure 
11.,  Figure  12.  and  Figure  13.  show  the  estimation  results 
of  each  frame  distance  when  road  conditions  are  given 
such  as  Figure  9.  The  estimation  errors  almost  converge 
to  zero  at  travel  distance  of  0.5m 


Figure  10.  Front  Left  Frame  Distance  (m) 


Figure  11.  Front  Right  Frame  Distance  (m) 


Figure  13.  Rear  Right  Frame  Distance  (m) 


4.  Adaptive  Sliding  mode  Suspension 
Controller 

Controller  based  on  neural  networks  model  were 
derived  by  A.J.  Morris,  Keyser[7].  But  this  kind  of  neural 
networks  can’t  be  applied  at  real-time  control,  because 
most  control  responses  are  sensitive  to  neural  network 
model  and  a  long  time  of  learning[4]-[7]. 

In  this  paper,  we  developed  adaptive  sliding  mode 
suspension  controller  based  on  suggested  new  neural 
networks.  Sliding  mode  control  method  is  a  kind  of 
robust  control  which  has  a  high  performance  in  spite  of 
model  uncertainty  and  advantages  of  fast  response  feature 
and  including  nonlinear  equations  in  controller. 

The  typical  equation  of  sliding  mode  control  is 

*(">  -f(x)  +  b(x)u.  But  in  this  paper,  because  a 
controller  is  based  on  estimated  neural  networks,  we  just 
make  an  adaptation  that  method. 

The  output  of  neural  networks  is  equation  (4)  which 
describe  system. 

yf{t)  =  F[AyP  (/-!)+  By?  (t-2)  +  Ci,{t-\)  + 

Dw(t-\)  +  Ee(t-\)] 


Figure  12.  Rear  Left  Frame  Distance  (m) 


As  mentioned  above,  we  assume  that  full-car  model  is 
second  order.  So  sliding  mode  controller  could  be 
equation  (5) 

it  -  -  f  +  xj  -  Ax  -  ksat(s  /  (j>) 

f  =  F[Ay'\t)  +  By1'  (/  - 1)  +  6/(0  +  Dn( /)  +  Ee(t)\ 

Xj  =  x,i  =0,  A  =50  (3) 

k  =  F  +  t]  =  lOx  |  y(t)  |2  +0.1,  <j>  =  40 

Since  x  can  not  be  computed  directly,  so  we  must 
estimate  it  as  follows; 
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~(f  1 1)  yQ + j)  -  X 0  (/  +  i)-  yiO 

h  h 

yim  ('  + 1)  =  F[Ayp  (0  +  V  (/  - 1)  +  6/(0  +  (6) 

*■  ZMO +  £<?«] 

Figure  10.  explains  system  identification  and  control  at 
the  same  time 

We  estimate  frame  distances  by  proposed  neural 
networks  in  these  simulations.  Then  design  the  controller 
based  on  suggested  neural  networks  model  which  makes 
4  control  output.  To  evaluate  control  performance,  heave, 
pitch  and  roll  motion  are  monitored  instead  of  each  frame 
distance. 

Road  Disturbance 


Figure  14.  System  control  block  diagram 


Figure  15.,  Figure  16.  and  Figure  17.  show  each  motion 
controlled  by  sliding  mode  controller,  equation  (5)  when 
road  condition  is  given  as  Figure  9.  The  result  of  different 
road  disturbance  such  as  sine  wave,  2  bound  sigmoid,  etc 
was  good,  too. 

5.  Conclusion  and  Consideration 

In  this  paper,  new  recurrent  neural  networks  for 
improving  convergence  rate  which  have  time  delayed 
errors  as  a  bias  is  suggested.  And  the  performance  is 
proved  to  be  effective  to  estimation  of  fast  variable 
system  by  computer  simulations. 

In  order  to  get  this  results  we  compute  desired  values 
of  hidden  layer  directly  instead  of  transferring  desired 
values  by  back  -propagation  and  develop  algorithm  which 
updates  weights  by  RLS  in  online. 

According  to  the  various  testing  results,  new  algorithm 
makes  the  real-time  controller  design  based  on  neural 
networks  easy,  because  the  suggested  algorithm  is  not  so 
sensitive  to  initial  weights,  solves  the  problem  of 
selecting  learning  rate  and  make  a  convergence  fast. 

The  adaptive  sliding  mode  controller  for  vehicle 
suspension  is  developed  based  on  the  suggested  neural 
networks  model.  For  more  reality,  we  estimate  frame 
distances  instead  of  suspension  strokes.  According  to 
computer  simulations  this  controller  was  proved  to  be 
effective  to  suspension  control. 
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Abstract 

This  paper  demonstrates  the  estimation  of  signals  by 
using  a  neural  network  structure  composed  of  cosine 
neurons.  The  building  blocks  of  the  architecture  are 
cosine  components  with  adjustable  amplitude , 
frequency  and  phase.  The  training  procedure  is  based 
on  the  mixture  of  gradient  descent  with  a  method 
utilizing  sliding  mode  control  philosophy.  The 
proposed  use  of  mixed  information  in  training 
dynamics  leads  to  the  minimization  of  the  cost  of 
estimation  as  well  as  the  cost  of  stability.  Two 
application  examples  are  presented  in  the  paper.  The 
first  example  considers  the  reconstruction  of  a  time 
signal  having  finite  frequency  components  in  the 
spectrum.  The  second  example  shows  the 
reconstruction  of  the  frequency  plot  of  a  FIR  filter. 


1.  Introduction 

Signal  estimation  is  an  interdisciplinary  area,  the 
techniques  of  which  are  required  in  most  engineering 
practice.  The  main  issue  in  estimation  theory  is  to 
construct  an  appropriate  estimator,  which  gives 
descriptive  information  about  the  signal  or  system  to  be 
identified.  Some  approaches  dealing  with  signal 
estimation  adopt  methods  in  the  least  squares  sense  [1]. 

Artificial  neural  networks  have  been  studied 
extensively  in  the  context  of  least  squares  [2].  Various 
applications  mentioned  in  the  literature  utilize  the  least 
squares  based  training  procedures.  A  common  property 
of  the  neural  systems  is  their  mathematical  tractability 
in  the  sense  of  evaluating  the  sensitivity  derivatives 
easily.  In  this  study,  a  simple  way  for  the  estimating 
the  spectra  of  unknown  signals  is  proposed  with  the 
introduction  of  a  new  neuron  model,  which  we  call 
cosine  neurons.  The  proposed  estimation  scheme  is 
carried  out  in  time  domain  and  operates  on-line.  The 
use  of  cosine  neurons  provides  an  important 
information  about  the  shape  of  the  frequency  spectrum 
of  the  input  signal.  In  general,  if  there  is  no  closed- 
form  time-domain  equation  of  the  input  signal,  it  is 
considerably  difficult  to  obtain  its  frequency  domain 


representation.  Therefore  a  need  arises  for  the  use  of  an 
intelligent  technique.  The  proposed  approach  has  this 
characteristic  and  the  fact  that  it  can  operate  on-line 
makes  it  especially  attractive. 

At  this  point,  the  mechanism  introducing  the 
intelligence  determines  the  overall  performance  of  the 
estimator.  Typically,  the  methods  using  the  sensitivity 
derivatives  of  the  cost  function  suffer  from  the  shape  of 
the  cost  surface,  which  is  a  multidimensional  surface. 
In  tuning  the  parameters,  the  information  is  extracted 
from  the  partial  derivative  of  the  cost  function  with 
respect  to  an  adjustable  parameter,  along  which  the 
derivative  may  assume  large  values.  Therefore, 
imposing  a  stabilizing  force  on  training  dynamics  can 
eliminate  the  local  unstability  of  gradient  based 
techniques.  This  could  be  achieved  by  the  use  of 
Sliding  Mode  Control  (SMC)  philosophy  in  the 
training  phase. 

Earliest  notion  of  SMC  strategy  was  constructed  on 
a  second  order  system  in  the  late  1960s  by  Emelyanov 
[3].  The  work  stipulated  that  a  special  line  could  be 
defined  on  the  phase  plane,  such  that  any  initial  state 
vector  can  be  driven  towards  the  plane  and  then  be 
maintained  on  it,  while  forcing  the  error  dynamics 
towards  the  origin.  The  concept  introduced  by 
Emelyanov  has  first  been  applied  to  simple  systems  of 
order  two.  Since  then,  the  theory  has  greatly  been 
improved  and  a  well  defined  design  framework  has 
been  established.  The  sliding  line  has  taken  the  form  of 
a  multidimensional  surface,  called  the  sliding  surface 
and  the  function  defining  it  is  called  the  switching 
function.  The  main  advantage  introduced  by  the  use  of 
SMC  approach  is  its  robustness  to  unmodeled 
dynamics  of  the  system  under  control.  In  this  paper  the 
system  corresponds  to  the  training  dynamics. 

Latest  studies  consider  SMC  approach  with 
adjustable  design  parameters  [4-5].  In  [4],  Kaynak  et  al 
demonstrate  that  the  redesign  of  sliding  surface  can 
improve  the  performance  of  the  overall  control 
mechanism.  The  use  of  such  techniques  can  therefore 
offer  a  practical  alternative  for  stable  training  of 
intelligent  systems.  In  [6-8],  Efe  and  Kaynak 
demonstrate  the  distinguished  performance  introduced 
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by  using  Variable  Structure  Systems  (VSS)  analogy  in 
the  training  of  neuro-fuzzy  systems. 

The  organization  of  this  paper  is  as  follows.  The 
second  section  describes  the  problem  and  the 
conventional  solution.  The  third  section  demonstrates 
the  modified  training  procedure.  In  the  fourth  section 
simulation  examples  are  presented.  Conclusions 
constitute  the  last  part  of  the  paper. 

2.  Estimation  by  Cosine  Neural  Networks 

The  approach  presented  in  this  section  is  based  on 
the  matching  of  two  signals  in  time-domain.  A  natural 
consequence  of  this  is  the  similarity  in  the  frequency 
views.  For  this  reason,  an  estimator  composed  of  finite 
number  of  cosine  components  is  constructed  as 
described  by  (1). 

neurons 

xs( 0  =  ZQ  COs(wy  +  pi )  (1) 

/= 1 

where,  xs  is  the  response  of  the  estimator.  C„  w,  and  p, 
denote  the  amplitude,  frequency  and  the  phase  of  the  i* 
unit  respectively.  Let  xd  and  e  denote  the  desired  signal 
and  estimation  error  respectively.  In  order  to  minimize 
the  instantaneous  cost  defined  by  (2),  gradient  descent 
based  update  formula  given  by  (3)  is  employed. 

J  =  \{*d-*s)2=^e2  (2) 


^GD-^GD-ri  (3) 


In  (3),  ^  is  a  generic  parameter  of  the  neuroestimator 
and  pGD  is  the  learning  rate  from  the  interval  (0,1). 
The  subscript  GD  denotes  the  gradient  descent.  Having 
this  in  mind,  the  update  formulas  for  i,h  unit  can  be 
described  in  (4)  through  (6). 


ACj  =  pe  cos(\Vjt  +  pj ) 


(4) 


the  most  crucial  point  in  training  of  a  neuro-fuzzy 
system  is  the  fact  that  the  training  procedure  tries  to 
minimize  the  cost,  which  is  a  function  of  the 
realization  error.  However,  during  the  training  phase, 
there  is  no  force  ensuring  the  parametric  stability  or 
convergence.  The  approach  analyzed  here  demonstrates 
how  a  VSS  based  stabilizing  information  could  be 
incorporated  into  the  learning  strategy  adopted. 

The  parameter  tuning  formula  given  by  (3)  can  be 
approximated  by  a  first  order  system  given  by  (7).  In 
(7),  Ts  denotes  the  sampling  period.  If  the  parametric 
displacement  A<j>  is  defined  as  the  sliding  line  for 
parameter  <j>,  adopting  the  reaching  law  in  (8)  and 
equating  (7)  and  (8)  yields  the  solution  in  (9). 


rte  dx~ 

~Ts~dj 


(7) 


A0  =  -~ ^-tanh(A^)-— ! ^-A<f>  (8) 

*s  Ts 

Ox 

A<f>  =  Ve~  +  Q^  tanh(A$0 + K(j>A<t>  (9) 


For  the  stability  of  the  proposed  solution,  (10)  is  chosen 
as  a  Lyapunov  function.  Parametric  stability  is  ensured 
if  the  inequality  in  (1 1)  holds  true. 


(10) 


Vd  = 


r .  \ 

A(j> 

V  J 


A<f>  <0 


(11) 


If  (7)  and  (9)  are  substituted  into  (11),  the  following 
selection  of  learning  rate  satisfies  the  negative 
definiteness  of  the  time  derivative  of  the  Lyapunov 
function  in  (10).  This  selection  of  p  is  given  by  (12). 


2k 


Awj  =-TjeCjrem  t, —  sin  (w,/ +  /?,•) 


w 


i  J 


(5) 


ri^  =  p  min 


A  <j> 

A  A 

lv 

Nt) 

(12) 


4 Pi  =  -ijeCj  sin(w,f  +  pt )  (6) 

$ 

3.  Variable  Structure  Systems  Based 
Extraction  of  Stabilizing  Information 

The  methods  of  computational  intelligence 
frequently  utilize  the  gradient  based  training 
procedures  for  parameter  tuning.  As  mentioned  earlier, 


where, 


Define  the  following  quantity; 

a4>  =Q<j>tm\\{A^)+K4>A^> 


(13) 


(14) 
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with  p being  a  constant  from  the  interval  (0,1). 

If  this  choice  of  is  used  in  the  approximate  model 
of  training  dynamics  (7),  the  stabilizing  component  of 
parameter  update  formula,  which  is  given  by  (15),  is 
obtained. 

^<kss  =^min|A4|^|)sgn(^)+^  (15) 


of  the  estimated  signal,  which  corresponds  to  In 
seconds  in  time-domain.  Starting  from  the  second 
period,  the  estimated  spectral  view  and  the  desired  one 
become  nearly  indistinguishable. 

In  (18),  the  cost  of  stability  is  formulated.  In  Fig.  4, 
the  time  behavior  of  this  quantity  is  illustrated  whereas 
Figs.  5  and  6  demonstrate  the  realization  error  and  the 
cost  of  stability  without  using  parameter  stabilizing 
information  derived  in  the  third  section. 


At  this  point,  one  might  argue  that  whether  this  rule 
leads  to  the  minimization  of  realization  error  or  not. 
Clearly,  the  rule  described  above  will  enforce  the 
adjustable  parameters  to  settle  down  but  will  not 
minimize  the  cost  in  (2).  An  appropriate  combination 
of  this  rule  and  gradient  technique  can  result  in  the 
minimization  of  both  the  realization  error  and  the 
displacement  magnitude  of  the  relevant  parameter. 
This  mixture  can  be  performed  by  utilizing  a  weighted 
average  as  described  in  (16). 

=  al^<hss  +a2^GD 

(%i  +  (X  2 


In  (16),  aj  and  a2  are  positive  weights  and  determine 
the  influence  of  each  approach  in  the  final  value  of  the 
parameter  change  vector. 

4.  Simulation  Results 

Two  simulation  examples  are  presented.  In  the  first 
example,  a  signal  having  finite  frequency  components 
in  the  spectra  is  reconstructed.  The  second  example 
reveals  the  performance  of  the  proposed  scheme  in 
extracting  the  frequency  plot  of  a  FIR  filter. 

4.1.  Reconstruction  of  a  Signal 


Js  =  X  A  C?  +  Aw?  +  Ap?  (18) 

/=l 

During  the  simulations  of  signal  reconstruction 
example,  following  parameters  are  used  as  the 
simulation  settings. 

Table  1.  Simulation  Settings  for  Signal  Reconstruction 


# Neurons 

5 

a 

0.99 

Vgd 

0.01 

j2 

0.10 

K 

0.10 

Otj 

3.00 

a2 

1.00 

Ts 

10  msec 

4.2.  Extraction  of  Frequency  Plot  of  a  FIR  Filter 

In  this  section,  the  average  power  spectral  density 
(PSD)  plot  of  a  FIR  filter  is  extracted  by  the  use  of  the 
algorithm  presented.  The  filter  has  the  following 
transfer  function. 

H(z)  =  1  +  0. 999z-2  (19) 


In  this  part  the  signal  described  by  (17)  is  reconstructed 
by  the  proposed  structure  and  the  training  procedure. 
The  desired  and  estimated  signals  are  illustrated  in  Fig. 
1. 


xd( o  = 


sin[(2y'  +  l>] 
2  j  +  1 


(17) 


In  Fig.  7,  the  averaged  power  spectral  density  graph  is 
illustrated  for  an  intermediate  step.  As  the  input  signal, 
a  white  noise  sequence  generated  by  Matlab  is  used. 
For  this  case  following  parameters  are  used  as  the 
simulation  settings. 

Table  1.  Simulation  Settings  for  Frequency  Plot 
Extraction 


In  Fig.  2,  the  discrepancy  between  desired  and 
estimated  signals  is  depicted.  For  each  period  of  the 
desired  signal,  a  628-point  Fast  Fourier  Transform 
(FFT)  is  evaluated  and  corresponding  to  each  period, 
results  are  given  in  Fig.  3  where  the  upper  limit  of  the 
horizontal  axis  is  adjusted  such  that  the  nonzero 
frequency  components  are  easily  seen.  In  this  figure, 
the  top  row  is  the  FFT  of  the  desired  signal  (17). 
Subsequent  plots  in  Fig.  3  illustrate  the  628-point  FFT 


#. Neurons 

3 

0 

0.01 

?1gd 

0.90 

jQ 

0.10 

K 

0.10 

aj 

1.00 

a2 

1.00 

Ts 

10  msec 
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5.  Conclusions 

The  method  reported  in  this  paper  demonstrates  that 
the  conventional  training  procedures  for 
computationally  intelligent  systems,  such  as  neural 
networks,  fuzzy  systems  or  methods  adapted  from 
artificial  intelligence  can  be  incorporated  with 
methodologies  leading  to  parametric  stability. 

Variable  Structure  Systems  technique  or  Sliding 
Mode  Control  philosophy  is  one  of  the  methods  which 
is  well-known  with  its  robustness  to  unmodeled 
internal  dynamics  of  the  system  under  investigation.  A 
suitable  combination  of  traditional  training  methods 
with  VSS  technique  can  offer  much  preferable 
solutions  in  the  sense  of  safety.  This  is  apparent  from 
the  comparison  of  estimation  error  trends  illustrated  in 
Figs.  2  and  5,  and  the  cost  of  stability  plots  depicted  in 
Figs.  4  and  6. 

In  this  study,  the  performance  of  the  proposed 
approach  has  been  demonstrated  on  the  estimation  of 
signals  and  extraction  of  frequency  plot  of  a  FIR  filter. 
The  strategy  is  based  on  the  matching  of  signals  in 
time-domain.  Naturally,  if  two  signals  have  similar 
views  in  time-domain,  their  frequency  plots  are  similar. 
The  estimator  studied  in  this  study  comprises  cosine¬ 
like  basis  functions  with  adjustable  parameters.  The 
developed  method  tunes  the  parameters  on-line. 

Various  identification  and  control  oriented 
applications  of  the  proposed  approach  clearly  stipulate 
that  the  method  is  capable  of  eliminating  the  locally 
divergent  behavior  of  gradient  based  training 
approaches.  This  study  demonstrates  that  the  method 
can  also  be  used  in  signal  processing  applications. 
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Figure  1.  Desired  and  Estimated  Signals 
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Figure  2.  Estimation  Error 
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Figure  3.  628-Point  FFT  of  the  Desired  Signal  (top 
row)  and  Time  Evolution  of  628-Point  FFT  of 
Estimated  Signal  Corresponding  to  Each  2n  Seconds 


Figure  4.  Cost  of  Stability  with  VSS  and  Gradient 
Descent  Based  Training  Information  Mixture 


Figure  6.  Cost  of  Stability  without  VSS  Based 
Component  in  Training  Information 
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Figure  7.  Desired  Average  Power  Spectral  Density 
and  Estimated  Average  Power  Spectral  Density 


Figure  5.  Estimation  Error  without  VSS  Based 
Component  in  Training  Information 
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Abstract 

In  this  paper  we  present  an  experimental  evaluation  of 
a  new  type  of  Variable  Structure  Control  with  sliding 
mode  for  robot  manipulators.  The  controller  design  is 
based  on  a  Lyapunov  approach  and  engineering  consid¬ 
erations  about  the  robot  model  are  adopted  in  order  to 
limit  the  chattering  effect.  Experiments  are  performed 
by  means  of  an  industrial  SCARA  robot ,  whose  dy¬ 
namic  model  parameters  are  estimated  according  to  a 
simple  procedure .  Tracking  errors  have  been  evaluated 
accomplishing  different  kinds  of  trajectories  and  the  ob¬ 
tained  results  show  the  valuability  of  the  devised  con¬ 
troller  which  appears  to  be  better  than  other  standard 
methodologies. 

1  Introduction 

Robot  manipulators  are  highly  nonlinear  systems,  tra¬ 
ditionally  controlled  by  assuming  that  the  joints  are 
independent  of  each  other  and  interactions  between 
them  are  considered  as  disturbances.  This  assumption 
is  valid  when  there  is  a  large  reduction  ratio  between 
the  actuator  and  the  joint,  and  when  the  performances 
are  not  too  demanding;  in  this  case  PID  like  con¬ 
trollers  are  generally  sufficient.  However,  the  require¬ 
ments  for  robot  manipulators  nowadays  can  not  be  sat¬ 
isfied  anymore  with  such  approaches.  This  has  lead  to 
the  development  of  new  types  of  controllers  that  take 
into  account  the  joint  couplings.  In  the  model  based 
methodology  [1],  the  dynamic  equations  of  robot  are 
integrated  in  the  controller  as  a  feedforward  term.  The 
implementation  of  such  equations  results  in  a  overall 
linearized  system.  The  linearization  is  efficient  as  long 
as  the  model  used  perfectly  matches  the  robot  system 
[2].  In  practical  applications,  such  an  ideal  situation 
is  not  obtained,  since  the  adopted  model  has  some  pa- 

*  Partial  support  for  this  research  has  been  provided  by 
E.P.S.R.C.  and  M.U.R.S.T. 


rameter  uncertainties  as  well  as  unmodeled  structures. 
To  deal  with  that,  the  controller  must  be  made  adap¬ 
tive  or  robust  so  that  it  copes  with  the  uncertainties 
and/or  mismatches. 

Variable  structure  with  sliding  mode  is  a  robust  control 
methodology  that  can  deal  with  large  disturbances  [3]. 
It  has  gained  popularity  in  the  recent  years,  because 
of  its  versatile  nature  and  inherent  robustness  prop¬ 
erties.  The  scheme  can  be  applied  to  linear  as  well 
as  to  nonlinear  systems  and  it  can  provide  consistent 
responses  in  presence  of  large  uncertainties  and  distur¬ 
bances.  Since  the  first  application  of  variable  structure 
control  with  sliding  mode  to  robotics  by  Young  [4],  the 
number  of  reported  applications  in  this  field  has  kept 
increasing  (see,  for  example,  [5,  6,  7]). 

In  the  initial  control  scheme  used  by  Young  [4],  hier¬ 
archical  methodology  [3],  is  adopted  for  the  controller 
design.  The  sliding  mode  on  each  sliding  surface  takes 
place  in  a  specific  order;  this  requirement  necessarily 
slows  down  the  convergence  of  the  tracking  error.  Fur¬ 
thermore,  to  achieved  this  result,  the  control  laws  are 
complicated  and  do  not  take  advantage  of  the  robot 
dynamic  structure  and  are  therefore  not  very  effective. 
Slotine  [8]  proposes  to  use  a  computed  torque  feed¬ 
forward  term  to  somehow  linearize  the  overall  system 
and  he  compensates  the  uncertainties  of  the  feedfoward 
term  with  a  VSC  element.  The  sliding  manifolds  are 
considered  to  be  independent,  hence  sliding  mode  oc¬ 
curs  on  the  manifolds  in  a  free  order  thereby  increasing 
the  speed  of  convergence  of  the  tracking  errors.  The 
controller  is  designed  so  that  the  sliding  condition  is 
guaranteed  for  each  sliding  surface  [3,  8].  This  requires 
an  estimate  of  the  inverse  of  the  inertia  matrix,  and  re¬ 
sults  in  a  complex  set  of  equations  to  be  satisfied  for 
the  control  law.  The  scheme  has  shown  to  be  effec¬ 
tive  for  trajectory  tracking  of  robot  manipulator,  but 
the  control  effort  is  however  conservative,  which  results 
in  large  chattering.  Slotine  addresses  this  problem  by 
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replacing  the  discontinuous  function  by  a  variable  sat¬ 
uration  one  (referred  as  suction  control  [9]).  The  chat¬ 
tering  problem  is  solved  at  the  price  of  the  robustness 
and  sliding  mode  is  no  more  guaranteed.  Bailey  et 
al  [10]  propose  to  design  a  controller  by  using  a  Lya¬ 
punov  function  that  characterizes  the  overall  tracking 
error  behaviour  of  the  system.  The  resulting  equa¬ 
tions  to  be  satisfied  for  the  controller  design  are  much 
simpler  than  in  Slotine’s  proposal.  With  this  approach 
the  sliding  mode  on  each  sliding  surface  is  however  not 
anymore  guaranteed.  Instead,  sliding  mode  occurs  at 
the  intersection  of  all  the  sliding  surfaces.  The  chatter¬ 
ing  problem  still  occurs  but  in  a  reduced  form.  Bailey 
et  al  tackles  it  by  using  a  saturation  function  instead 
of  the  pure  discontinuous  function. 

Even  if  Slotine  and  Bailey  schemes  depart  from  differ¬ 
ent  perspectives,  the  resulting  controller  structure  is 
similar,  and  both  address  the  chattering  problem  after 
the  controller  designed  has  been  completed. 

The  controller  proposed  in  this  paper  addresses  the 
problem  of  chattering  directly  in  its  design.  The  de¬ 
sign  is  based  on  engineering  considerations  of  the  robot 
manipulator  equations  and  structure.  The  scheme  is 
tested  on  the  ICOMATIC  SCARA  03  robot  manipu¬ 
lator  installed  in  the  Applied  Mechanics  and  Robotics 
Laboratory  of  the  University  of  Brescia. 

2  Variable  Structure  Control 
with  varying  bounds 

The  most  significant  limitation  with  robot  control  al¬ 
gorithms  employing  VSC  with  sliding  mode  is  the  chat¬ 
tering  phenomenon,  which  can  damage  the  overall  elec¬ 
tromechanical  structure,  as  well  as  reduce  the  motion 
performances.  The  general  approach  to  overcome  it  is 
to  replaced  the  switching  function  by  a  smoother  one 
[9,  10,  11].  This  general  approach  however  seriously  al¬ 
ter  the  performances  of  the  controller.  The  controller 
proposed  in  this  paper  addresses  chattering  during  its 
design  stage. 

The  well-known  dynamic  model  of  a  n-joint  robot  ma¬ 
nipulator  can  be  written  as  follows: 

T  =  Af (0)0  +  <7(0, 0)0  +  (7(0)  +  F(0)  (1) 

where  0  is  the  n  x  1  joint  angle  vector,  T  is  the  n  x  1  in¬ 
put  torque  vector  (control  variable),  M(6)  is  the  nxn 
inertia  matrix,  which  is  symmetric  positive  definite, 
(7(0,0)  is  the  nxn  matrix  representing  the  centrifu¬ 
gal,  and  Coriolis  terms,  (7(0)  is  the  n  x  1  vector  of 
the  gravity  terms  and  F(0)  is  the  friction  term.  Note 
that  (7(0,0)  is  defined  such  that  it  verifies  the  skew- 
symmetric  property: 

xt[M(0)  -  2(7(0, 0)}x  =  0  Vx  €  Rn . 


Similarly  to  [10],  the  controller  design  relies  on  a 
pseudo-energy  Lyapunov  function: 

V  =  lsrM5  (2) 

where  S  =  [5i...Sn]T  is  the  column  vector  for  the  slid¬ 
ing  surfaces  where  each  sliding  surface  Si  is  defined  as 
(*  =  1  ,-,*»): 

Si  ~  ( $i  —Odi)  +  A  i($i  —  Odi)  /gX 

=  0j  H-  A  i6i 

where  0*  and  0*  are  the  angular  position  and  velocity 
of  the  i- th  joint,  0di  and  Odi  are  the  desired  position 
and  velocity  respectively  for  the  i- th  joint  and  A*  is 
a  constant  that  defines  the  slope  of  the  z- th  sliding 
surface.  We  consider  the  following  general  controller 
structure: 

T  -  M(0)e  +  6(0, 0)e  +  6(0)  +  F(0)  +  Ts  (4) 

where  (•)  is  the  parameter  estimate,  e  =  (0d  -  A 0), 
which  is  the  velocity  demand  augmented  by  the  posi¬ 
tion  error  and  similarly  e  =  (0d  -  A 0)  represents  the 
acceleration  demand  augmented  by  the  velocity  error 
(note  that  0  =  0  —  0rf).  As  shown  by  Nigrowsky  et  al. 
[12],  the  controller  must  satisfy  the  following  equation: 

-  V  =  ST  [f(0)  +  G{9)  +  (7(0, 0)e  +  M(0)e  +  Ts]  <  0 

_  (5) 

where  (■)  is  the  parameter  error.  Bounds  on  each  ele¬ 
ments  F(0 ),  (7(0),  (7(0,0),  M(0),  e  and  e  in  the  above 
equation  can  be  established  by  considering  their  phys¬ 
ical  natures.  For  a  robot  manipulator,  the  positions 
of  the  joints  are  physically  bounded,  and  since  M  (0) 
and  G(0)  are  functions  of  the  joint  position,  they  are 
also  bounded.  In  a  similar  way,  bounds  on  (7(0, 0) 
and  F(0)  are  established  by  assuming  the  boundness 
of  the  joint  position  and  its  physically  possible  ve¬ 
locity.  Indeed,  the  terms  F(Q),  (7(0),  <7(0,0)  and 
M(0)  can  be  bounded  by  a  constant  term  Ki.  The 
bounds  for  e  can  simply  be  taken  as  a  constant  term 
with  a  velocity  error,  i.e.  k{  +  k2\0\.  Similarly,  e  is 

bounded  by  kf  +  ks\0\.  Hence,  the  inverse  dynamic  er¬ 
ror  | F(0)  +  (7(0)  +  C (0, 0)e  +  M (0)e\i  can  be  bounded 
as  follows  (i  =  1, . . . ,  n): 

|F(0)  +  G(0)  +  C(0,0)e  +  M(0)e |_  <  *n+fcai|0i|+JfeM|0<| 

(6) 

where  ku,  k<ii  and  k^i  are  positive  constant  terms.  Let 
define: 

Ki&X)  =  ku  +  k2i\0i\  +  hi\0i\  (7) 
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Figure  1:  The  SCARA  ICOMATIC  03  situated  in  the 
Applied  Mechanics  and  Robotics  Laboratory  of  the 
University  of  Brescia. 


The  negativeness  of  V  can  be  guaranteed  by  taking 
Tsi  =  -Ki(6i,6i)sgn(Si). 

With  such  bounds  on  the  system  uncertainties,  the 
chattering  magnitude  is  greatly  reduced.  Chattering  is 
however  not  completely  overcome  because  of  the  term 
ku,  and  the  magnitude  of  the  chattering  is  directly, 
proportional  to  the  magnitude  of  ku.  To  overcome 
this  problem,  ku  is  removed  and  k2i  and  k3i  are  slightly 
overestimated  to  compensate  for  the  missing  term.  In 

this  way,  a  problem  occurs  when  6i  and  0l  are  close 
to  zero;  since  the  boundness  of  the  uncertainties  is  not 
anymore  guaranteed,  an  offset  occurs  at  the  regulation 
point  [12].  To  remove  it,  an  integral  action  is  used. 
Hence,  the  control  law  is: 

T=  M(6)e  +  C(6,6)e  +  G(0)+F(0)+ 

K(0,0)sgn(S)+KIj0dt,  (8) 

0 

where 

k(U)=[k1(01,01),  k2(02,o2),  k3(o3X),..]T 

and 

Ki(9i^0i)  =  ku\6i  \  +  k2i\  9{  |. 

The  term  Kj  is  column  vector  of  constant  terms  for 
the  integral  action. 

3  Estimation  of  the  dynamic 
model  of  a  SCARA  robot 

To  test  the  devised  Variable  Structure  Controller  with 
varying  bounds,  we  adopted  a  two  degrees-of-freedom 


SCARA  robot,  which  is  widely  adopted  in  industrial 
environments  for  assembly  tasks.  Specifically,  the 
SCARA  ICOMATIC  03  robot  manipulator  has  been 
employed.  It  has  been  built  by  Icomatic  of  Gussago 
(Brescia,  Italy)  and  installed  in  the  Applied  Mechan¬ 
ics  and  Robotics  Laboratory  of  the  University  of  Bres¬ 
cia.  The  robot,  shown  in  Figure  1,  has  actually  three 
degrees-of-freedom,  but  the  third  link,  which  is  de¬ 
voted  to  move  the  gripper  up  and  down,  has  not  been 
involved  in  the  experiments  since  its  dynamics  is  com¬ 
pletely  decoupled  from  the  others.  The  first  two  links 
are  actuated  by  means  of  two  direct  current  motors 
endowed  with  Harmonic  Drive  reduction  gears  (the  re¬ 
duction  ratio  is  r  =  1/100).  The  positions  of  the  two 
joints  are  measured  by  means  of  incremental  encoders 
(with  a  resolution  of  n  ■  10~5rad/step)  and  the  veloc¬ 
ity  measurements  are  provided  by  tachometers.  The 
torque  applied  to  the  motors  can  be  measured  by  a 
current  test-point  in  the  drive. 

The  robot  is  controlled  by  means  of  an  open  PC-based 
control  architecture  with  QNX  real-time  operating  sys¬ 
tem,  which  assures  a  closed-loop  control  frequency  of 
1kHz.  In  order  to  implement  the  controller,  an  esti¬ 
mate  of  the  dynamic  model  is  required  [13].  Assuming 
that  no  external  forces  and  torques  are  applied,  the 
dynamic  model  of  the  robot  can  be  written  as  follows 
(note  that  no  gravity  terms  are  present): 


'  7i  ' 

mn  m12 

■  01  ■ 

'  M0i)  ' 

.  T2 

r 

[  m21  m2  2 

.  *2  . 

1 

.  h{02)  _ 

+ 


m2g2c0l  +  ml2c0\ 


where  Tx  and  T2  are  the  torques  applied  to  the  links, 
h  and  l2  are  the  lengths  of  the  links,  m,  mi  and  m2 
are  the  masses  of  the  payload  and  of  the  two  links  and 
/i  and  f2  are  the  dynamic  friction  functions  of  the 
two  links.  The  elements  of  the  inertia  matrix  have  the 
following  expressions: 


mu  =  m\2  +  mig2  +  m2r2  +  J  +  JGl  +  JG2; 
mi2  =  m2i  =  rnl2fJL  +  m2g2b  4-  J  +  JG2; 

•m22  =  mil  +  ™2  92  +  J  +  Jg2  - 


where  gi  and  g2  are  the  distances  of  the  centres  of  mass 
of  the  two  links  from  the  related  joint  and  J,  JGl  and 
JG2  are  the  inertia  moments  of  the  payload  and  of  the 
two  links  with  respect  to  the  centre  of  mass.  Finally, 
r  ,  6,  c,  A2,  f. i 2  are  geometric  parameters  that  depend 
on  the  links  position: 

r2  =  +<?|  +  2/102cos(<92); 

&  =  92  +  h  COs(#2); 
c  =  li  sin(02); 

A2  =  /^  -f  /|  -f  2l\l2  cos(02); 

/^2  =  ^2  +  Ji  cos(02). 
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Joint  1 


Taking  into  account  the  presence  of  the  reduction 
gears,  the  dynamic  model  can  also  be  written  with 
respect  to  the  actual  torque  generated  by  the  motors. 
With  this  aim,  some  simplifying  assumptions  can  be 
made.  Since  the  motors  of  the  second  and  third  links 
are  located  at  the  end  of  the  first  and  second  link  re¬ 
spectively,  the  masses  of  the  first  and  second  link  can 
be  considered  as  lumped  at  the  end  of  the  same  links 
(i.e.  g\=l\  and  g2  —  I2  and  the  values  of  Jq  1  and  Jq 2 
can  be  neglected).  Then,  considering  also  li  =  l2  =  l 
we  have: 


Tml 

T7ji2 

Ji  -f  r2mn 
r2mi2 


T2m12 

J2  +  r2m22 


+rm2k 


^ml 

@ml 

+ 

#ml 

0 

6  m2 

@ml 
@m2 

fl  0ml) 


(10) 

where  Tmi  is  the  torque  applied  to  the  i-th  motor,  6mi 
and  6mi  are  the  velocity  and  acceleration  respectively 
of  the  i-th  motor,  Ji  is  the  sum  of  the  known  inertia 
moments  of  the  2-th  motor  and  the  2-th  reduction  gear 
(2  =  1,2),  and: 


mu  =  Triil2  -f-  m2r2; 
mi2  =  m2lb] 
m22  =  m2l2; 
r 2  =  2/2(l  +  cos#2); 
b  =  1(1  +  cos02); 
c  =  l  sin  62  • 


The  first  estimation  that  can  be  performed  is  the  one 
regarding  the  dynamic  friction.  From  equation  (10), 
it  appears  that  the  inertia  effects  can  be  set  to  zero 
by  simply  moving  the  joints  at  a  constant  velocity  (i.e. 
the  acceleration  is  zero).  Moreover,  setting  to  zero  the 
velocity  of  a  single  joint,  the  dynamic  term  represent¬ 
ing  the  Coriolis  and  centrifugal  torques  is  cancelled,  so 
that  only  the  term  regarding  the  friction  of  the  other 
joint  is  present.  The  above  functions  fi  (2  =  1,2)  for 
the  two  joints  have  been  numerically  evaluated  by  ex¬ 
ecuting  different  motions  of  single  joints  with  different 
constant  velocities  and  measuring  the  torque.  The  re¬ 
sulting  numerical  values  have  been  subsequently  inter¬ 
polated  by  polynomial  functions. 

Once  the  friction  functions  have  been  estimated,  the 
approximate  values  of  the  inertial  parameters  can  be 
determined.  The  method  consists  of  executing  differ¬ 
ent  experiments  in  which  the  two  links  are  moved  sep¬ 
arately  with  piecewise  constant  accelerations,  so  that 
C(6,0)6  =  0  and  the  only  unknown  values  in  equa¬ 
tions  (10)  are  the  elements  of  the  inertia  matrix.  Since 
the  element  mu  depends  on  the  position  of  the  second 
link,  it  appears  that  the  motor  torque  Tm  1  depends  on 
the  position  of  the  second  link  too.  For  this  reason,  we 
fixed  62  =  0,  evaluating  Jri  =  Ji  +  r2(m^2  +  4m2Z2). 


Time  [s] 


Figure  2:  Estimated  and  actual  torque  for  the  joint  1 
motion. 


Joint  2 


Figure  3:  Estimated  and  actual  torque  for  the  joint  2 
motion. 

Note  that  for  the  motor  of  the  second  joint,  the  value 
of  Jr2  =  J2  +r2m22  does  not  depend  on  the  position  of 
the  first  one.  From  the  experimentally  obtained  values 
of  Jri  and  Jr2,  it  is  easy  to  determine  the  values  of  m\ 
and  m2  and  consequently  the  values  of  the  elements 
of  the  matrix  representing  the  Coriolis  and  centrifugal 
torques. 

To  see  how  accurate  the  obtained  model  is,  in  Figures  2 
and  3  are  reported  the  estimated  and  the  actual  torque 
for  a  point-to-point  motion  followed  by  a  circular  one, 
which  covers  a  large  portion  of  the  robot  workspace.  It 
appears  that,  in  spite  of  the  simplifying  assumptions, 
the  obtained  model  is  accurate. 

4  Results 

Various  trajectories  have  been  used  to  evaluate  the 
tracking  error.  Specifically,  a  point  to  point  motion 
from  {0\,6\)  =  (0,0)  to  (0f,02)  =  (7r/2,7r/2)  in  2s  has 
been  performed  on  the  robot,  with  a  piecewise  con¬ 
stant  acceleration  profile  (see  Figure  4).  Then,  a  lin¬ 
ear  trajectory  in  the  Cartesian  space  from  (a:i  ,3/1)  = 
(“0.3,0.54)  to  (x2,y2)  =  (0.6,0.16)  has  been  accom¬ 
plished  in  4s  (see  Figure  .  5).  Finally,  a  circular  tra¬ 
jectory  in  the  Cartesian  space,  centered  in  ( xc,yc )  = 
(0,0.5)  and  with  radius  equal  to  0.1m  has  been  per¬ 
formed  in  4s  (see  Figure  6).  It  has  to  be  observed  that 
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Figure  4:  Cartesian  space  representation  of  the  point- 
to-point  trajectory,  to  be  accomplished  in  2s. 

these  trajectories  require  different  torque  levels. 
Regarding  the  tuning  of  the  parameters  ku,  k2i  and 
kji  (i  =  1,2),  in  order  to  assure  a  low  tracking  po¬ 
sition  error,  the  value  of  ku  has  to  be  much  greater 
than  the  other  two  values.  For  each  trajectory,  the 
maximum  and  the  mean  absolute  position  tracking  er¬ 
ror  of  the  end-effector  has  been  evaluated.  The  end- 
effector  position  has  been  determined  by  applying  the 
direct  kinematics  to  the  joint  positions  and  therefore 
the  presence  of  elasticities  in  the  mechanical  structure 
and  calibration  errors  have  not  been  taken  into  ac¬ 
count.  Note  that,  according  to  the  ISO  standard  [14], 
the  tracking  error  for  the  linear  and  circular  trajec¬ 
tories  have  been  calculated  as  the  minimum  difference 
between  the  real  end-effector  position  and  the  reference 
trajectory.  Furthermore,  an  additional  parameter,  in¬ 
spired  by  the  concept  of  accuracy  and  repeatibility  [14], 
has  also  been  evaluated:  it  is  defined  as 

Ei  —  e-\-  3ae 

where  e  is  the  average  value  of  the  absolute  error  and 
<re  is  the  standard  deviation  of  e.  Results  are  shown 
in  Table  1 .  It  has  to  be  stressed  that  for  all  the  exper¬ 
imented  motions,  the  resulting  final  error  is  zero. 

It  appears  that  in  general  the  adopted  controller  as¬ 
sures  a  low  tracking  error,  since  its  maximum  value 
for  the  linear  and  circular  trajectory,  where  the  track¬ 
ing  is  of  major  concern,  is  about  0.1mm  and  the  mean 
absolute  error  is  not  much  greater  than  0.01mm;  hence 
it  is  close  to  the  maximum  performance  achievable  by 
the  encoder  resolution  (note  that  an  error  of  one  en¬ 
coder  step  for  each  joint  corresponds  to  a  maximum  er¬ 
ror  on  the  end-effector  of  0.01mm).  To  better  evaluate 
the  achieved  performances,  it  is  interesting  to  compare 
them  with  the  ones  obtained  by  the  typical  computed- 
torque  controller  [15]  and  the  typical  PID-based  con¬ 
troller,  which  consists  of  two  nested  loops:  the  inner 
(which  is  analogically  built  in  the  drive),  is  devoted  to 
control  the  velocity  of  the  joint  and  the  outer  (imple- 


Figure  5:  Cartesian  space  representation  of  the  linear 
trajectory,  to  be  accomplished  in  4s. 


Figure  6:  Cartesian  space  representation  of  the  circular 
trajectory,  to  be  accomplished  in  4s. 

mented  digitally  in  the  PC)  is  devoted  to  control  the 
position.  For  these  two  controllers,  results  obtained 
performing  the  circular  trajectory  are  shown  in  Table 
2.  It  turns  out  that  the  variable  structure  control  as¬ 
sures  overall  better  performances  with  respect  to  the 
standard  methodologies. 

Regarding  the  chattering,  it  comes  out  that  no  par¬ 
ticularly  significant  vibrations  are  introduced  by  the 
controller.  To  verify  it,  the  torque  signal  relative  to 
the  first  joint  during  the  linear  trajectory  has  been  di¬ 
rectly  read  from  the  current  test  point  of  the  drive  and 
not  filtered.  It  has  been  plotted  in  Figures  7  and  8  for 
the  VSC  and  the  PID-based  controller  respectively.  It 
has  also  to  be  stressed  that  chattering  can  be  reduced 
by  detuning  the  sliding  mode  controller  and  allowing 
a  larger  tracking  error. 


Trajectory 

max 

mean 

Ei 

point-to-point 

0.253 

0.048 

0.023 

linear 

0.081 

0.016 

0.056 

circular 

^0.104 

0.019 

0.069 

Table  1:  End-effector  tracking  errors  (in  [mm])  for  the 
different  experimented  trajectories. 
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Controller 

max 

mean 

Ei 

PID 

0.430 

0.132 

0.526 

computed-torque 

0.172 

0.039 

0.102 

Table  2:  End-effector  tracking  errors  (in  [mm])  for  the 
PID-based  and  computed- torque  control  laws. 


timafs] 

Figure  7:  Torque  signal  for  the  variable  structure  con¬ 
troller  performing  the  linear  trajectory  (joint  1). 


5  Conclusions 

In  this  paper  we  have  presented  an  experimental  eval¬ 
uation  of  a  new  type  of  variable  structure  controller  for 
robot  manipulators.  Engineering  considerations  are 
used  in  the  design  of  the  controller,  which  is  based 
on  a  Lyapunov  approach,  and  to  limit  the  chattering 
effect  the  bounds  are  made  variable.  The  tracking  er¬ 
rors  obtained  for  the  different  trajectories  by  means 
of  an  industrial  SCAR  A  robot  show  the  effectiveness 
of  the  devised  controller.  It  is  also  observed  that  the 
obtained  results  outperformed  standard  control  algo¬ 
rithms,  without  significantly  increasing  the  chattering. 
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Abstract 

In  this  paper,  Gaussian  Neuro  Sliding  Mode  Controller 
(GNSMC)  is  proposed  to  overcome  difficulty  in  the 
calculation  of  the  equivalent  control  In  sliding  mode 
controllers,  a  whole  knowledge  of  the  plant  dynamics  is 
required  for  this  purpose.  In  the  GNSMC,  Gaussian 
function  based  adaptive  RBF  neural  network  is  used  to 
compute  the  equivalent  control  signal  of  sliding  mode 
controller.  The  performance  of  the  proposed  schemes  is 
verified  by  simulation  results  on  two  dof  direct  drive 
manipulator.  Simulation  studies  show  that  the  novel 
approach  ensures  good  trajectory  following  performance 
with  payload  variations  and  eliminates  chattering  without 
need  a  priori  knowledge  about  plant.  These 
characteristics  make  the  proposed  control  technique  an 
attractive  solution  for  motion  control  applications 


1.  Introduction 

Variable  structure  control  (VSC)  with  sliding  mode 
control  (SMC)  is  a  special  type  of  control  technique  that 
is  capable  of  making  a  control  system  very  robust  with 
respect  to  system  parameter  variations  and  external 
disturbances  [1].  In  addition,  the  technique  provide  good 
control  performance  and  an  easy  way  to  design  the 
control  law  for  a  wide  spectrum  of  system  types  including 
linear  systems,  nonlinear  systems,  multi-input/multi¬ 
output  systems,  discrete-time  models,  large-scale  and 
infinite  dimensional  systems,  and  stochastic  systems. 
Because  of  these  properties,  today,  research  and 
development  continue  to  apply  SMC  to  a  wide  variety  of 
engineering  systems  [2]. 

The  first  disadvantage  of  SMC  is  the  chattering,  which  is 
the  high  frequency  oscillation  of  the  controller  output. 
The  second  one  is  the  difficulty  in  the  calculation  of  the 
equivalent  control.  A  thorough  knowledge  of  the  plant 
dynamics  is  required  for  this  purpose. 

In  the  literature,  classical  and  intelligent  approaches  are 
used  to  overcome  these  problems.  The  first  classical 
approach  is  the  use  of  a  saturation  function  instead  of  the 
sign  function  to  eliminate  chattering  [3].  The  second  one 


is  the  use  of  an  averaging  filter  for  avoiding  the  difficulty 
in  the  calculation  of  the  equivalent  control  [4]. 

The  intelligent  approaches  are  combinations  of  soft 
computing  technologies  such  as  fuzzy  logic  or  neural 
network  (NN),  and  SMC  theory.  The  first  idea  is  based  on 
fuzzy  adaptation  scheme  for  tuning  sliding  mode 
controller  parameters  in  real  time.  These  schemes  avoid 
chattering  without  the  need  of  off-line  tuning  of  controller 
parameters  and  improve  control  performance  of  SMC  for 
two  degree  of  freedom  (dof)  direct  drive  manipulator  [5]. 
The  second  idea  is  NN  controller  with  learning  rule  based 
on  sliding  mode  algorithm  for  calculation  of  unknown 
part  of  the  equivalent  control  in  presence  of  the  plant 
uncertainties  [6]. 

Last  intelligent  approach  in  the  SMC  literature  is  a 
synergistic  combination  of  NN  and  sliding  mode  control 
to  be  able  to  obtain  a  robust  neuro-controller  with  less 
system  knowledge.  They  used  a  feed-forward  NN  to 
compute  the  equivalent  control  term  in  sliding  mode. 
Thus,  the  requirements  for  the  knowledge  of  dynamics 
and  parameters  are  eliminated  in  the  computation  of  the 
equivalent  control  [7]. 

Radial  base  function  neural  networks  (RBFNN)  have 
recently  played  a  significant  role  in  the  NN  domain. 
Many  researchers  are  working  on  the  theoretical 
investigation  of  this  type  NN.  On  the  other  side,  it  has 
potential  to  classification,  time  series  prediction  and  also 
intelligent  controller  applications.  The  main  reasons  for 
this  result  are  universal  approximation  ability,  but  also 
have  the  so-called  “best  approximation”  property, 
comparative  simplicity  and  computational  efficiency  [8]. 

In  this  paper,  a  new  intelligent  approach  to  sliding  mode 
controller  for  n-link  industrial  robot  manipulators  is 
developed.  In  this  approach,  Gaussian  function  based 
adaptive  RBFNN  to  compute  the  equivalent  control  signal 
of  sliding  mode  controller  is  used.  The  performance  of  the 
proposed  schemes  is  verified  by  simulation  results  on  two 
dof  direct  drive  manipulator.  These  studies  show  that 
adaptive  RBFNN  based  sliding  mode  controller  avoid 
chattering  without  need  a  priori  knowledge  about  plant, 
ensure  a  fast  system  response  and  fast  convergence  of 
system  states  to  the  sliding  surface  and  good  trajectory 
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following  performance  with  payload  variations.  These 
characteristics  make  the  proposed  control  technique  an 
attractive  solution  for  trajectory  control  of  robot 
manipulators. 

The  organization  of  the  paper  is  as  follows.  In  Section  II, 
the  classical  SMC  design  and  the  base  of  our  proposed 
controller  are  presented.  In  section  III,  the  structure  of  the 
Gaussian  Neuro  Sliding  Mode  Controller  (GNSMC),  the 
structure  and  the  on-line  weight  adaptation  algorithm  of 
RBFNN  are  discussed  in  detail.  In  section  IV,  the 
performance  verification  of  the  GNSMC  system  is 
presented  by  extensive  simulation  studies  on  a  direct 
drive  two  dof  SCARA  type  manipulator.  The  paper 
concludes  with  the  presentation  of  some  typical 
simulation  results  obtained  for  the  control  of  a  direct  drive 
SCARA  type  robot. 

2.  Design  of  sliding  mode  controller 

The  control  mechanism  will  be  derived  for  a  nonlinear, 
non-autonomous,  multi-input  multi-output  system  in  the 
following  form  : 

m 

x,w=f,  (*,*)+ 3>„(x,o«j  a) 

j= 1 

where  the  vector  U  of  components  uj  is  the  control  input 
vector  and  the  state  X  is  composed  of  the  jq’s  and  their 
first  (arl)  derivatives.  Such  systems  are  called  square 
systems  since  they  have  as  many  control  inputs  uj  as 
outputs  to  be  controlled  xr-  [3].  The  system  can  be  written 
in  a  more  compact  form  as  letting 

*4s  ^  . Jta  4  ....tl  U  =  k  .  UmJ  (2) 

and  the  system  equation  becomes, 

X(t)  =  F(X,t)  +  B(X,t)U(t)  (3) 

where  the  system  state  vector  is  X  £  91” ,  control  signal 

vector  is  U  e  ,  t  is  time  and  B  is  (nxm)  input  gain 
matrix.  The  aim  is  to  drive  the  states  of  the  system  into 
sliding  surface  S  (mxl)  defined  by 

S«{X:a(X,f)“fl*f)-ff.(X)  =  0}  (4) 

where,  (p(t)  =  GXd(t )  is  the  time  dependent  part  of  the 
sliding  function  cr(X,f)and  consists  of  desired 
(reference)  state  vectors  Xd  and  slope  matrix  of  sliding 
surface  G  (mxn).  cra(X)  =  GX(t)  denotes  the  state 
dependent  part  of  sliding  function  cr(Xj) . 

The  derivation  of  the  control  involves  the  selection  of  a 
Lyapunov  function  V(S)  and  a  desired  form  V  the 


derivative  of  Lyapunov  function.  The  selected  Lyapunov 


function  is 

V  -—aT<y 

(5) 

2 

Therefore, 

V  =  cTa 

(6) 

It  is  desired  that 

V--aTD  sign(o) 

(7) 

thus,  equation  (7)  equals  to  (8) 

ot6  =  -otD  sign(o) 

(8) 

where  D  is  (mxm)  positive  definite  diagonal  gain  matrix. 
Thus,  the  derivative  of  the  Lyapunov  function  will  be 
negative  definite  and  this  will  ensure  stability. 

The  expression  for  the  derivative  for  the  sliding  function 
is 


6  =q>-^X=  <p-G(F(X,t)  +  B(X,t)U(t))  (9) 

and  by  putting  (9)  into  (8),  the  sliding  mode  control  input 
signal  can  be  obtained  as  follows, 

U ( t )  =  Ueq (f)  +  (GB)-'  Dsign(a)  =  Ueq (f )  +  A U (/)  (10) 

where  U  (t)  is  the  equivalent  control  and  it  is  written  as, 

Ueq{t  )  =  -{GBT\GF(X,t)-<p)  (11) 

and  A  U(t)  is  the  additional  control  term  and  it  is  written 
as, 


A  U(t)  =  (GB)'1  Dsign(a)  =  Ksign(o)  (12) 

The  equivalent  control  equation  (11)  and  the  additional 
control  term  equation  (12)  are  starting  point  to  design 
Gaussian  function  based  adaptive  RBFNN  for  computing 
the  equivalent  control  signal  of  sliding  mode  controller 
(SMC). 

3.  Gaussian  neuro  sliding  mode  controller 
(GNSMC) 

The  equivalent  control  equation  (11)  contains  F(X)  and  B 
system  matrices.  Therefore  this  is  difficult  to  compute 
exact  value  of  the  equivalent  control  without  strong  a 
priori  knowledge  about  the  system.  In  other  words,  a 
whole  knowledge  of  the  system  dynamics  (or  inverse 
dynamics)  and  the  system  parameters  is  required  to  be 
able  to  compute  the  equivalent  control.  This  is  actually 
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very  rare  in  practice.  To  be  able  to  solve  this  problem,  one 
can  use  a  RBFNN  to  compute  the  equivalent  control. 

In  the  proposed  structure,  the  equivalent  control  term  in 
sliding  mode  control  is  computed  by  an  adaptive  RBFNN. 
The  additional  term  in  sliding  mode  is  also  computed  and 
summed  with  the  output  of  the  RBFNN  to  form  the 
control  signal.  The  output  of  the  secondary  controller  is 
accepted  as  a  measure  of  error  to  update  the  weights  of 
the  RBFNN.  The  overall  system  with  the  proposed 
controller  is  given  in  Fig.  1. 


Figure  1.  Illustration  of  the  structure  of  the  proposed 
GNSMC  system 

The  structure  of  RBFNN  is  shown  in  figure  2  and  consists 
of  only  three  layers.  The  first  layer  is  connection  to  the 
desired  and  the  actual  states.  The  second  layer  is  the  only 
hidden  layer.  It  performs  non-linear  transformation  from 
the  input  vector  space  to  the  internal  vector  space.  The 
last  layer  is  the  output  layer  that  equals  to  the  number  of 
inputs  of  the  robot  and  transforms  internal  vector  space 
into  an  output  one  in  linear  manner.  Neurons  of  the 
hidden  layer  are  radial  base  functions.  The  type  of  these 
functions  is  theoretically  unimportant  and  Gaussian 
functions 

ilMI2 

h{l)  =  e2  *  A  >  0  (13) 

are  used.  Neurons  in  the  output  layer  calculate  linear 
function 

z 

fi(I)  =  wi0b  +  2)  WijhjU)  i  =  l,..m  (14) 

j=i 

where  index  i  denotes  output  neuron  /,  index  j  runs 
through  all  the  hidden  layer  neurons,  /  is  an  input  vector, 
c  is  the  center  of  RBF  neurons  of  the  hidden  layer,  A  is 
the  width  of  RBF  in  neurons,  b  is  a  bias  input  and  wtj  is  a 
connection  weight  from  hidden  neuron  j  to  output  neuron 
i[  8], 


In  order  to  obtain  a  high  performance  GNSMC  solution  to 
control  of  nonlinear  system,  it  is  essential  to  define  the 
proper  RBFNN  architecture.  The  network  architecture  is 
determined  by  the  number  of  nodes  in  each  layer  and  the 
location  of  the  function  centers.  The  determination  of  the 
number  of  nodes  in  input  layer  and  output  layer  is 
selected  from  the  equivalent  control  equation  (11).  In  the 
computation  of  the  equivalent  control,  all  the  desired  and 
actual  states  are  used.  Therefore,  the  number  of  nodes  in 
input  layer  are  the  number  of  desired  and  the  actual 
states.  And  the  number  of  nodes  in  output  layer  are  the 
number  of  control  inputs  to  plant.  For  example  in 
application  of  robot  control,  the  number  of  nodes  in  the 
output  layer  are  determined  by  the  number  of  the 
actuators  of  the  robot. 


Figure  2.  Illustration  of  the  RBFNN  structure 


The  selection  of  the  number  of  hidden  layer  nodes  and  the 
location  of  centers  will  have  profound  effects  on  the 
performance  of  the  RBFNN  in  GNSMC.  The  number  of 
neurons  in  the  hidden  layer  should  be  selected  such  that 
the  RBFNN  has  mapping  capability  for  inverse  dynamics 
of  the  plant.  The  learning  of  RBFNN  is  divided  into  two 
parts  usually  because  the  hidden  layer  differ  from  the 
output  one.  First,  the  non-linear  hidden  is  determined  by 
some  kind  of  clustering  algorithm  and  then  the  output  one 
is  optimized  either  by  gradient-descent  procedure  that 
represents  a  generalization  of  the  least  mean  square 
(LMS)  algorithm.  K-means  clustering  algorithm  is  used  to 
find  the  proper  location  of  centers  in  hidden  layer.  And  p- 
nearest  neighbour  heuristic  rule  is  used  to  determine  the 
width  of  basis  functions  [8]  [9]. 

The  inputs  of  RBFNN  I  consist  of  desired  and  actual 
states. 

I  =  [(Xd)T  (15) 
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The  output  of  RBFNN  is  equivalent  control  term  Ueq. 
And  it  can  be  computed  as  follows, 

z 

Ueq,  =wi0b  +  '£wiJhj{I)  i  =  l,..m  (16) 

The  first  step  in  the  development  of  such  learning 
procedure  of  RBFNN  is  to  define  the  instantaneous  value 
of  the  cost  function.  The  cost  function  is  selected  as, 


e  =  \l{Ue<ldj  ~ Ueqj J  =  |  J(a U sJ  (17) 

Z  j= 1  Z  ;=1 

The  weight  adaptation  algorithm  of  RBFNN  is  a 
combination  of  two  learning  phase.  In  first  phase, 
positions  of  the  radial  base  functions  of  the  hidden  layer 
is  set  by  k-means  clustering  algorithm.  Then,  p-nearest 
neighbour  heuristic  rule  is  used  to  determine  their  widths. 
This  phase  is  off-line  and  a  part  of  the  construction  of 
RBFNN. 


The  second  part  of  learning  is  based  on  gradient-descent 
procedure  for  the  optimizing  of  all  free  parameters  of 
RBFNN.  The  phase  is  on-line  progress.  The  requirement 
is  to  find  the  free  parameters  wtj ,  c ]  and  so  as  to 

minimize  e .  The  result  of  this  minimization  are  given  as 
follows  [8]  [9]. 


Adaptation  formulas  for  the  linear  weights, 
de(t) 


=  -*,(Oty/) 


wij(t  +  l)  =  wij(t)-ril 


de(t) 

dw^t) 


j  =  0,l,..,z 


(18) 

(19) 


Adaptation  formulas  for  the  positions  of  RBF  centers, 


de(t) 

dCj(t) 


1  m  r  , 
U)hj  (/)[/  -  Cj  (oj 

A  i=l 


(20) 


Cj(t  +  l)  =  cj(t)-r]2-f^-  j  =  1,..,  z  (21) 

Adaptation  formulas  for  the  width  of  standard  Gaussian 
functions, 


few 

fe,W 


m 

1=1 


X){t) 


(22) 


A;(r  +  1  )  =  XJ(t)-r)3^L  ;  =  z  (23) 

dAj(t) 

In  adaptation  formulas,  fih  ju2,  fa  denote  the  learning  rate. 


4.  GNSMC  system  verification  by 
simulation 

The  performance  of  the  GNSMC  system  presented  in 
Section  3  is  verified  by  extensive  simulation  studies  on  a 
direct  drive  two  dof  SCARA  type  manipulator  sketched  in 
Fig.  3. 


Link  2 


Figure  3.  Illustration  of  the  SCARA  type  manipulator 


4.1.  Manipulator  Dynamics 

The  dynamical  equations  of  this  manipulator  are  given  by 
the  following  equation  [10]: 

M(q)q  +  V(q,q)+fc  =U  (24) 

where,  q  is  the  vector  of  joint  angles  q}  and  q2  shown  in 
Fig.3.  U  is  the  torque  vector  applied  to  the  joints,  M  is  the 
inertia  matrix,  V  is  the  vector  of  centripetal  and  coriolis 
forces,  fc  stands  for  coulomb  friction  and  M  and  V  can  be 
written  explicitly  as 


M(q)  = 


Pi  +2  p3  cos  (q2) 
P2+  Pi  COs(q2  ) 


p2+p3COS(q2) 

Pi 


(25) 


V(q,q)  = 


<?2 (27i  +q2)P-i  sin (q2) 
<?i2  Pi  sin(q2) 


(26) 


where />,=  3.1877 ,p2=  0.1168  and 0.1630  [10]. 


4.2.  Application  of  the  GNSMC  scheme  to  the 
control  of  the  manipulator 


The  states  are  selected  as  the  angular  positions  and  their 

derivatives: 

xx  -  q  and  x2  =  q  . 

Then  the  following  state-space  form  representation  of  the 
robot  manipulator  is  obtained: 


A 


x2 

~M~](x])(V(xvx2)  +  fc 


(27) 
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Figure  4.  Desired  trajectory  of  end  effector  on  x-y  plane 

to  compute  additional  control  term  in  equation  (12), 
proper  selection  of  gain  matrices  (G  and  D)  should  be 
done.  In  our  simulation  studies  G  and  D  were  selected  as 
follows, 

[5  0  1  O' 

0  5  0  1 


and  p  = 


2  0 
0  2 


trajectory  of  end  effector  on  x-y  plane  is  shown  in 
Fig.6.(d)-(e).  It  shows  that  GNSMC  satisfies  robust 
control  to  payload  variations. 


Figure  5.  Angular  position  references 

5.  Conclusions 


In  addition,  to  eliminate  chattering,  shifted  sigmoid 
function  is  used  for  computing  additional  control  term  as 
follows, 

A  U(t)  =  (G#)-1  D<j>{a)  =  K<j>(o)  (28) 

Where  0(.)  is  a  shifted  sigmoid  function  and  it  is  defined 
as. 


In  this  paper,  a  new  intelligent  approach  to  sliding  mode 
controller  for  n-link  industrial  robot  manipulators  is 
developed.  In  this  approach,  we  used  Gaussian  function 
based  on-line  adaptive  RBFNN  to  compute  the  equivalent 
control  signal  of  sliding  mode  controller.  And  also  it  has 
one  great  advantage  that  is  an  on-line  controller  which 
means  that  control  and  learning  takes  place  at  the  same 
time. 


^--1  (29) 

1  +  e  1 

For  computing  the  equivalent  control  term,  we  employed 
three  layer  RBFNN  with  1  =  8  (number  of  input  nodes),  z 
=  8  (number  of  hidden  nodes)  and  m=2  (number  of 
output  nodes).  The  initial  weight  of  RBFNN  have  to 
enable  the  system  to  satisfy  stability  criterition.  In  our 
simulation  random  small  initial  weights  are  selected  to 
make  RBFNN  output  insignificant  at  initial  stage  of  the 
control  process.  Moreover,  all  adaptation  rates  (u,,  u2,  u3 ) 
of  RBFNN  is  selected  as  0.01. 

The  desired  end  effector  trajectory  used  for  simulation 
studies  in  Fig.  4.  It  corresponds  to  the  state  references 
shown  in  Fig.  5.  The  simulation  results  are  presented  in 
Fig.  6.  The  solid  and  dashed  lines  belong  to  first  and 
second  link,  respectively.  The  robot  perfectly  follows  the 
desired  trajectory. 

In  order  to  compare  the  effects  of  payload  change  while 
the  robot  is  working  40  kg  payload  was  added  and  the 
effects  of  this  payload  to  trajectory  control  were 
examined.  The  payload  was  added  at  t=10  sec  while  the 
whole  task  of  the  manipulator  takes  20  sec.  Actual 


The  simulation  studies  show  that  GNSMC  avoid 
chattering  without  need  a  priori  knowledge  about  plant, 
ensure  a  fast  system  response  and  fast  convergence  of 
system  states  to  the  sliding  surface  and  good  trajectory 
following  performance  with  payload  variations.  These 
characteristics  make  the  proposed  control  technique  an 
attractive  solution  for  trajectory  control  of  robot 
manipulators. 
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U1&U2(Nm)  Ueq1&Ueq2(Nm) 


(e) 

Figure  6.  GNSMC  performance  during  20s  trajectory 
tracking  of  the  direct  drive  two  dof  SCARA  type 
manipulator,  (a)  Angular  tracking  errors,  (b)  Equivalent 
control  signals,  (c)  Control  signals,  (d)  Angular  tracking 
errors  with  payload,  (e)  Actual  trajectory  with  payload 
variation. 
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Abstract 

The  paper  attempts  to  present  a  few  results  obtained 
during  the  research  work  sustained  by  the  author  within 
the  domain  of  control  automatic  systems.  The  results 
presented  in  here  illustrate  some  possibilities  of 
structural  optimisation  and  functioning  in  case  of 
dimensional  control  automatic  systems  used  in  the 
bearing  industry  plainly  in  the  case  of  ball  bearing. 

1.  Introduction 


dozing.  During  the  research  work  numerical  simulations 
of  the  movement  laws  by  changing  the  main  influence 
factors  have  been  done. 

The  analysis  of  the  results  obtained  during  the 
optimisation  process  lead  to  the  elaboration  of  new 
possibilities  of  assuring  the  structures  of  optimum 
automatisation  feeding  from  the  efficiency  and  energy 
consumption  point  of  view,  for  different  types  of  pieces. 

2.  Theoretical  aspects 


The  research  objectives  have  been  underlined  within 
the  frame  of  solving  a  specific  problem  raised  in  the 
field  of  machine  manufacturing  industry-  automatic 
systems  for  dimensional  control  productivity 
improvement  characteristics  and  performances. 

The  analysis  of  feeding  systems  has  been  initiated 
within  the  frame  of  scientific  research  objectives 
proposed  by  the  author  for  which  the  obtained  results 
have  imposed  an  approach  of  the  problem  from  a 
theoretical  as  well  as  a  practical  point  of  view. 

In  this  paper  are  being  approached  a  few  aspects 
concerning  the  statistical  and  dynamic  behaviour  of 
automatic  systems  for  ball  control  with  the  aim  of 
achieving  the  best  structures  from  the  efficiency  point  of 
view.  Theoretical  studies  consist  of  mathematical 
models  for  different  types  of  feeding  and  transport 
structures.  Their  analyses  aimed  at  the  static  and 
dynamic  behaviour  of  spherical  components  during  the 
different  manipulating  conditions. 

In  such  circumstances,  there  have  been  underlined  a 
series  of  influence  factors  that  influence  both  the  static 
and  the  dynamic  behaviour  of  the  spherical  parts  in 
different  systems  like  the  ones  for  feeding,  transport  and 


The  static  analyses  have  underlined  a  group  of  forces 
acting  upon  the  half  -  finished  products  during  their 
feeding  process  (in  other  words,  in  contact  with  the 
catching  elements). 

In  the  case  of  feeding  systems  having  disks  with  nets 
there  has  been  analysed  the  general  case  in  that  the 
disk  has  an  inclination  with  a  y  angle  with  respect  to  the 
horizontal,  the  parts  being  captured  from  the  tub  in  its 
inferior  position  and  released  in  the  superior  one. 

The  spherical  piece,  located  in  the  inferior  position  of 
the  tub  and  disk,  is  being  stressed  by  a  system  of  forces. 
The  disk,  through  the  nest  wall,  provides  the  force  F, 
force  whose  influence  on  the  part  depends  on  the  nest 
geometry  as  well  as  the  dimensional  characteristics  of 
its  walls  (see  Fig.  1.). 

Equation  (1)  shows  the  condition  that  has  to  be 
fulfilled  by  the  disk  through  the  force  developed  and 
that  acts  upon  the  piece,  in  order  to  achieve  a  rotational 
movement  in  the  case  of  the  latter. 


Q/ioasy  +  any(psxE\f/-snip)] 
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Figure  1.  Static  behaviour  of  spherical  pieces  on 
feeding  disk  with  nest. 


An  important  aspect  that  arises  during  the  working 
process  of  the  feeding  systems  refers  to  parts’  (half- 
finished  components)  behaviour  in  the  tub  before  their 
trapping  by  the  catching  and  transport  elements.  From 
a  static  point  of  view  the  analysis  of  this  behaviour  can 
focus  on  finding  the  coupling  forces  and  the  limit 
equilibrium  conditions  among  the  pieces. 

In  the  feeding  tubs,  the  half-finished  components  are 
being  located  in  a  bulk  shape,  actually  in  a  random 
position,  so  finally  this  has  to  reach  in  the 
neighbourhood  of  the  catching  and  transport  elements 
in  order  to  be  released  from  the  system.  In  order  to  get  a 
simplification  of  the  analysis  there  has  been  taken  into 
account  only  the  homogeneous,  spherical  shapes  for  the 
half-  finished  components,  and  those  that  are  into  a 
tangent  position  related  to  one  another. 

Isolating  two  pieces  from  the  tub  (see  figure  2)  and 
taking  into  account  their  tangential  position  and  the  fact 
that  they  are  relying  on  tow  planes  (imaginary  ones) 
that  intersect  after  a  horizontal  axis  having  the  a,  and 
a2  angles  related  to  the  horizontal  plane  (these 
imaginary  planes  are  materialising  in  fact  the  pieces 
from  the  tub  on  which  the  parts  considered  are  relying 
on),  the  equilibrium  position  has  been  deduced, 
considering  the  equation  (2)  and  coupling  forces  as  in 
equation  (3). 

tgp  =  ~{ctK  a ,  ~  ctga,);  (2) 

j  _  ff g\jj sin2q; H  sin!a2-2smais]na2(xyfai 1 
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Figure.  2.  Static  behaviour  oj  the  pieces  in  the 
bunker  tub. 

The  p  angle  is  known  as  natural  breaker  angle.  Half- 
finished  components  movement  is  achieved  mainly 
because  of  superior  layers  displacement  influenced  by 
the  natural  breaker  (3  angle  and  only  in  a  small  amount 
because  of  the  displacement  of  the  whole  quantity  of 
pieces  located  at  the  bottom  inside  the  bunker.  As  one 
can  see  in  the  equation  (2),  the  natural  breaker  p  angle 
can  be  influenced  by  the  tub  walls  profile,  respectively 
by  its  inclination  angle. 

Dynamic  behaviour  analysis,  respectively  the  writing 
of  the  spherical  piece  movement  governing  equations 
with  respect  to  a  fixed  coordinate  system  can  be  done 
based  on  analytical  mechanics,  namely,  Lagrange’s 
equations.  Consequently  the  coordinate  system  OxoyoZo 
(fixed)  was  considered  to  be  the  relating  one  in  order  to 
determine  the  movement  governing  equations  for  a 
spherical  piece  having  the  radius  and  mass  nio  and 
which  is  being  displaced  by  the  conveyer  disk  from  the 
inferior  position  of  the  tub  to  the  superior  part  where  it 
will  be  released  to  the  exhaust  sewer.  As  well  as  before, 
these  feeding  systems  with  disk  having  nests, 
respectively  with  rotor  having  horizontal  palettes,  can 
be  considered  particular  types  of  the  one  with  disks 
having  inclined  nests.  Also,  in  this  case  can  be  written 
Lagrange’s  equation  with  respect  to  the  coordinate 
system  Ox0yoZ0. 

In  the  dynamic  behaviour  analysis  of  the  spherical 
piece  in  these  feeding  systems  of  the  spherical  piece  in 
these  feeding  systems  one  assumes  that  the  ball  has  two 
degrees  of  freedom:  -  a  rotation  around  Oz0  axis, 
because  of  the  conveyor  disk;  -  radial  displacement 
because  of  the  interstice  between  pallets,  respectively 
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because  of  the  relative  displacement  among  piece,  tub 
and  conveyor  as  a  consequence  of  manufacturing 
characteristics  of  the  latter.  Nest,  two  generalized 
coordinates  were  chosen:  the  rotational  angle  §  and 
radius  p.  Lagrange’s  equations  related  to  the  fixed 
system  Oxyz,  taking  into  account  the  generalised 
coordinated:  radius  p  and  rotational  angle  $  can  be 
written  as  follows  to  get  the  equation  governing  the 
movement  in  radial  direction  (see  eq.  4). 

In  order  to  analyse  the  dynamical  behaviour  of  the 
half-finished  components  in  the  inclined  tub  there  had 
been  considered  a  half-finished  (rigid)  in  movement 
inside  the  feeding  tub  forced  to  stay  in  a  permanent 
contact  with  the  latter.  The  relative  displacement  of  both 
rigid  bodies  being  in  contact  can  be  done  through 
gliding  or  rolling,  Lagrange’s  equation  with  multipliers 
has  been  used  to  write  the  equation  of  movement, 
together  with  the  nonholonomic  couplings  leading  to  a 
system  with  1+s  equations  having  1+s  unknowns:  qi, 

K  In  case  the  piece  is  a  spherical 
(homogeneous)  one  and  it  is  rolling  (with  a  swivel 
movement)  without  gliding  between  piece  and  tub  a 
nonholonomic  coupling  is  being  achieved.  Choosing  the 
horizontal  plane  like  the  xOy  plane  of  the  fixed 
coordinate  system  (Oxyz),  having  Oz  in  the  upper 
direction  of  the  vertical  and  Gx’y’z’  coordinate  system 
jointly  with  the  sphere  one  can  see  that  the  point  G  will 
be  permanently  in  plane  z=R  (figure  3.). 


Figure  3.  Dynamic  behaviour  of  the  piece  in  the 
bunker  tub. 

The  generalised  coordinates  xq,  y0,  0,  vy,  4>  Satisfied 
two  coupling  connections  unintegrable,  therefore  we  get 
a  nonholonomic  system,  a  sclerenome  one,  having  three 
degrees  of  freedom.  In  case  the  tub  is  inclined  the 
coupling  between  both  bodies  becomes  holonomous 
because  the  rotational  axis,  at  a  rolling  of  the  piece 
without  gliding  in  the  plane  of  inclined  tub,  keeps  a 
constant  direction.  From  Lagrange’s  equations  have 
been  obtained  three  prime  integrals  (5)  that  allow  a 
reduction  of  the  problems  to  quadratics. 


<p  +  y>  co sO  =  CD 
6 2  +  H>2  sin2#  =  C2 (5) 

<  V  sin2  #  =  C3(]  +  COS0). 

After  calculations  have  been  obtained  the  movement 
laws  in  the  case  of  spherical  pieces  located  on  the 
inclined  disk  having  an  angle  y  with  respect  to 
horizontal. 


xa  =  jjgt2sinr;  e  =  j^gt2siny.  (6) 


In  case  in  that  between  piece  and  disk  there  is  both 
gliding  friction  (having  a  coefficient  p)  and  rolling 
friction  (having  a  coefficient  s),  the  movement  law  can 
be  written  as  in  (7)  for  pieces  moving  on  the  bunker’s 
disk. 


XG 


GR(R  smy  -s-cosy)  2 
( mR2  +  Ig)  ‘ 


(7) 


One  can  consider  that  in  case  of  a  fast  acceleration  in 
movement,  a  ratio  s/R  as  small  as  possible  it  will  be 
favourable.  In  the  same  case  can  be  accomplished  the 
condition  for  a  rolling  movement  only,  without  friction. 
In  case  of  a  cylindrical  piece  this  condition  becomes,  as 
follows: 


Considering  the  theoretical  results,  the  authors  have 
perfected  a  new  testing  plant  to  realise  the  static  and 
dynamic  characteristics'  correlation  with  the  specific 
functional  characteristics  and  to  determine  the  main  and 
the  most  important  optimal  working  parameters.  The 
authors  have  based  their  studies  on  the  automatic 
processing  and  dimensional  control  systems  used  in  the 
bearing  and  assembling  component  production  [1]. 

All  these  studies  are  very  important  in  the  field  of 
precision  mechanics  and  mechatronics  systems. 

The  theoretical  studies  have  generated  some 
conclusions  that  led  the  authors  to  elaborate  the  rollers' 
laws  of  motion  in  different  operating  conditions  [2]: 

•  Spherical  parts  inside  the  rotating  and  inclined 
feed  hopper  of  the  feeding  systems; 

•  Spherical  parts  transported  by  holding  elements; 

•  Spherical  parts  coming  out  from  the  feeding  and 
transporting  systems; 

•  Cylindrical  parts  coming  out  from  the  feeding  and 
transporting  systems; 

•  Cylindrical  parts  transported  by  holding  elements; 

•  Cylindrical  parts  inside  the  rotating  and  inclined 
feed  hopper  of  the  feeding  systems. 

Considering  the  theoretical  analysis,  the  authors  give 
emphasis  to  the  influence  factors  that  can  determine  the 
optimised  structure  realisation  [2]. 
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Some  of  the  most  important  factors  are: 

•  the  holding  elements  rotative  speed; 

•  the  tank  rake; 

•  the  parts’  dimensional  characteristics; 

•  the  pats’  features  of  constructions; 

•  the  parts’  quality  coefficient; 

•  the  holding  elements  dimensional  characteristics; 

•  the  holding  elements  features  of  constructions; 

•  the  loopihg  unit  mechanical  characteristics. 

3.  Experimental  aspects 


on  the  tank  rake  and  the  rotation  speed.  It  also  presents 
the  correlation  among  the  static,  dynamic  behaviour  and 
the  specific  functional  characteristics.  Analysing  this 
chart,  the  authors  can  remark  the  tank  rakes  and  the 
rotation  speeds  that  assure  a  the  high  productivity  for 
each  of  part  types.  The  main  purpose  of  this  research  is 
to  generate  optimised  feeding,  transport,  proportioning 
and  alignment  systems  for  more  mechanical  efficiency. 

The  computer  processes  all  the  information,  testing 
the  analysed  system  and  setting  the  working  parameters 
to  obtain  an  optimised^  mechanical  system. 


In  order  to  analyse  the  correlation  of  the  static  and 
dynamic  characteristics  and  the  specific  functional 
conditions,  the  authors  devised  and  realised  an  original 
testing  plant.  The  law-chart  of  this  equipment  is 
presented  in  figure  4.  The  computer  traces  the  whole 
process  of  deposition,  catching,  handling,  transport  and 
exhaust  of  the  parts  from  the  tank,  analysing  the 
necessary  information  and  signals  from  the  sensors  and 
generates  the  decisions  on  the  optimised  conditions  of 
the  feeding  system. 


Figure  4.  Testing  plant  law-chart. 

This  testing  plant  reproduces  the  mechanical  structure 
of  a  feeding  system  with  varied  transporting  rotors 
(hooked  disks,  carved  disks  and  catch  pin  disks).  The 
theoretical  study  imposed  conditions  on  the 
experimental  parameters.  The  handling,  catching  and 
transporting  elements  must  turn  round  with  a  rotation 
speed  with  a  variation  range  among  1  and  18  rotations 
per  minute.  The  tank  rake  must  vary  among  0  and  90 
degrees,  namely  from  horizontal  position  to  vertical 
position.  The  primary  data  that  will  be  processed  by  the 
computer  are:  the  rotation  speed  of  the  manipulation 
elements  (Sn),  the  tank  rake  (&,),  the  time  (t),  the 
working  productivity  (Sq)  and  the  part  characteristics 
(shape,  dimensions,  appearance  and  material).  Figure  5 
presents  an  example  of  a  chart  generated  inside  the 
experimental  process  for  the  spherical  parts.  This 
example  presents  the  productivity  variation  dependent 
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Figure  5.  The  chart  of  the  productivity  variation. 

4.  Conclusion 


This  method  provides  an  automatic  process  control  as 
well  as  a  data  base  for  the  select  on  of  the  working 
parameters  that  can  assure  the  most  efficient 
mechanical  structure  used  in  the  automation  of 
dimensional  control  systems. 

In  conclusion,  an  important  data  base  was  gained 
and  an  automatic  process  control  for  optimising  the 
feeding,  transport,  proportioning  and  alignment  systems 
achieved.  For  further  research  the  authors  want  to  test 
the  system  with  the  aid  of  neuronal-networks.  For  the 
catching  probability  increase  and  the  analyse  of  all 
dynamic  aspects  a  fuzzy  method  will  be  proposed. 
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Abstract:  This  paper  presents  a  contribution  to  E.J.Davison’s  computational  method  for  stability  analysis  of 
mechatronic  systems,  which  are  essentially  non-linear  dynamic  systems,  via  numerical  construction  of  quadratic 
Lyapunov  functions.  It  makes  use  of  E.J.Davison’s  main  algorithm  and  H.H.Rosenbrock’s  method  for  finding 
extremum  (max. /min.)  values  of  a  function,  while  contributing  to  the  derivation  of  numerical  scheme  via 
appropriate  algorithmic  expansion  of  matrix  equations  involved.  This  way  an  expanded  and  improved  estimate  of 
the  approximating  hypervolume  of  asymptotic  stability  sub-domain  is  obtained  that  is  closer  to  the  stability  and/or 
the  bifurcation  boundary. 

Keywords:  Mechatronic  systems;  non-linear  systems;  autonomous  dynamics;  asymptotic  stability;  Lyapunov 
approach;  computational  methods. 


I.  INTRODUCTION 

The  asymptotic  stability  is  one  the  structural 
properties  of  general  dynamic  systems  is  required  as  a 
sine-qua-non  condition  in  any  engineering  system, 
and  so  is  the  case  of  motion  and,  in  general,  of 
mechatronic  control  systems.  It  should  be  noted, 
however,  mechatronic  control  systems  are  essentially 
non-linear  as  is  the  motion  dynamics  they  are  aimed 
to  control.  Therefore  their  stability  too  has  to  be 
addressed  as  such  if  the  design  and  implementation 
constitute  the  actual  goal  of  thier  investigation  from 
the  viewpoint  of  applied  science. 

Almost  half  a  century  has  elapsed  since  the  re¬ 
discovery  of  Lyapunov’s  1892  dissertation  (A.M. 
Lyapunov,  Probleme  General  de  la  Stabilite  du 
Mouvement  Annals  of  Mathematical  Studies  No.  17. 
Princeton  N.J.:  Princeton  University  Press,  1949),  and 
a  number  of  research  studies  for  the  extension  of 
Lyapunov  stability  theory  and  novel  theoretical 
findings  have  been  elaborated  (via  differential 
geometric  approach,  e.g.,  A.  Isidori,  1989,  J.-J.E. 
Slotine  &  W.Li,  1991;  H.K.  Khalil,  1992).  In 
practical  applications,  however,  it  is  rather  important 
to  have  techniques  to  establish  guaranteed  asymptotic 
stability  in  a  wider  feasible  region  in  order  to  design 
control  systems.  The  main  concern,  due  to  the  essence 


of  Lyapunov’s  second  or  direct  method  to  the  stability 
of  motion,  is  the  construction  of  most  appropriate 
Lyapunov  function  that  guarantees  the  largest  possible 
asymptotic  stability  region  in  terms  of  magnitude 
departures  from  the  equilibrium  state  vector. 
Hundreds  of  contributions  may  be  found  in  the 
literature,  and  therefore  is  impossible  to  cite  them  in 
here.  From  the  applications  point  of  view,  however, 
the  ones  that  may  be  efficiently  used  in  a  computer- 
assisted  environment,  in  particular  on  a  PC  based 
platform,  appear  to  be  of  substantial  value. 

In  our  investigations  along  these  lines  [6]- [9]  and 
with  regard  to  non-linear  behaviour  of  the  wide  class 
of  mechatronic  systems,  recently  we  have  revisited 
some  of  the  early  developments  which  draw  from  the 
very  sources  of  A.M.Lyapunov  and  his  many 
followers  in  systems  and  control  science  and 
engineering;  just  to  name  a  few,  V.I.Zubov  (1955), 
W.Hahn  (1959),  R.E. Kalman  and  J.E.  Bretram 
(1960),  J.LaSalle  and  S.Lefschetz  [11],  A.M.  Letov 
[12],  S.  Marggolis  and  W.  Vogt  (1963),  F.N.  Bailley 
[1],  S.  Barnett 

The  works  of  E.J.  Davison  and  his  collaborators 
[3]-[5]  are  particularly  important,  for  they  resulted  in 
a  computer-assisted  method  which  is  known  to  give 
the  largest  approximate  of  asymptotic  stability  sub- 
domain  for  a  given  non-linear  systems  model. 
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Moreover,  Davison’s  work  has  been  motivated  by  the 
goal  of  deriving  a  constructive  computational  method 
which  would  give  automatically  (that  is,  without  trial- 
and-error  iteration)  the  largest,  numerically  attainable 
region  of  asymptotic  stability  for  a  given  systems 
model  of  arbitrary  order  via  quadratic  Lyapunov 
functions  [5], 

In  our  research,  we  have  found  these  contributions 
to  be  of  particular  value  in  applications  and  worth  re¬ 
elaborating  within  the  information  processing 
potential  of  nowadays  standard  PC  platform  [13].  In 
doing  so,  we  have  found  that  H.H.  Rosenbrock’s 
results  [14]  on  using  rotating-co-ordinates  and  hill¬ 
climbing  strategy  for  static  minimisation  are  rather 
instrumental  [5],  though  other  similar  techniques  can 
be  employed.  This  paper  gives  a  summarised 
presentation  of  our  results;  due  to  space  limitation, 
results  of  a  number  of  computer  simulation 
experiments  will  be  given  in  the  course  of 
presentation  (Figures  1,  2,  3).  The  comparison 
analysis  has  been  worked  out  with  respect  to  the 
original  examples  used  in  Davison-Kurak  [5]:  Van 
Der  Pol’s  dynamic  systems;  second-order  bang-bang 
servo-mechanisms;  and  a  third-order,  anal alyti cal 
non-linear  dynamic  system. 

II.  AN  OUTLINE  OF  DAVISON’S  TECHNIQUE  [5] 

It  is  well  known,  there  is  no  general  method  for 
determining  the  best  suitable  Lyapunov  function  for 
the  purpose  of  guaranteeing  asymptotic  stability  for 
non-linear  dynamical  systems  of  arbitrary  order 
because  Lyapunov  theory  gives  sufficient  conditions 
only.  Solely  in  the  case  of  linear  models  these 
conditions  are  known  to  be  necessary  as  well. 
Moreover,  the  problem  of  computationally  oriented 
methods  for  asymptotic  stability  investigation  is  even 
more  involved.  Nevertheless,  there  exist  a  number  of 
contributions  to  computer-assisted  determination  of 
the  largest  sub-domain  of  the  domain  of  asymptotic 
stability  for  non-linear  dynamical  models  of  motion  in 
mechtronics  systems  such  as  represented  by  the  quite 
general  class  (1).  Firstly,  Davison  and  Man  have 
given  a  computational  solution  to  the  well-known 
matrix  equation  occurring  in  Lyapunov  stability 
analysis  [3]  while  Davison  and  Cowan  for  stability 
region  of  any  second-order  system  [4].  Secondly,  in 
[5],  Davison  and  Kurak  gave  a  constructive 
computational  method  for  asymptotic  stability  via  the 
class  of  quadratic  Lyapunov  functions  and 
demonstrated  that  this  approach  yields  by  far  better 
estimates  than  all  previously  derived  computer- 
assisted  techniques.  The  main  idea  behind  Davison’s 
method  is  to  convert  the  problem  of  finding  the 
“best”  quadratic  Lyapunov  function,  yielding  the 
maximum  stability  boundary  on  the  grounds  of 
LaSalle  and  Lefschetz  theorem  [9]  on  sufficient 


conditions  for  determining  a  sub-domain  in  the 
asymptotic  stability  domain  ,  into  a  parameter 
minimisation  problem  and  then  to  resolve  it  by  using 
Rosenbrock’s  method  [14],  In  the  case  of 
discontinuous  system  models,  the  first  step  is  to  find  a 
most  appropriate  equivalent  continuous  model 
approximating  the  original  one  [5],  [6]. 

It  is  assumed  that  a  motion  or  dynamics  of 
mechatronic  systems  is  represented  in  the  form  of  the 
well  known  general  model  of  vector  differential 
equation 

X  =  f(x)  ,/( 0)  =  0  (1) 

and  its  equilibrium  position  around  the  zero-vector,  x 
=  0,  in  the  n-dimensional  linear  space  having  a  well- 
defined  Euclidean  norm  (the  output  equation  is  not  of 
particular  concern  here). 

The  great  potential  of  Lyapunov  functions  and 
Lyaspunov  theory  lies  in  its  applicability  for  analysis 
and  design  of  non-linear  dynamic  systems  provided  a 
quality  estimate  of  system’s  asymptotic  stability  sub- 
domain  within  the  respective  domain  can  be 
determined  [1],  [2],  [5],  [14]. 

It  is  assumed  that  a  model  of  non-linear  system 
of  type  (1)  and  fulfilling  the  above  quoted 
assumptions  is  available.  Therefore  a  mechatronic 
system  of  the  class  (1)  can  be  assumed  to  be  locally 
asymptotically  stable  in  the  vicinity  of  the 
equilibrium,  and  that  the  eigenvalues  of  the  Jacobean 
matrix  at  the  equilibrium  vector  are  in  the  left-hand 
part  of  the  complex  plane.  Then  LaSalle-Lefschetz 
theorem,  which  is  given  below,  may  be  applied 

Theorem  [11] 

The  equilibrium  point  x  =  0  of  system  (1)  is 
asymptotically  stable,  if  there  exist  a  scalar  function 
V(x),  a  Lyapunov  Function,  having  continuous  first- 
partial  derivatives,  and  a  domain  £2  containing  the 
equilibrium  point,  such  that: 

(1)  r(x)>0,  Vx  e  Q  ,  x^O 

(2)  F(0)  =  0 

(3)  F(x)  =  f  '(x) grad F(x)  <  0,  Vx  e  Q  ,  x^O 

(4)  F( 0)  =  0 

and  if  C  e  positive  constant  such  that  the  surface 
V(x)=C  is  contained  in  Q,  then  the  domain  which  is 
defined  by  V(x)-  C  is  in  the  domain  of  asymptotic 
stability  for  the  system  (1). 

This  theoretical  result  clearly  suggests  the 
conclusion  that  for  a  particular  Lyapunov  function 
V(x),  the  surface  with  the  largest  value,  say  V(x)  - 
Cmax,  which  satisfies  LaSalle-Lefschetz  Theorem, 
gives  the  boundary  of  the  largest  asymptotic  stability 
domain  for  the  system.  It  is  desirable  therefore  to  find 
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a  particular  Lyapunov  Function  V*(x)  such  that 
the  hypervolume,  contained  in  Cl  and  defined  by 
Vw  =  C*iax ,  is  a  maximum. 

The  candidate  for  an  appropriate  Lyapunov 
function  to  system  class  (1),  which  fullfils  conditions 
(1),  (2)  arid  (4)  obviously,  is  the  quadratic  one: 

V(x)=x  Ax,  (2) 

where  A  =  A ,  and  A  is  a  positive  definite  matrix 

(usually  denoted  by  A  >0).  Then  it  remains  to  ensure 
that  condition  (3)  of  the  theorem  above  is  fulfilled.  It 
is  readily  obtained  from  (1)  and  (2)  that 

V(x)  =  2  f’(x)Ax.  (3) 

For  this  purpose,  it  is  desired  to  determine  matrix  A 
and  positive  constant  C,  so  that  the  condition  (3)  in 
the  theorem  holds  in  a  way  that  the  volume  contained 
in  the  closed  surface 

x'Ax  =  C  (4) 

in  the  state  space  of  the  system  is  maximum,  that  is 
“best”  approximation  of  the  asymptotic  stability 
region  for  the  system.  It  may  be  assumed  that  the 
matrix  in  Eq.  (4)  is  scaled  by  constant  C  to  give 
matrix  A ,  and  therefore  the  problem  reduces  to  find  a 
positive  definite  matrix  A  so  that  the  condition  (3) 
holds  and  so  that  the  volume  is  contained  in  the 
closed  surface 

x’Ax  =  1,  (6) 

and  given  by  equation 


and 

(2)  f'(x)Ax<  0,  VxeQ, 

<  ,  (9) 

where  Cl  =  \x\x  Ax  <1 ,  x^Oj 

That  is,  a  minimisation  type  of  organised  search  for  a 
positive  definite  matrix  A  which  will  satisfy  condition 
(3)  of  LaSalle-Lefschetz  theorem  while  maximising 
the  volume  of  the  asymptotic  stability  sub-domain  of 
the  system. 

In  general,  there  may  exist  many  local  minima  to 
the  above  posed  minimisation.  Therefore  it  is  very 
important  that  the  starting  values  for  the  unknown 
parameters  be  chosen  as  close  as  possible  to  those 
values  which  correspond  to  the  global  minimum  of 

n 

product  n(M4}-  It  is  this  feature  that  gives  rise 

»=i 

to  the  essential  importance  of  using  Rosenbrock’s 
method  [14],  and  to  the  importance  of  proper  choice 
of  initial  value  Ao  for  the  matrix  sought  [5].  For 
proofs  and  more  detail,  see  the  original  works  [3],  [5], 

[14]. 

III.  A  SUMMARY  OF  THE  IMPLEMENTED 
ALGORITHMIC  PROCEDURE 

As  a  rule,  the  whole  constructive,  computational 
analysis  of  Davison's  method  is  performed  via  double¬ 
precision  arithmetics.  In  addition,  a  fine  grid 
constructed  using  of  rotating  co-ordinates  [14],  [5]  is 
needed. 

Step  L  Once,  an  autonomous  non-linear  model  of 
the  system  investigated,  which  satisfies  the  basic 
requirements  of  LaSalle-Lefshetz  theorem,  is 
available  in  terms  of  vector  state  equation 


volume  oc 


(?) 


*  =  /(*) 

represented  by  means  of  its  state  variables  and  thier 
time  derivatives: 


is  the  maximum  one.  Here, 

^i(A)  ,  i  —  1,2, . represent  the  eigenvalues 

of  matrix  A,  and  the  maximum  sought  is  obtained  via 
the  minimum  of  the  product  of  all  eigenvalues. 


x  =  <xl,x2,x3? 


and  for  which  it  is  fulfilled  f(0)=0  and  its  Jacobean 
matrix  at  x=0  is  stable,  then,  provided  the  condition 


The  equivalent  parameter  minimisation  problem 
may  be  defined  in  the  following  way:  Find  A  =  A’ 

n 

so  as  to  minimise  product  subject  to  the 

i=l 

constraints  given  below: 

(1)  min  X.,(^)>0  (8) 

7=1,2 . n 


Rerfe)} 

^  J  x=0 

is  also  satisfied,  the  constructive  computations  for  the 
“best”  appropriate  Lyapunov  Function  may  begin. 

The  respective  Lyapunov  function  for  the  system 
investigated  is  sought  within  the  class  represented  by 
the  well-known  matrix-vector  formula 
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V(x)=  x'A0x. 

Note  that  here,  vectors  x  and  x’  may  be  viewed  as 
generating  the  uniformly  dense  sets  of  respective  n- 
dimensional  space.  Also,  at  the  beginning  they  may  be 
viewed  as  giving  the  intersecting  points  of  the  unit 
vectors  within  the  sub-domain  of  asymptotic  stability. 
The  positive  definite  matrix  A0  is  obtained  by  solving 
the  Lyapunov  equation 


J*  Aq  +  A0J  =  — C 


where  matrix  J  is  the  system  Jacobean  evaluated  at  the 
equilibrium  point  x=0: 


J  = 


r 

dx 


( x ) 


j*=0 


(All  derivatives  are  obtained  via  Newton  interpolation 
formula).  Matrix  C  is  an  arbitrary  positive-definite 
matrix  which  may  be  taken  to  be  the  unity  matrix  /. 
LaSalle  and  Lefschetz  have  shown  that,  under  the 
conditions  stated,  the  above  Lyapunov  equation  has 
got  a  unique  positive-  definite  solution.  Therefore  the 
equation 


where  s0  is  an  arbitrary  small  positive  constant 
(usually  taken  to  be  e0  =  0“5  or  smaller  for 

satisfactory  results)  leads  to  obtaining  an  initial 
asymptotic  stability  boundary  for  the  non-linear 
system.  It  should  be  noted  that  the  choice  of  the  initial 
value  for  £0  is  rather  important.  This  initial  region, 
determined  in  the  vicinity  of  the  equilibrium  point, 
represents  a  starting  ground  for  further  steps  2,  3,  and 
4  for  determining  the  maximum  volume  of  the  region 
surrounding  the  equilibrium  within  which  the 
asymptotic  stability  of  the  system  is  guaranteed. 

Step  2.  Further  enlargement  of  the  initial  sub- 
domain  of  asymptotic  stability  is  achieved  by  means  of 
dimensional  search  for  maximising  the  value  of  the 
positive  constant  8  subject  to  the  constraints 


max  f' 

>1.2 . N„  J 


yj 


^yJ(Aa  lz)yt 


A<>  yj  < 


ij 


beginning  with  the  initial  starting  value  8  =£0.  This 
eventually  results  in  obtaining  the  value  £  as  a 
solution,  and  then  the  boundary  of  asymptotic  stability 
sub-domain  is  given  by  the  following  equation 


x'A0x  = 


£ 


max 


This  encompasses  the  whole  region,  enlarged  this 
way,  for  which  it  is  guaranteed  that  the  original  non¬ 
linear  system  has  got  stable  solutions  for  any  initial 
state  vector.  Further  computations  evolve  around 
Davison's  fundamental  idea  of  maximum  expansion 
of  the  volume  of  guaranteed  asymptotic  stability  sub- 
domain  via  approaching  the  real  asymptotic  stability 
boundary  from  the  inside  of  the  region,  which  is 
expressed  by  means  of  mathematical  formulation 
given  in  term  of  representation  (7)-(9). 

Step  3 Further  enlargement  of  the  asymptotic 
stability  region,  established  in  the  previous  step, 
proceeds  by  using  the  positive  constant  smax  in  order 
to  determine  the  new  (temporary)  initial  matrix  value 

A  which,  in  turn,  is  used  in  the  computational 
analysis  within  Step  4  to  obtain  matrix  A  that  defines 
the  surface  of  the  maximum  attainable  rapprochement 
to  the  real  asymptotic  stability  boundary.  This 
constructive  computational  analysis  is  a  kind  of 

optimisation  procedure  for  the  elements  of  matrix  A , 

taking  as  initial  starting  value  for  A  the  re-evaluated 
(A0  /  £max  ) ,  proceeding  further  via  the  minimising 

the  respective  eigenvalues  of  A ;  the  eigenvalues  are 
computed  via  well-known  either  QR  or  SVD  method 
[13].  This  minimisation  is  performed  by 
implementing  Rosenbrock's  method  of  hill-climbing 
for  the  searching  procedure  needed  as  follows: 

A 

where  A,max(/1)  is  the  largest  eigenvalue  of  matrix  A 
while  satisfying  the  inequality  constraints 

(1)  min  X;(^)>0 

/=1,2 . n 


f  \ 

' 

max  ' 

i=  1.2 . iV„ 

A=1.2 . p 

/• 

k  yj 

This  procedure  results  in  determining  matrix  A  by 
means  of  which  the  boundary  of  asymptotic  stability 
sub-domain 

_  * 

x'  A  x  =  € 

is  evaluated. 

It  should  be  noted  that  this  step  is  rather 
important  in  Davison's  method.  For  it  gives  the  initial 
starting  value  which  is  needed  in  Step  4  which 
computes  the  minimisation  of  the  product  of  the 
eigenvelues  of  A,  defining  the  final  hypersurface  of 
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asymptotic  stability  sub-domain  for  the  non-linear 
dynamic  system  investigated. 

Remark  By  minimising  the  maximum  eigen-value  of 

matrix  A,  which  is  inversely  proportional  to  the 
square  of  the  length  of  the  smaller  minor  axis  of  the 
n-dimensional  quadratic  surface,  the  quadratic  surface 
is  forced  to  become  less  eccentric  [5].  Our 
experiments  have  confirmed  Davison-Kurak 
statement  that  if  this  step  is  omitted  in  the 
computational  analysis  as  a  whole  and  the  product  of 
the  eigenvalues  of  A  is  used  as  the  cost  function,  then 
the  minimum  eigenvalue  of  the  initially  chosen  matrix 

A  decreases  rapidly  with  a  little  change  in  the 
remaining  eigenvalues.  This  corresponds  to  local 

n 

minimum  of  the  function  and  ,  as  a 

1=1 

consequence,  the  n-dimensional  quadratic  surface 
defines  a  typical  region  with  a  large  eccentricity  and 
small  volume. 

Step  4 '  Parameter  minimisation  by  using 
Rosenbrock’s  method  [14]  within  this  step  performs  a 

kind  of  optimisation  of  the  elements  of  the  initial 

_  * 

starting  matrix  A  by  means  of  the  minimisation 
procedure  defined  as  follows: 

mm  fl{M/f)} 

7=1 

while  satisfying  the  constraints 


(1)  min  X,(A)>  0 

1=1.2 _ J? 


f  \ 

' 

(2)  max  ‘ 

'  i=1  •)  u 

/• 

k  }>j 

1 — 

Ay,  • 

k= 1.2 . ,p 

l 

to  obtain  x'Ax  =  1 ,  where  matrix  A  is  the  resulting 
matrix,  representing  the  final  asymptotic  stability 
boundary  for  the  non-linear  system  investigated. 

Step  5.  T  It  should  be  noted,  provided  LaSalle- 
Lefschetz  theorem  is  fulfilled,  then  the  boundary 
surface  that  is  evaluated  at  the  end  of  Step  4  is 
guaranteed  to  be  contained  within  the  exact,  real 
boundary  of  asymptotic  stability  for  this  large  class  of 
non-linear  dynamic  systems  considered  in  here.  It  is 

necessary  therefore  to  test  the  negativity  of  the  first 

• 

time  derivative  of  Lyapunov  function  which  is 

constructed  in  previous  computation,  at  all  points 
within  the  stability  domain 

x  e  {x| x'  Ax  <  1,  x  *  o} 


by  using  an  extremely  fine  grid  of  p  points  (number 
of  grid  points  between  the  origin  and  a  given  point  at 
the  quadratic  surface)  and  N  unity  vectors  within 
Euclidean  state  space  of  the  systems.  Due  to 
limitations,  this  detail  is  omitted,  but  it  is  pointed  out 
the  grid  is  formed  by  storing  pre-determined  unit 
vectors  such  that  they  uniformly  span  the  n- 
dimensional  space  following  to  the  concept  of  rotating 
unit  vectors  and  evaluating  their  number  needed  via 

Gregory-Newton  formula  [14].  If  a  non-negative 

• 

value  V(x)  occurs  at  some  point(s),  then  N  and  p 
are  increased  and  steps  2,  3,  4,  and  5  repeated. 

IV.  BASIC  DATA  ON  THE  SOFTWARE 
IMPLEMENTATION 

The  program  package,  implementing  this  imp¬ 
roved  Davison’s  method,  has  been  developed  as  a 
DOS- WINDOWS  application  on  a  PC  486  platform 
using  Borland  C++  and  the  philosophy  of  object- 
oriented  programming.  It  is  constituted  by  a  two 
subprograms  VOVED.CPP  and  REALIZ.CPP,  the 
former  one  via  VOVED.EXE  and  using  class  ‘edit’ 
creates  datafiles  FUNC.CPP  and  FUNCFP.CPP  which 
are  then  used  by  the  latter,  after  being  called-in  and 
complied  by  compiler  bcc.exe. 

A  separate  subprogram  MATRIX.  CPP  and  a  class 
‘matrix’  are  performing  all  matrix  manipulations  and 
operations.  Subprogram  REALIZ.CPP  makes  use  of 
object-programming  functions  NAJDW.  CPP, 
NAJDIAo.CPP,  EM  AX.  CPP,  VECTOR.  CPP, 
MINLAM.CPP,  EIGENVAL.CPP,  and 
MINPRLAM.CPP  for  computational  procedures 
within  Davison’s  method. 

Program  function  L_GRAFIK.CPP  enables 
graphical  presentation  of  projections  of  the  asymptotic 
stability  volume  on  co-ordinate  planes  of  the  state 
space,  while  datafile  REPORT.DAT  enables  printouts. 
For  systems  up  to  10th  order,  a  perfect  man-machine 
co-operative  symbiosis  is  achieved;  thereafter  some 
noticeable  computational  delay  is  taking  place 
following  the  increase  of  the  order  of  system 
dynamics. 


V.  CONCLUDING  REMARKS 

The  essential  requirement  is  that  the  right-hand 
side  vector  function  of  the  model  be  continuous  and 
sufficiently  structured  so  as  to  guarantee  uniqueness 
of  solution  to  the  model  equation.  If  not,  as  in  the  case 
of  Davison’s  example  [5]  on  second-order  bang-bang 
servomechanisms,  a  most  adequate  approximation 
must  be  sought  beforehand.  All  other  work  is 
performed  by  this  constructive  computational  method. 
The  use  may  be  extended  to  complex  systems  [9] . 
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Abstract. 

This  paper  proposes  a  cross-coupling  controller  for  a  two 
degree s-of -freedom  platform.  The  cross-coupling  controller  is 
fed  with  the  contour  error  along  the  trajectory ,  and  in  turn 
produces  correcting  signals,  which  fed  as  additional  input  for 
each  axis,  so  that  the  magnitude  of  the  error  is  reduced.  The 
relationship  between  the  cross -coupling  controller  gains,  the 
trajectory  and  the  P1D  controller  gains  of  each  axis  is 
determined  by  pole  placement.  Simulation  results  show  that 
the  cross-coupling  controller  is  capable  of  reducing  the 
contour  error  by  30%. 

I.  Introduction 

Recently,  several  prototypes  of  CNC  machine 
tools,  based  on  six  Degrees-of-Freedom  (DOF)  parallel 
manipulators  such  as  Stewart  platform  [1-4],  have  been 
proposed.  This  kind  of  machine  tools  posses  the 
advantages  of  high  force/torque  capacity,  high  structural 
rigidity  and  better  accuracy  [5-9]. 

In  order  to  ensure  the  high  accuracy  of  the  CNC 
machine  tools,  appropriate  controllers  or  control 
schemes  are  needed.  Several  methods  have  been 
proposed  and  many  advances  have  been  accomplished 
for  conventional  CNC  machine  tools  [10].  One  of  these 
methods  is  the  cross-coupling  controller,  first  proposed 
in  [li] 

The  idea  behind  the  cross-coupling  controller  is 
that  as  the  system  tracks  a  given  contour,  the  contour 
error  can  be  determined  and  then  used  to  produce  a 
compensation  signals  to  the  system’s  axes  so  that  the 
error  will  be  reduced  or  eliminated.  In  order  to  perform 
this  task,  a  real-time  model  of  the  contour  error  as  well 
as  an  effective  compensating  control  law  should  be 
employed. 

Since  the  mathematical  treatment  of  a  six  DOF 
platform  is  rather  very  complex  [5-7],  the  above 
methodology  will  be  demonstrated  on  a  two  DOF 
platform.  A  two  DOF  platform  differs  from  the 
conventional  six  DOF  platform  in  its  upper  plate,  which 
is  reduced  to  a  mass  point  located  at  the  revolute  joint 
that  connects  its  two  legs.  Although  some  characteristics 


of  conventional  platforms  may  not  be  revealed  using 
this  two  DOF  platform,  the  results  of  this 
investigation  are  still  applicable. 

2.  Two  DOF  Platform  Description, 
Kinematics  and  Control 

The  structure  of  a  2-DOF  platform,  shown  in  Figure 
1,  is  a  triangle  where  one  side,  called  base,  has  a  fixed 
length  and  attached  to  ground.  The  other  two  sides  are 
actuated  by  hydraulic  cylinders  and  are  called  legs. 
The  legs  are  connected  to  ground  and  to  each  other  by 
revolute  joints.  The  platform  is  vertical  and  as  such 
subjected  to  gravity. 


Figure  1.  2-DOF  platform  Architecture 

The  base  of  the  2-DOF  platform  is  aligned 
with  the  X  axis,  while  the  actuation  legs  are  in  the 
XY  plane.  The  origin  of  the  coordinate  system  is 
chosen  at  the  midpoint  of  the  base.  The  legs  Lx  and 
L2  move  P(xyy)  in  2-DOF  space. 

The  inverse  kinematics  is  given  by: 


V 

sj(b-x)2  +  y2 

UJ 

fb  +  xf  +  y2 
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The  forward  kinematics  is  given  by: 


l\  -  L\ 


4  b 

V(4 -(46*  +  Z,?  -Z,22)2 
4  b 


(2) 


The  differential  relation  between  leg  length  and  the 
position  P(x,y)  is  given  by 


'dLt' 

x  -  b 

j-. 

1 - 

dx 

_< dLt 

x  +  b 

y 

dy_ 

Li 

l2 

Denote  the  following  position  vectors:  Jr  a 
position  vector  of  a  point  on  die  planned  contour;  P 
a  position  vector  of  the  corresponding  point  on  actual 

contour;  and  P*  a  position  vector  of  a  point  on  the 
planned  contour  which  is  closest  to  P . 

The  following  error  vector,  E ,  is  given  by: 

E-P*-P  (5) 

The  contour  error  vector,  Er ,  is  given  by: 

e,=p;-p=e-(p'-p;)  (6) 


Each  axis  is  implemented  by  a  hydraulic  cylinder 
and  controlled  by  a  PID  controller  the  same  as  described 
in  [9J.  ITAE  (the  Integral  of  Time  multiplied  by 
Absolute  Error)  performance  index  was  selected  for  die 
design  of  the  PID  controller.  The  transfer  function  of 
each  leg  is  in  die  form: 


M-v) 


w)- 


Ft(s)  i=  1.2  (4) 


W(s)  '  W(s) 
where:  A*  The  reference  position  of  each  leg 

A.  The  leg  displacement  of  each  leg, 


A]  The  external  disturbance  to  each  leg, 

W’s  Polynomials  in  s 


fS  ^  A 

Let  V  and  V  be  the  unit  vectors  of 
velocities  associated  with  P  *  and  P  respectively, 
and  V  is  the  unit  vector  of  velocity  associated  with 
die  chord  P*  —P* .  Since  die  contour  error,  Er ,  is 
normal  to  the  planned  contour,  the  direction  of  the 
chord  P*  —  P*  and  die  tangent  of  the  planned 
contour  at  A,*  are  almost  the  same  for  a  small 
sampling  segment.  Thus,  the  contour  error,  Er ,  is 

almost  nonnal  to  P*  -  P* ,  and  therefore  P*  -  P* 
can  be  approximated  by  the  projection  of  the 

following  error,  E ,  along  die  unit  vector  V  : 


3.  Contour  Error  Model 

The  contour  error  is  the  deviation  between  the 
required  and  the  actual  contour  produced  during  die 
machining  process  |9-1 1  ].  The  contour  error  for  an 
arbitrary  curve  is  illustrated  in  Figure  2. 


Fig.  2.  Contour  error  for  an  arbitrary  curve 


P*  -P*  *(E-V)V  (7) 

Then,  Hq.  (II)  can  approximated  as  follows. 

Er*E-{E-V)V  (8) 

Since  displacement  during  one  sampling  time 
of  the  controller  is  very  small,  and  the  difference 

A  ^  -**  fS. 

between  V  and  V  is  very  small  too.  Also  V  is  not 
available  in  real  time,  therefore  by  replacing  V  by 
V  ,  Eq.  (8)  becomes 

Er  *E-{E-V')V'  (9) 

4,  Cross-Coupling  Controller  Design 

From  kinematics  viewpoint,  die  contour  error 
of  a  platform  must  be  reflected  as  die  deviation  of  all 
the  axes  from  their  desired  displacements.  As 

discussed  above,  the  Jacobean  matrix,  J p ,  provides 

die  relationship  between  die  motions  in  the  world  and 
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the  joint  coordinate  systems.  Thus  the  cross-coupling 
control  law  can  be  expressed  as: 


s  =  KcJpEr 


6  =  [f*,  £2]t  The  cross-coupling  control  signal 


Kc  Kc 

Kc  =  '  11  12  The  cross-coupling  controller 

a  _  a  _ 

c2i  cn  _ 


The  block  diagram  of  the  cross-coupling 
controller  for  a  2-DOF  platform  is  shown  in  Figure  3. 


E=I\x,y)-P\x\y) 

where 


V  X 

c,y)  = 

y 


l\  -  l\  Vr 

4  b  4  b 


*  ~\  r+2  *2  /—,♦ 
r' (*',/)=  .  =  — ^ 

y  4b  4b 


T  =  (4bLi)2  - (4b2  +L]-L22y 

r'  =  (4bL’>y  -  (4b2 + l1  -  L’ly 


Fig.  3.  Architecture  of  2-D  cross-coupling  controller 

To  determine  a  relationship  between  the  PID 
gains  and  the  cross-coupling  controller  gains,  it  is 
assumed  that  the  external  force  and  internal  dynamic 
force  are  negligible  (omitting  the  second  term  in  Eq.  4): 

W,(s)T.,.  W,(s) 

WsWml‘(s)'wUs,(s>  12  <m 

where  W3(s)  is  a  polynomial  in  s. 

At  tliis  point,  there  is  a  need  to  express  G]  and 

e2  and  their  derivatives,  depends  on  tlie  order  of  W(s), 
in  tenn  of  leg  variables  in  order  to  achieve  an  explicit 
transfer  function  between  tlie  reference  leg  length  and 
tlie  system  output  (actual  leg  length).  Substitute  Eq.  9  in 
Eq.  1 1  yields: 


KcJp(E-(EV')V')  (12) 


Tlie  expression  for  s  and  its  derivatives  in  term  of  leg 
variables  Li  (/  =  1,  2)  is  obtained  by  solving  ioxE  as 
shown  in  Figure  2. 


^=r/M 


y  J  i;(462  -T I  -t-Z.%)  L\(AE  +l1  -El)  L\ 

2iV F7  lb VP  J 


The  following  error,  E ,  is  linearized  and  high 
order  terms  are  neglected.  The  contour  error  is 
expressed  as: 


=  =Cp(L-L*) 

L£2] 


where  Cp  is  a  matrix  of  terms  which  are  function  of 
tlie  pose  and  the  velocity  of  the  platform  as  well  as  Kc 
.  Cp  is  selected  by  choosing  the  poles  and  zeros  of  the 
system  in  order  to  insure  a  certain  performance. 

5.  Simulation  Results 

Two  trajectories  were  used  for  the  simulation: 
a  5”  square  and  a  circle  with  10”  diameter. 

The  cross-coupling  controller  gain,  Kc ,  varies 

along  tlie  trajectory,  but  in  these  simulations  an 
average  constant  value  is  used: 


4.249  -0.415 

-0.782  6.068 


The  mismatched  system  is  assumed  to  have  deviation 
of  ±2%  in  gain  and  ±%5  in  natural  frequency. 
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(a) 


(b) 

Fig.  4.  The  contour  error  for  mismatched  system  (a 
-  Square,  b  -  Circle) 

Figure  4  illustrate  the  effect  of  the  cross-coupling 
controller  on  the  contour  errors.  As  shown,  the  contour 
error  along  the  circular  trajectory  was  reduced  by  about 
30%,  and  by  50%  along  the  square  trajectory. 

6.  CONCLUSION 

The  cross-coupling  controller  was  designed  by 
giving  the  contour  error  model  and  cross-coupling 
control  law.  Since  the  cross-coupling  controller  is 
related  to  the  real-time  contour  error  feedback,  the 
cross-coupling  controller  gain  is  closely  related  to  the 
position  and  velocity  of  the  platform  along  the 
trajectoiy.  The  average  values  should  be  chosen  for 
constant  controller  gain. 

The  simulation  results  illustrate  hat  the  cross-coupling 
control  system  has  the  ability  to  substantially  reduce  the 


contour  errors.  Thus,  the  application  of  this  method  to 
a  six  DOF  platform  is  very  promising. 
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Abstract 

This  paper  presents  the  implementation  of  an  intelligent 
position  controller  for  a  PMSM  drive  based  on  a  last 
generation  pC  DSP  (TMS320F24x)  suitable  for 
multiple  axes  motion  control  systems .  The  problem  of 
path  generation  with  variable  target  position  and 
movement  constraints  is  considered  and  solved  in  real¬ 
time  by  the  same  drive  controller .  Specific  profiles  for 
the  torque  and  speed  based  on  parametric  sinusoidal 
acceleration  are  used \  On  the  fly  changes  of  target 
position  as  well  as  of  other  parameters  of  the  movement 
are  allowed  and  real-time  calculation  and  tracking  of 
the  new  position  path  are  done.  Experimental  results 
showing  the  system  performance  for  different  movement 
characteristics  are  presented. 

1.  Introduction 

Motion  control  systems  exist  in  all  classes  of 
machines  especially  in  those  for  the  execution  of  co¬ 
ordinated  motion  in  two  or  more  axes,  such  as  industrial 
robot  manipulators,  packing  machines,  etc..  The  motion 
control  of  a  machine  or  a  system  consists  in  principle  of 
the  following  tasks:  optimised  speed  control  of  the  axis, 
position  control  in  every  axis,  path  generation  for  each 
axis  and  co-ordination  of  the  motion  among  the 
axes  [1]. 

With  regard  to  the  position  control,  different 
problems  arise  depending  on  the  kind  of  motion  to  be 
performed  (kind  of  target,  speed,  number  of  axes),  the 
required  dynamic  and  precision.  One  of  the  most 
difficult  situations  is  certainly  the  positioning  with 
variable  target  position  and  movement  constraints.  A 
typical  case  is  represented  by  a  packing  machine  which 
uses  a  thin  PVC  film  to  pack  objects  of  different  sizes. 
This  is  an  example  of  multiple  axes  motion  control 
system  where  the  speed  and  position  of  the  target  are 
variable  (depending  on  the  size  and  weight  of  the  object 
to  be  packed).  Smooth  speed  changes  are  needed  in 
order  to  limit  the  maximum  jerk  and  to  avoid  the 


breaking  of  the  film.  Thus  specific  profiles  for  the 
torque  and  speed  must  be  generated  in  real-time 
depending  on  the  characteristics  of  the  movement,  that 
is  target  position  and  movement  constraints  (limitations 
on  mechanical  speed  and  accelerations  reached  during 
the  movement).  Moreover  on  the  fly  changes  of  target 
position  as  well  as  other  parameters  of  the  movement 
should  be  allowed  and  real-time  calculation  and 
tracking  of  the  new  position  path  should  be  done. 

One  of  the  profiles  that  could  be  used  to  satisfy  the 
previously  stated  needs  is  the  one  which  uses  a 
parametric  sinusoidal  acceleration.  In  this  case  smooth 
speed  and  position  following  a  cycloidal  trajectory  are 
obtained  and  different  paths  can  be  generated  by 
varying  both  the  peak  value  and  the  period  of  the 
acceleration. 

Until  now  the  classical  motion  control  system 
structure  allocated  the  path  generation  task  in  an 
intelligent  master  controller  and  considered  the  drives  as 
actuators  being  able  to  control  the  position,  the  speed 
and  the  torque  in  the  corresponding  axis  [2]  [3].  This 
leads  to  unavoidable  problems  when  complex  paths  for 
multiple  axes  are  being  generated.  Particularly,  both  the 
demand  for  real-time  calculation  and  the  requirements 
for  high  communication  throughputs  of  the  master 
increase  dramatically.  Fast  and  accurate  motion  control 
cannot  be  achieved  without  the  recourse  to  improved, 
high  performance  control  systems. 

With  the  introduction  of  pC  DSPs  (Digital  Signal 
Processors)  dedicated  to  control  of  electrical  drives,  the 
task  of  path  generation  can  be  performed  by  the  same 
drive  controller  in  real-time.  Consequently  the  task  of 
co-ordination  of  the  different  axes  can  be  executed  by  a 
low-cost  general  purpose  pC. 

In  this  paper  the  implementation  of  an  intelligent 
position  controller  for  a  PMSM  based  on  a  last 
generation  pC  DSP  (TMS320F24x)  is  presented.  The 
problem  of  cycloidal  path  generation  with  variable 
target  position  and  movement  constraints  is  considered 
and  solved  in  real-time  by  the  same  drive  controller.  All 
the  parameters  of  the  drive  system  can  be  changed  in 
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real-time  through  a  standard  RS-232  serial  link. 
Experimental  results  showing  the  system  performance 
for  different  movement  characteristics  are  presented. 


2.  Path  generation  problem 


A  one  axis  generic  movement  could  be  decomposed 
in  an  acceleration  phase  (A),  a  constant  speed  phase  (C) 
and  a  deceleration  phase  (D)  (see  Figure  1). 

By  assuming  for  phases  A  and  D  a  parametric 
sinusoidal  acceleration,  it  is  possible  to  establish  the 
following  relations: 
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where  different  time  axes,  have  been  considered  for 
simplicity  in  each  phase.  In  the  previous  relations,  \ 


and  tsi  (i  =  A,  C,D)  are  a  set  of  parameters 
characterising  the  movement,  whose  significance  is 
indicated  in  the  list  of  used  symbols. 

Analitically  it  is  possible  to  relate  these  parameters 
to  the  basic  physical  characteristics  of  the  movement 
( amaxA  >  amaxD  and  vmax  )  as  follows: 
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Moreover  the  position  increments  that  is  possible  to 
obtain  in  each  phase  are  given  by: 


A9^  =hA  +2hA - - -  (4) 

vmax  ~v0 

A6C  =  hc  (5) 

A0d  -hD  (6) 

and  the  total  positions  increment  that  is  possible  to 
obtain  by  the  sequencing  of  the  phases  A,  C  and  D  is 

A0  =  +  A0C  +  A0d  -hA  +  2hA - — - +  +  hD  (7) 


Hence  a  possible  formulation  of  the  path  generation 
problem  is  to  find  out  the  0,Y*j,  Vi(t)  and  at(t) 
profiles  (in  term  of  the  parameters  ht ,  tsi )  in  order  to 
have  the  total  movement  A0  (refer  to  Figure  1  and  (7)) 
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equal  to  a  target  A0* .  Input  parameters  of  this  problem 
are:  the  initial  condition  of  speed  v0i  the  assumed 
constraints  on  the  acceleration  ( amaxA  )  and  deceleration 
( amaxD )  anc*  the  maximum  allowable  speed  (v^o) 

during  the  movement.  The  situation  is  resumed  in 
Figure  2. ' 


AO*  with  respect  to  the  position  increment  obtained  by 
using  only  phases  A  and  D  (tsC  =  0 )  and  the  maximum 
allowed  value  of  the  speed  (vmax  =  vmax0).  This 
increment  is  given  by 

A®  ad  =  =  hA  +  2  hA - - —  +  hD  (8) 

V max  ~~  ^0 


)M0 

jec(d 


We  think  it  right  to  point  out  that  amaxA  and  amaxD 

are  the  exact  values  of  the  maximum  accelerations 
during  phases  A  and  D  while  vmax0  is  an  upper  bound. 
Thus,  depending  on  the  particular  movement 
characteristic,  this  limit  should  be  reached  or  not.  In  this 
sense  vmax  is  considered  an  output  parameter  of  the 
problem. 

Moreover  depending  on  the  sign  of  the  initial  speed 
v0  and  on  target  position  A0* ,  four  kinds  of  movement 
are  possible,  as  it  is  summarised  in  Table  1 . 


Table  1 .  Possible  kind  of  movements. 


Positive  movement  without 
direction  inversion 
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Negative  movement  without 
direction  inversion 

> 

CD 

* 

A 

O 

o 

A 

o 

Positive  movement  with 
direction  inversion 

AG*  >0 
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Negative  movement  with 
direction  inversion 

AG*  <0 

o 

A 

o 

3.  A  solution  of  path  generation  problem 

The  general  solution  of  the  previously  stated  problem 
exist,  is  unique  and  will  be  given  in  the  case  of  a 
positive  movement  without  direction  inversion.  It  is 
easy  to  extend  the  obtained  results  to  the  other  three 
cases  indicated  in  Table  1. 

We  can  firstly  notice  that  the  solution  of  path 
generation  problem  depends  on  the  value  of  the  target 


If  the  target  A0*  is  equal  to  A0^D ,  then  the  problem  is 
simply  solved  by  using  the  values  of  ht  and  tsi 
(i  =  A,D)  given  by  (1)  and  (3). 

The  situation  is  more  complex  when  A0*  >  AQad  or 
A0  <A0^d. 

In  the  first  case  the  values  of  vmax0 ,  amaxA  and 
amaxD  are  not  high  enough  to  allow  the  required 

position  increment  to  be  generated  without  introducing 
a  constant  speed  phase.  Hence,  by  assuming 
vmax  ~vmax0’  position  increment  that  must  be 
introduced  during  this  phase  is 

he  -  A0  -  =  A0  -  hA  +  2 hA  - - f  hD  (9) 

its  application  time  being 
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lsC  ~ 

V  V  V 
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In  the  second  case  the  position  increment  that  is 
possible  to  obtain  by  using  only  phases  A  and  D  and  the 
maximum  speed  (vmax  =  vmax0 )  is  greater  than  the 
required  one.  Then  the  maximum  value  of  speed 
reached  during  the  movement  must  be  properly  reduced 
to  a  value  vmax  <vmax0  small  enough  to  satisfy 

A0*  =  A0^d  .  By  substituting  in  (8)  the  values  of  hA 
and  hD  given  by  (1)  and  (3)  and  imposing 
A0*  =  A0^d  ,  it  is  possible  to  obtain: 


/-rt^maxAamaxD'  ^  ^^maxD^O  /  i  i  \ 

Vmax  =  J— -  (J1) 

V  amaxA  amaxD 

It  could  be  noted  that  (1 1)  is  independent  on  the  sign 
of  v0  and  always  gives  a  positive  value  for  . 

The  last  consideration  allows  us  to  unify  the  solution 
of  path  generation  problem:  we  always  evaluate  vmax 
given  by  (11)  but  we  limit  this  value  to  vmax0  in  the 
case  vmax  >vmax0.  The  values  of  h{  and  tsi  are  given 
by  (1),  (3),  (9)  and  (10).  Figure  3  shows  the  block 


90 


diagram  of  the  algorithm  solving  the  path  generation  4#  Control  scheme 
problem  in  the  considered  case. 


:  1 

®  no  need  for  constant 
speed  phase  (C)  ^ 

vmax  ^  VjnaxO 

(2)  F  I  *sfl| 

1 7  constant  speed  phase  * 

(C)  is  needed 

Yes(2) 

in  case  (1)  ^ 

hc  —  0  t  —  0 
in  case  (2) 

hC  >  0  =>  'jC  >  0 


Calculate  hA,tsA,  hc<3>,  tjv  h[),  r 


Cend  Profile  calculation 

Figure  3.  Solution  of  path  generation  problem  in  case  of 
positive  movement  without  direction  inversion. 


The  PMSM  based  drive  controller  is  shown  in 
Figure  4.  It  is  based  on  the  field-oriented  control 
principle  arranged  in  the  dq  rotating  frame  aligned  with 
the  rotor  flux.  The  motor  currents  in  the  two-phase 
stator  reference  frame  aft  are  calculated  by  the 
measurement  of  two  of  the  actual  phase  currents  and  a 
three-phases  to  aft  transformation.  Hence,  the  dq 
components  are  obtained  using  the  rotor  position 
information.  The  mechanical  speed  and  the  quadrature 
ig  component  are  controlled  respectively  to  the 

reference  values  given  by  the  position  and  speed 
controllers,  whilst  the  direct  id  component  is  controlled 
at  zero  in  order  to  minimise  the  current  vs.  torque  ratio 
of  the  motor.  The  outputs  of  the  current  controllers, 
representing  the  voltage  references,  are  then  impressed 
to  the  motor  using  the  Space  Vector  Pulse  Width 
Modulation  technique  (SVPWM),  once  an  inverse 
transformation  from  the  rotating  to  the  fixed  stator 
reference  has  been  performed.  Position  path  feeding  the 
position  regulator  is  generated  starting  from  the  target 
position  and  taking  into  account  the  constraints  on 
mechanical  speed  and  accelerations.  Almost  all  the  used 
controllers  are  standard  PI  regulators  except  for  the 
position  controller  where  only  a  proportional  regulator 
has  been  considered  and  a  feed  forward  component  has 
been  added  to  increase  the  dynamic  response  of  the 
system. 

5.  Control  hardware 

The  control  hardware  used  to  implement  the 
intelligent  position  controller  is  based  on  a  Texas 
Instruments  TMS320F240  pC  Digital  Signal  Processor 
control  board  dedicated  to  control  of  electrical  drives 
which  also  integrates  an  IGBT  based  Intelligent  Power 
Module  (Figure  5). 
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7.  Experimental  results 
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Figure  5.  Control  hardware. 

A  host  PC  is  used  to  run  both  the  DSP  development 
and  debugger  tools  and  to  simulate  the  behaviour  of  a 
master  controller  in  a  motion  control  system.  A  scope 
has  been  used  to  display  in  real-time  the  variables 
calculated  inside  the  control  algorithm  by  means  of  a  4 
channels  digital  to  analog  interface  mapped  on  the  I/O 
addressing  space  of  the  pC  DSP. 

The  pC  is  provided  with  544  words  (16-bit)  of  data 
Dual  Access  RAM  (used  to  store  control  variables)  and 
1 6Kwords  of  program  Flash-EEPROM  (used  to  store 
control  program  and  look-up  tables)  thus  allowing  a 
single-chip  solution  for  motor  control. 

6.  Software  architecture 

The  control  software  has  been  designed  according  to 
the  multi-process  structure  reported  in  Figure  6. 


Figure  6.  Multi-process  software  organisation. 


The  control  process  is  executed  every  sampling 
period  (100  ps)  performing  both  motor  control  and  real¬ 
time  path  generation  based  on  the  parameters  and 
reference  values  received  through  the  concurrent  serial 
communication  process.  A  power  drive  fault  process  has 
also  been  included  to  protect  the  power  module.  All  the 
software  has  been  implemented  using  assembly 
language  and  fixed  precision  numerical  representation. 


Figure  7  to  Figure  10  present  some  of  the  results 
obtained  by  using  the  proposed  system.  The  base  values 
of  3000  rpm  for  speed  and  3000  rad/s2  for  accelerations 
are  assumed.  Figure  7  shows  speed  and  position  during 
a  movement  where  the  speed  limitation  vmax0  is 
reached,  that  is  phase  "C"  (constant  speed  phase)  is 
present.  Figure  8  shows  the  same  variables  during  the 
same  movement  where  the  speed  limitation  vmax0  has 
been  increased  to  a  value  high  enough  that  phase  "C"  is 
not  present.  Finally  an  on  the  fly  target  position  change 
without  movement  inversion  has  been  considered  in 
Figure  9  and  Figure  10.  Speed,  position,  quadrature 
component  and  phase  currents  are  shown. 


Figure  7.  Speed  (up)  and  position  (down)  (speed  limitation  is 
reached  and  constant  speed  phase  is  present  (A0*-0*- 
00  1471,  VmaxO  0-5,  QmaxA — 0.5,  3tmaxD — 0.35) 


Figure  8.  Speed  (up)  and  position  (down)  (speed  limitation  is 
not  reached  and  constant  speed  phase  is  not  present) 
(A0*=0*-0O=147C,  vmaxO=0.7,  amaxA~ 0.5,  ***0=0.35). 
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Figure  9.  Speed  (up)  and  position  (down)  during  an  on  the  fly 
target  position  change  (A0i*=0i*-0o=6tt,  A02*=02*- 
00=1471,  VmaiO—0.7,  3maxA=0.St  3maxD—0£). 


Figure  10.  q-axis  current  (up)  and  phase  current  (down) 
during  an  on  the  fly  target  position  change  ( A0  /=0 1 *- 
00—671,  A02  —02  “00  —  1 471,  VmaxO~~ 0*7,  3maxA~~ 0.5, 
<W0=O*6). 


7.  Conclusions 

In  this  paper  the  implementation  of  an  intelligent 
position  controller  for  a  PMSM  drive  based  on  a  last 
generation  pC  DSP  (TMS320F24x)  has  been  presented, 
suitable  for  multiple  axes  motion  control  systems.  The 
problem  of  path  generation  with  variable  target  position 
and  movement  constraints  is  considered  and  solved  in 
real-time  by  the  same  drive  controller.  Specific  profiles 
for  the  torque  and  speed  are  used  based  on  parametric 
sinusoidal  acceleration.  Experimental  results  showing 
the  system  performance  for  different  movement 
characteristics  are  presented. 
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List  of  used  symbols 

>  x.  (i  -  A, C,D)  \  value  of  jc  during  acceleration 
phase  “A”,  constant  speed  phase  “C”  and 
deceleration  phase  UD  ”  respectively. 

>  t ,  t' ,  t” :  time  references  within  phases  A ,  C  and 
D  respectively. 

>  Qf-j,  v,  f),  af-) :  respectively  position  (rad), 
speed  (rad/s)  and  acceleration  (rad/s2)  profiles 
within  phase  /. 

>  0O/ :  initial  position  within  phase  i . 

>  v0 :  initial  condition  for  speed. 

>  tsi :  time  duration  of  phase  i . 

>  hA :  position  increment  within  phase  A  but 
considering  v0  =  0 . 

>  hc,  hD:  position  increment  within  phases  C  and  D 
respectively. 

>  amaXj  •  maximum  value  of  acceleration  within 
phase  i . 

>  vmax  o :  speed  upper  bound. 

>  vmax :  maximum  speed  reached  within  the 
movement. 

>  0* :  target  position. 

>  A0* :  target  position  increment. 

>  A0 :  position  increment  obtained  within  a  generic 
movement. 

>  A0  AD :  position  increment  obtained  within  a 
movement  which  uses  only  phases  A  and  D. 
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Abstract 

An  objective  of  this  study  is  to  find  out  design 
requirements  for  developing  human  symbiotic  robots, 
which  share  working  space  with  human,  and  have  the 
ability  of  carrying  out  physical,  informational,  and 
psychological  interaction.  This  paper  mainly  describes 
design  strategies  of  the  human  symbiotic  robots,  through 
the  development  of  a  test  model  of  the  robots,  WENDY 
(Waseda  ENgineering  Designed  sYmbiont). 

In  order  to  develop  WENDY,  safety  and  dexterity  of  a 
humanoid  robot  Hadaly-2,  which  was  developed  in  1997, 
are  improved  on.  The  performances  of  WENDY  are 
evaluated  by  experiments  of  object  transport  and  egg 
breaking,  which  requires  high  revel  integration  of  whole 
body  system. 

1.  Introduction 

The  development  of  humanoid  robots  which  can 
support  human  labor  by  several  combined  communication 
channels,  such  as  physical  interaction  and  informational 
interaction,  are  expected  to  be  a  measure  against  labor 
shortages  in  aging  societies.  Such  robots  are  distinctively 
called  "human  symbiotic  robots",  for  they  must  have 
additional  abilities  of  sharing  behavioral  space,  working 
space,  and  thinking  space  with  human. 

In  designing  of  the  human  symbiotic  robots, 
capabilities  of  ensuring  safety  while  interacting  with  human 
must  be  given  top  priority.  Under  this  assumption,  the 
Humanoid  Project  at  the  Advanced  Research  Institute  for 
Science  and  Engineering  (RISE),  Waseda  University  has 
developed  Hadaly-1  [1]  and  Hadaly-2  [2]  as  prototypes  of 
the  human  symbiotic  robots. 

Among  these  robots,  Hadaly-1  was  developed  as  an 
informational  assistant  robot  in  1995.  Hadaly-1  can  realize 
informational  interaction  with  human  by  combining  with 
audio-visual  information,  voice  dialogue,  and  gesture 
motion  using  manipulator.  Hadaly-2  was  developed  in 
1997,  from  motivations  for  improving  physical  interaction 
ability  of  Hadaly-1.  The  Working  condition  of  Hadaly-2  is 
shown  in  Figure  1. 

In  designing  Hadaly-2,  an  anthropomorphic  shape  is 
adopted,  since  the  human  symbiotic  robots  must  work  in 
human's  living  space,  which  is  designed  to  suit  with  daily 
life.  Hadaly-2  can  carry  blocks  in  reference  to  the 


operator's  demands,  by  combing  audio-visual  information, 
voice  dialogue,  and  gesture  motion.  Design  requirements 
and  system  integration  strategy  were  clarified  through  the 
development  of  Hadaly-2. 


Figure  1  Working  Condition  of  Hadaly-2 


Based  on  the  successful  results,  this  paper  aims  to 
construct  a  new  human  symbiotic  robot  named  WENDY 
(Waseda  ENgineering  Designed  sYmbiont).  First,  this 
paper  describes  mechanism  design  strategies  for  improving 
dexterity  of  Hadaly-2.  Next,  the  design  strategies  are 
implemented  into  WENDY.  Finally,  performances  of 
WENDY  are  evaluated  by  experiments,  such  as  picking  up 
object  on  the  floor  and  breaking  eggs. 

2.  Design  Strategy  for  Human  Symbiotic  Robots 
2.1  Assurance  of  Safety  and  Dexterity 

Human-robot  interaction,  which  inevitably  occur  in 
human-robot  symbiotic  situation,  can  be  mainly  classified 
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into  physical  interaction  and  informational  interaction. 

Among  these  two  interaction,  physical  interaction  is 
particularly  important,  because  human-robot  symbiosis 
requires  that  robot  must  directly  contact  and  collaborate 
with  human.  For  realization  of  these  physical  interactive 
tasks,  the  important  technical  point  is  that  manipulator  can 
adapt  to  human’s  motion,  while  ensuring  human’s  safety. 

Based  on  this  consideration,  this  section  shows  the 
design  strategy  for  ensuring  safety  and  dexterity  in  a  human 
symbiotic  robot. 

(1)  Safety 

Many  studies  have  been  conducted  on  the  safety  of 
industrial  robots,  however,  the  working  conditions  of  the 
robots  whose  key  feature  the  symbiosis  with  humans  are 
different  from  those  of  industrial  robots,  precluding  the 
spatial  separation  of  robots  and  humans.  Consequently, 
taking  more  positive  safety  measures  with  the  robot  itself  is 
required.  As  an  approach  for  solving  the  problem,  this 
study  categorizes  the  safety  measures  of  human  symbiotic 
robot  into  two  phases,  such  as  impact  safety  and  after 
collision  safety. 

In  regards  to  impact  safety,  our  previous  study  has 
shown  the  effectiveness  of  a  double  safety  measure,  which 
employs  viscoelastic  surface  cover,  and  reflex  movement 
of  joints.  Concretely,  collision  situations  (subjective  and 
objective  collision)  and  control  modes  of  manipulators 
(joihnt  Impedance  control  and  emergency  stops)  are 
classified,  and  the  effect  which  those  parameters  exert  on 
shock  absorption  is  clarified  by  computer  simulation.  The 
simulation  results  are  shown  in  Figure  2. 

First,  attention  is  paid  to  the  effect  of  emergency  stops. 
In  subjective  collisions,  it  was  recognized  that  emergency 
stops  had  the  effect  of  reducing  both  the  peak  force  and  the 
duration  time  from  the  levels  of  collisions  without  control. 
However,  in  objective  collisions,  the  duration  time  rather 
increased,  in  contrast  to  what  happened  in  subjective 
collisions.  Compliance  control  did  not  have  any  peak  force 
reduction  effects  comparable  with  emergency  stop  in 
subjective  collisions,  but  it  had  effects  in  reducing  the 
duration  time  in  objective  collisions. 

From  these  remarks,  the  authors  think  that  the 
adoption  of  design  and  control  method  in  accordance  with 
functions  of  each  body  part  is  effective  for  improving  total 
performances  of  human  symbiotic  robots. 

In  addition  to  the  measurement  for  impact  safety, 
adaptive  motions  for  realizing  after  collision  safety,  by 
coordinating  whole  limb  systems  are  important  for  carrying 
out  cooperative  tasks  with  human.  In  the  case,  capabilities 
of  object  recognition  by  using  visual  and  tactile 
information  and  coordinated  motion  of  whole  body  are 
required. 

These  remarks  are  reflected  in  the  design  strategy  of  a 
head  and  a  body-vehicle  part  of  WENDY,  shown  in  the 


following  paragraph. 
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(b)  Objective  Collision  Case 
(A  human  unilaterally  collides  with  a  robot) 

Figure  2  Impact  Safety  Effect  by  Control  Modes 
(2)  Dexterity 

In  addition  to  safety  measures,  dexterity  is  important 
for  realizing  human  robot  symbiosis.  For  practical  use, 
human  symbiotic  robots  must  have  capabilities  of  handling 
daily  necessities.  Assistance  of  cooking,  for  example, 
requires  ability  of  handling  of  cookware  and  foods  by  using 
abilities  of  object  recognition  and  coordination  of  whole 
body  motion. 

From  these  remarks,  this  study  regards  that  dexterity  of 
human  symbiotic  robot  can  be  improved  by  realizing 
mobility  of  arm-hand  systems  and  body-vehicle  systems. 
Based  on  this  concept,  Hadaly-2  has  carried  out  simple 
tasks  in  human's  living  space.  As  a  next  stage,  we  aim  to 
advance  dexterity  of  Hadaly-2  by  the  following  design 
strategies. 

For  designing  human  symbiotic  robot,  motion  space  in 
both  horizontal  and  vertical  plane  is  required,  because 
human’s  living  space  has  unevenness.  However,  most  of 
conventional  mobile  manipulators  that  contain  Hadaly-2 
are  designed  in  consideration  of  mobility  in  horizontal 
plane.  A  measure  on  this  problem  is  utilized  in  the 
mechanism  design  of  waist  and  vehicle. 

In  addition  to  mobility,  performance  of  arm-hand 
system  must  improve  in  order  to  perform  the  several  tasks 
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which  human  is  doing  in  the  daily  life.  In  the  case,  abilities 
of  object  recognition  and  coordinated  control  of  arm  and 
hand  becomse  more  important. 

In  reference  to  the  design  concepts,  limb  mechanisms 
of  Hadaly-2  are  improved  focusing  on  the  integrity  of  the 
whole  body  system.  The  following  chapters  describe 
concrete  design  strategies. 

2.2  Mechanisms  of  Head  and  Neck 

A  head  subsystem  of  human  symbiotic  robots  should 
act  as  not  only  visual  information  sensor,  but  also  interface 
for  collaborative  tasks  with  human.  For  example,  it  was 
confirmed  that  operators  infer  the  working  condition  of 
Hadaly-2  from  the  motions  of  its  eyes.  So,  this  study 
improves  resolution  of  visual  information  processing,  by 
developing  a  new  head  mechanism  that  employs  two  color 
CCD  cameras  for  detecting  objects  and  humans.  An 
assembly  drawing  of  the  seven  D.O.F.  head  mechanisms  is 
shown  in  Figure  3. 
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Figure  3  Head  Mechanisms 


The  head  mechanism  consists  of  a  three  D.O.F.  eye 
mechanism  and  a  four  D.O.F.  neck  mechanism.  In  regard 
to  the  eye  part,  locations  of  the  CCD  cameras  are  important, 
because  the  distance  between  two  eyes  affects  both  image 
processing  and  human’s  impression.  Two  cameras  are 
separated  as  possible,  under  the  conditions  that  the  vision 
system  can  catch  objects  which  are  located  at  10  [cm]  from 
the  face.  And,  in  order  to  observe  objects  which  are  located 
on  the  floor,  the  neck  mechanism  employs  two  degrees  of 
freedom  in  the  direction  of  flexion  and  extension. 

2:3  Mechanisms  of  Waist 

Waist  mechanisms  are  utilized  for  expanding  the 
working  spaces  in  vertical  plane.  Moreover,  evacuating 
motions  using  redundancy  of  whole  body  are  effective  for 
suppressing  the  impact  forces,  which  generate  in  the  case 
of  collision  between  humans  and  robots.  In  order  to  realize 
the  motions  stable,  a  waist  mechanism  that  employs  three 
degrees  of  freedom  is  developed. 

The  waist  mechanisms  consist  of  one  rotation  axis 
(joint  #1)  and  two  pitching  axes  (joint  #2  and  #3).  By 


rotating  joint  #2  to  the  opposite  direction  of  joint  #3,  the 
waist  can  bend  without  stumbling  over.  Consequently,  the 
end  point  of  the  WAM-10  manipulator  [2]  that  is  attached 
to  the  body  mechanism  can  reach  to  the  point  on  the  floor, 
at  a  distance  of  260[mm]  from  the  rotation  axis  of  the  body. 
An  overview  of  the  developed  waist  mechanism  is  shown 
in  Figure  4. 


Joint  #1 


Figure  4  Waist  Mechanisms 

2.4  Mechanisms  of  Vehicle 

Mobility  in  horizontal  plane  is  necessary  for  human 
symbiotic  robots  to  extend  working  space.  In  that  case, 
however,  stability  of  the  mobile  mechanisms  is  the  most 
essential,  because  human-robot  collaboration  requires 
insurance  of  operator’s  safety.  From  these  remarks,  this 
study  adopted  wheeled  vehicle  mechanisms  among  many 
forms  of  the  mobile  mechanisms. 

Gyroscope  \ . r  _ I 
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Figure  5  Vehicle  Mechanisms 
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Figure  5  shows  the  assembly  drawings  of  the 
developed  two-wheeled  vehicle  mechanisms.  In  designing 
the  mechanisms,  the  eccentric  casters  with  suspensions  are 
located  at  each  vertex  of  rectangle,  since  vehicle  must 
climb  over  bumps  without  stumbled  over. 

The  developed  vehicle  has  enough  power  for  dragging 
the  cables  which  connects  the  internal  actuators  to  the 
external  controllers,  by  utilizing  combination  of  high 
power  motor  (300[W],  AC)  and  harmonic  drive  gear 
(reduction  ratio  1/33).  Specification  of  the  vehicle  is  as 
follows;  maximum  velocity,  5 [km/h];  maximum  payload, 
120[kg];  climbable  height,  l[cm];  weight,  53[kg]; 
maneuvering  error,  1  [%]. 

2.5  Mechanisms  of  Arm  and  Hand 

Manipulators  of  the  human  symbiotic  robots  are  utilized 
as  interface  between  humans  and  robots,  while  carrying 
cooperative  tasks.  So,  safety  and  adaptive  manipulation  is 
essential  for  constructing  human  symbiotic  robots.  In 
consideration  of  these  problems,  the  authors  have  developed 
the  WAM-10  (Waseda  Automatic  Manipulator  #10),  and 
design  requirements  for  carrying  out  several  tasks  which 
involves  physical  interactions  between  robots  and  humans 
were  clarified  [3].  In  this  study,  the  hand  mechanisms  of 
the  WAM-10  are  improved  by  focusing  on  dexterity  of 
pinching  motion. 


Finger  "Tip  Friction 


Figure  6  Adjustment  of  Friction  by  Pressure  Control 

Human  changes  the  poses  of  fingertips  with  adapting 
to  the  shapes  of  objects.  This  motion  can  be  regarded  as 
control  of  friction  by  adjusting  pressure,  which  is  shown  in 
figure  4.  Coefficient  of  adhesive  friction  //  is  changeable 
by  adjusting  areas  of  contact,  in  reference  to  following 
equation  [4]. 

'  (l) 

W  W  p 

6  =  tan-1  n  (2) 


where,  A  :  contact  Area,  W :  contact  force,  s  ,  sq  : 
shearing  stress,  p\  pressure,  a,/3:  constants,  and  0\ 
acute  angle  of  friction  cone. 

From  these  remarks,  this  study  focuses  on  the  design 
parameters  of  fingertips,  such  as  shape,  structure,  material, 
and  size.  For  continuously  adjustment  ability  of  contact 
conditions  from  point  to  area,  curved  shapes  of  the 
fingertip  are  designed  by  using  rubber  and  bone.  A  nail  is 
also  attached  to  the  fingertip,  in  order  to  realize  extremely 
high  pressure  on  the  contact  point. 

3.  Construction  of  WENDY 

3.1  Mechanical  Hardware  and  Functions 

By  combining  the  head,  the  waist,  the  vehicle  and  the 
arm-hand  system,  a  limb  mechanism  of  WENDY  (Waseda 
ENgineering  Designed  sYmbiont)  is  constructed.  An 
assembly  drawing  and  an  overview  are  shown  in  Figure  7, 
and  8.  Specifications  are  arranged  in  Table  I . 
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Figure  7  Assembly  Drawing  of  WENDY 

WENDY  improves  Hadaly-2  in  the  human  interaction 
ability,  since  whole  body  motion  control  and  visual- 
auditory  information  processing  are  implemented  into 
design  concept.  In  the  system,  image  processing  part  is 
utilized  for  measuring  the  relative  position  from  the  base  of 
WENDY  to  the  target  objects,  by  using  stereo  vision.  In  the 
process,  shapes  of  the  objects  is  recognized  by  visual 
images  (512 X480[pixel],  8X3[bit]RGB)  through  GPB-J 
board,  which  is  on  the  market.  The  limb  mechanisms  of 
WENDY  have  the  possibility  of  handling  small  sized 
object,  which  is  located  in  human’s  living  space.  And  it  can 
turn  around  on  the  position  while  maintaining  the  end  point 
positions  of  manipulators. 

3.2  Control  Systems  and  Algorithms 

For  development  of  control  systems  of  WENDY, 
control  architectures  of  Hadaly-2  is  succeeded.  In  the 
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system,  the  host  computer  generates  behavior  codes  by 
using  visual,  and  voice  dialogue  information,  through 
CAAD  (Creating  Actuator  Angle  Data)  operating  system. 


Figure  8  Overview  of  WENDY 


The  CAAD  operating  system  was  developed  for  arm- 
hand  coordination  control  of  Hadaly-2.  The  system  is 
similar  to  finite  state  machine  or  petri-net.  Therefore, 
environmental  conditions,  such  as  scenarios  of  dialogues 
and  arrangements  of  objects,  must  be  previously  measured 
and  installed  into  the  behavior  models.  Each  motion  is 
created  by  using  graphical  user  interface,  and  implemented 
into  the  CAAD.  Then,  the  CAAD  produces  the  limb 
motions  in  reference  to  the  environmental  situations,  by 
combining  the  several  motion  patterns  that  involve  gestures. 
Each  body  control  system  is  received  the  behavior  codes 
from  the  host  computer  through  DP- RAM. 

In  this  system,  mobility  are  produced  by  combining 
straight  line  and  rotation  of  the  vehicle. 

4.  Evaluation  Experiments 

4.1  Image  Processing  and  Head  Motion 

For  evaluation  of  the  head  system,  experiments  on 
visual  image  processing  and  arm-head  coordination  are 
conducted,  respectively.  In  regard  to  the  visual  image 
processing,  recognition  of  objects  shape,  and  measurement 
of  distance  are  adopted  for  evaluation  items.  Contents  of 
the  experiments  are  listed  as  follows; 

(l)Mission:  Search  a  target  object  from  any  number  of 
objects,  which  are  located  on  the  chair.  Then,  measure 
distance  between  the  object  and  the  base  of  WENDY. 


(2)Environment:  Objects  are  located  at  1.0[m]  (x-axis) 
from  the  base.  Background  color  is  blue.  Sizes,  shapes 
and  its  color  are  previously  defined,  (object  #1;  sphere; 
white,  object  #2;  cube;  red,  object  #3;  pyramid;  yellow) 

From  the  experimental  results,  it  is  confirmed  that  the 
ratio  of  success  of  object  recognition  is  approximately  90 
[%],  the  measuring  errors  are  5  [cm]  (z-axis)  and  1  [cm]  (x, 
y-axis),  and  the  time  for  processing  is  250  [ms]. 


Table  1  Specifications  of  WENDY 


DOF 

Finger 

Arm 

Neck 

Eye 

Waist 

Vehicle 

Total 

13X2 

7X2 

4 

3 

3 

2 

52 

Size 

Height 

1471.5 

[mm] 

Width 

930.0 

Depth 

756.0 

Weight 

Hand 

2.0X2 

[kg] 

Arm 

25.0X2 

Neck  +  Eye 

8.0 

Waist 

46.0 

Vehicle 

53.0 

Total 

161.0 

Next,  servo  responses  of  the  head  mechanisms  are 
evaluated  by  the  head-arm  coordination  experiments.  In  the 
experiments,  mechanical  compliance  coefficient  of  the 
manipulator  is  set  to  maximum,  and  operator  shakes  the 
manipulator.  The  applied  external  force  is  approximately 
0.5  [Hz]  sine  curve.  The  eyeshot  of  WENDY  is  controlled 
to  follow  the  end  point  of  the  manipulator,  by  calculating 
DK  of  the  arm  and  IK  of  the  head.  The  distance  between 
hand  and  eye  is  set  to  1 .0  [m],  and  the  angle  of  joint  #4  is 
0.0  [deg].  The  experimental  result  on  x  axis,  is  shown  in 
Figure  9.  It  is  confirmed  that  fine  following  motions  of 
head  mechanisms  are  achieved  by  head-arm  coordination. 

4.2  Safety 

The  shock  absorption  cover  is  constructed  according 
to  the  proposed  design  method,  and  it  evaluated  whether 
the  developed  arm  can  satisfy  impact  safety.  In  designing 
of  a  shock  absorption  cover,  a  material  named 
Memoryfoam  M-38,  which  is  excellent  in  shock  absorption, 
is  adopted.  Moreover,  as  conditions  of  a  collision 
experiment,  subjective  collision  case  of  1  [m/sec]  is 
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selected.  The  developed  arm  and  a  measuring  instrument 
that  equips  a  load  cell  are  made  to  collide,  and  time 
histories  of  contact  force  is  measured. 

A  simulation  result  and  an  experimental  result  are  shown 
in  Figure  10  (a)  and  (b).  Since  the  simulation  result  and  the 
experiment  result  are  in  agreement,  it  was  confirmed  that 
the  manipulator  equipped  with  safety  can  be  developed  in 
reference  of  the  proposed  design  strategies. 
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Figure  9  Following  Motion  of  Head  System 


Figure  10  Shock  Absorption  Effect  of  Manipulator 
4.3  Dexterity 

To  evaluate  dexterity  of  WENDY,  object  picking  up 
and  egg  breaking  are  selected  for  experimental  tasks.  In  the 
object  picking  up  task,  WENDY  approaches  to  the  target 
object  by  using  the  vehicle,  and  picks  the  object  up. 

Maneuvering  path  is  generated  from  the  object 


position,  with  combination  of  linear  lines  and  curves.  Size 
of  the  object  is  8  [mm]  in  width,  10  [mm]  in  depth,  20 
[mm]  in  height,  and  100  [g]  in  weight.  The  block  is  located 
at  700  [mm]  in  front,  and  300  [mm]  to  the  right.  The  ratio 
of  success  of  picking  up  object  is  almost  100  [%].  This 
result  shows  that  WENDY  can  realize  accurate  mobile 
manipulation  by  using  whole  body  motion. 

In  regard  to  the  egg  breaking  tasks,  information  from 
force  sensor  is  not  utilized,  because  the  experiment  mainly 
focuses  on  the  evaluation  of  the  mechanical  characteristics 
of  the  fingertips.  The  experimental  setup  is  shown  in  Figure 
11.  In  the  experiment,  motion  patterns  are  previously 
produced.  The  ratio  of  success  becomes  20  [%]  by  using 
the  new  fingertip,  while  the  ratio  of  success  is  0[%]  by 
using  the  conventional  hemispheric  fingertips.  This  result 
shows  that  the  pressure  control  method  using  fingertip  is 
effective  for  improving  the  dexterity  of  handling. 


Figure  1 1  Experiments  of  Egg  Breaking 


5.  Conclusion 

This  paper  proposes  design  strategies  of  human 
symbiotic  robots  which  has  the  possibilities  for  ensuring 
safety  and  dexterity.  By  using  the  strategies,  a  human 
symbiotic  robot,  WENDY  is  developed. 

Effectiveness  of  the  design  strategies  are  confirmed 
from  the  results  of  several  evaluation  experiments,  such  as 
object  picking  up  and  egg  breaking,  which  require  high 
completeness  of  whole  system. 
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Abstract 

One  of  the  major  aims  in  an  automated  polishing  system 
is  to  keep  the  polishing  force  and  hence  the  material 
removal  rate  constant  The  easiest  way  of  achieving  this 
in  robot  assisted  polishing  is  to  use  through-the-arm  force 
control  where  real  time  trajectory  modifications  result  in 
all  robot  axes  being  moved  simultaneously  for  the 
required  fine  end  point  motion.  The  main  problem 
encountered  in  this  approach  is  low  steady  state  accuracy 
and  low  bandwidth  of  industrial  robots .  The  solution  is  to 
use  an  active  end  effector  for  the  fine  manipulation 
required  for  keeping  the  polishing  force  constant  in  the 
presence  of  robot  tracking  errors  while  the  gross  motion 
is  provided  by  the  robot.  Of  the  two  types  of  available 
active  end  effector  technologies  nonprogrammable  active 
end  effectors  are  treated  here  by  considering  two  such 
devices  and  evaluating  them  in  the  context  of  robot 
assisted  material  removal  as  they  offer  cheaper  solutions 
to  programmable  ones  even  though  they  are  less 
adaptable  to  changes  in  operating  conditions. 

1.  Introduction 

One  of  the  major  aims  in  an  automated  polishing 
system  is  to  keep  the  polishing  force  constant  since 
experimentally  determined  material  removal  rates  and 
transient  polishing  force  are  seen  to  be  highly  correlated 
[3].  The  easiest  way  of  achieving  this  in  robot  assisted 
polishing  is  to  use  through-the-arm  force  control  where 
real  time  trajectory  modifications  in  world  coordinates  are 
used  to  achieve  the  required  fine  endpoint  motions  by 
moving  all  robot  axes  simultaneously.  The  main  problem 
encountered  in  this  approach  is  low  steady  state  accuracy 
as  industrial  robots  do  not  have  high  positioning  accuracy. 
Also,  the  reliance  on  the  robot’s  own  controller  results  in 
large  time  lags  associated  with  robot  joint  servo  responses 
and  a  large  dead  time  because  of  the  required  coordinate 


transformations,  both  of  which  limit  the  achievable 
bandwidth  in  through-the-arm  force  control. 

The  solution  is  to  use  an  active  end  effector  for  the  fine 
manipulation  required  for  keeping  the  polishing  force 
constant  in  the  presence  of  robot  tracking  errors  while  the 
gross  motion  is  provided  by  the  robot.  Active  end 
effectors  can  be  divided  into  the  two  categories  of 
programmable  and  nonprogrammable  active  end  effectors. 
Nonprogrammable  active  end  effectors  usually  use  a 
pressure  regulator  to  indirectly  control  the  polishing  force 
by  keeping  pressure  constant  in  pneumatic  cylinder(s). 
Programmable  active  end  effectors  use  sensory 
information  from  position  and  force  transducers  which  is 
fed  back  to  a  microcomputer  that  calculates  the  control 
action  to  be  taken  based  on  a  suitable  control  law.  While 
preferable  due  to  their  adaptability  to  changes  in  operating 
conditions,  programmable  active  end  effectors  are  also 
costly  due  to  the  need  for  sensors,  the  associated  signal 
conditioning  circuitry  and  a  control  computer.  Hence,  it  is 
beneficial  to  use  cheaper  nonprogrammable  active  end 
effectors  if  they  can  maintain  satisfactory  performance. 

Two  nonprogrammable  active  end  effectors  are 
evaluated  in  this  paper  in  the  context  of  robot  assisted 
polishing.  The  first  one  which  will  be  abbreviated  as  CFD 
(meaning  constant  force  device)  here  was  designed  to 
provide  real  time  control  of  applied  tooling  force  in 
robotic  deburring  and  surface  conditioning  applications 
[1].  The  pneumatic  circuitry  and  mechanical  parts  of  the 
CFD  are  analyzed  here  in  detail  and  used  in  constructing  a 
mathematical  model.  The  model  parameters  of  the 
pneumatic  circuitry  have  been  identified  previously  using 
maximum  likelihood  estimation  by  [4].  This  identified 
model  is  used  to  demonstrate  the  adequacy  of  the  response 
of  the  pneumatic  circuitiy  of  the  constant  force  device  for 
polishing  tasks  that  are  not  too  demanding.  The  bandwidth 
of  the  mechanical  part  of  the  CFD  is  also  determined  to  be 
adequate  for  polishing  tasks.  Contact  force  measured 
during  polishing  is  given  to  demonstrate  this  adequacy. 
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The  second  nonprogrammable  active  end  effector 
treated  here  also  uses  regulation  of  pressure  in  a 
pneumatic  cylinder.  This  device  is  attached  to  a  SCARA 
type  robot  without  vertical  motion  capability  and  is  called 
robot  polisher  here  [2].  The  robot  polisher  is  useful  for 
fast  polishing  of  large  flat  surfaces  or  surfaces  of  slowly 
varying  curvature.  The  robot  polisher  is  evaluated  here 
and  compared  with  the  CFD  regarding  polishing 
performance. 

The  organization  of  the  rest  of  the  paper  is  as  follows. 
The  introduction  is  followed  by  two  sections  on  the  CFD 
and  the  robot  polisher,  respectively.  A  comparison  of  the 
two  devices  and  comments  on  the  use  of 
nonprogrammable  active  end  effectors  is  presented  in  the 
next  section.  The  paper  ends  with  conclusions. 

2.  The  CFD  constant  force  device 

The  CFD  constant  force  device  uses  a  high  precision 
pressure  regulator  to  indirectly  control  polishing  force  by 
keeping  pressure  constant  in  two  opposing  low  friction  air 
cylinders.  The  difference  of  the  constant  pressures  in  the 
two  cylinders  multiplied  by  the  piston  cross-sectional  area 
and  a  geometric  factor  results  in  the  static  applied  load. 
The  desired  force  level  is  set  by  adjusting  the  pressure 
regulators.  The  CFD  is  termed  a  rotary  constant  force 
device  since  a  counterbalance  mechanism  with  rotational 
bearings  is  used  instead  of  linear  bearings.  A  constant 
force  is  applied  on  the  part  under  static  conditions, 
regardless  of  tool  orientation  as  the  tool  weight  is 
balanced  by  the  counterbalance  weight.  This  device  has 
been  used  in  deburring  inner  holes  on  a  jet  engine  casing 
using  compliant  polishing  wheels  (Graf,  1988). 

The  CFD  utilizes  a  pneumatic  pressure  regulator  with  a 
resolution  of  0.0045  psi  to  regulate  pressures  in  its  two 
opposing  cylinders.  The  reason  for  using  two  instead  of 
one  cylinder  is  to  be  able  to  use  the  device  upside  down 
as  well.  Each  single  acting  cylinder  has  a  glass  body  for 
low  friction  and  ball  and  socket  joints  on  the  piston 
connecting  rod  to  ensure  linear  piston  motion. 

2.1.  CFD  pneumatic  model 

The  pneumatic  circuit  for  a  cylinder-regulator 
combination  of  the  CFD  is  shown  schematically  in  Figure 
1.  The  derivation  of  the  pneumatic  model  of  this  cylinder, 
pressure  regulator  combination  has  been  given  in  [4]  and 
is  shown  in  block  diagram  form  in  Figure  2.  pd  and  p  are 
the  desired  and  actual  cylinder  pressures  in  this  figure. 
There  is  a  time  lag  with  time  constant  t  due  to  the 
regulator’s  pressure  sensing  feedback  path.  The  switch 
which  chooses  the  charging  or  discharging  path  with  a 
consequent  change  in  the  flow  coefficient  from  Kv  to  Ksv 
and  vice  versa  is  the  only  nonlinearity  in  this  model,  all 


other  nonlinearities  having  been  simplified  using  small 
signal  linearization.  Robot  tracking  error  and  workpiece 
surface  irregularities  enter  the  feedback  loop  through  the 
disturbance  input  Xc. 

The  pneumatic  model  parameters  Kv,  Ksv,  Kx  and  t  for 
the  regulator,  cylinder  combination  of  the  CFD  have  been 
identified  in  [4]  by  comparing  experimental  step 
displacement  test  results  with  simulated  ones  and  using  a 
maximum  likelihood  estimation  scheme.  It  is  clear  due  to 
the  nonlinearities  involved  that  the  pneumatic  model 
parameters  should  be  identified  for  displacement  levels 
consistent  with  those  the  pistons  will  be  subjected  to 
during  actual  polishing.  During  polishing,  piston 
displacements  arise  from  workpiece  surface  irregularities 
and  robot  tracking  errors.  The  effect  of  robot  tracking 
errors  is  given  more  attention  here  since  the  workpiece 
surface  irregularities  are  not  very  large  in  magnitude  for 
polishing  tasks. 

An  experimentally  determined  robot  tracking  error 
profile  is  shown  in  Figure  3  for  the  six-axis  GMF  S-400 
industrial  robot  used  here  to  hold  the  CFD  during 
polishing  passes.  This  profile  has  a  low  frequency 
component  with  a  half  wave  type  shape  and  lower 
amplitude  components  at  1  to  2  Hz  superimposed.  The 
peak  to  peak  (abbreviated  p-p  in  the  following)  amplitude 
of  the  1  to  2  Hz  component  is  of  the  order  of  1  mil.  This 
1-2  Hz  component  of  the  robot  tracking  error  will  not 
even  excite  the  pressure  regulation  mechanism  because 
the  corresponding  changes  in  pressure  are  within  the  finite 
resolution  of  the  pressure  regulator.  The  larger 
displacements  that  the  pistons  will  be  subjected  to  during 
the  beginning  and  end  of  the  robot  motion  occur  gradually 
rather  than  in  a  stepwise  fashion  and  the  pressure  regulator 
response  should  be  fast  enough  to  compensate  for  them.  In 
short,  the  pneumatic  circuitry  of  the  CFD  is  expected  to 
reject  this  robot  tracking  error  profile  which  enters  the 
loop  at  the  disturbance  input  after  being  scaled  down  due 
to  the  geometry  of  the  CFD.  In  order  to  test  this  assertion 
directly,  a  high  precision  linear  positioning  stage  under 
closed  loop  position  control  was  connected  directly  to  one 
of  the  CFD  cylinders  under  regulated  pressure  control  to 
subject  it  to  the  robot  tracking  error  input.  The  measured 
cylinder  pressure  response  shown  in  Figure  4  has  a 
maximum  change  in  pressure  that  is  less  than  the  pressure 
regulation  system’s  resolution  of  0.005  psi,  showing 
successful  pressure  regulation. 

2.2.  CFD  mechanical  model 

The  mechanical  structure  of  the  CFD  consists  of  a 
parallelogram  mechanism  fixed  to  the  robot  or  a  stationary 
base  through  two  hinged  joints  (Oi  and  02  in  Figure  5). 
The  tool  weight  Mtg  is  balanced  by  the  counterbalance 
weight  Mcwg  and  the  difference  in  cylinder  pressure  forces 
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Fpd  and  Fpu  is  used  to  obtain  a  constant  workpiece 
interaction  force  in  the  static  case.  The  simplified 
equivalent  model  of  the  CFD  mechanical  structure  shown 
in  Figure  6  is  used  in  deriving  the  equations  of  motion.  In 
this  figure,  all  the  mass  has  been  lumped  into  the  tool 
mass  Mt  and  the  counterweight  mass  Mcw,  assuming 
massless  links.  The  normal  contact  and  pressure  forces  are 
given  by 

Fn  ~  Kpr  (xn  F  xtr  F  Xwp  )  ( 1 ) 

Fp=Ac(Pcd-Pcu)  (2) 

where  x*  is  the  robot  tracking  error,  xn  is  the  tool  position 
relative  to  the  CFD,  and  Kpr  is  the  equivalent  stiffness  of 
the  polishing  tool  and  robot-tool  holder.  Modeling  the 
dynamics  of  this  device  and  using  (1)  and  (2)  results  in 

(M,+MmXlJl3)2}cn+K  prx„  =  -Kpr  (x,,  +xwp)  + 
(hlh)Ac(Pcd~Pcu)-  (3) 

The  mechanical  model  of  the  CFD  given  by  (3)  is,  thus, 
seen  to  be  a  mass-spring  system  excited  by  robot  tracking 
errors,  workpiece  surface  irregularities  and  dynamic 
pressure  forces.  When  typical  numerical  values  are 
substituted  in  Equation  (3),  the  bandwidth  of  the 
mechanical  system  is  determined  as  14  Hz 

2.3.  The  overall  model 

The  whole  CFD  model  is  shown  in  block  diagram  form 
in  Figure  7.  Robot  tracking  errors,  workpiece  surface 
irregularities  and  any  net  pressure  changes  act  as  inputs  to 
the  mechanical  mass-spring  structure  which  compensates 
for  them  in  the  0  to  14  Hz  frequency  range.  Robot 
tracking  errors  and  workpiece  surface  irregularities  cause 
the  CFD  pistons  to  move  inside  the  cylinders  causing 
transient  changes  in  pressures  and  hence  a  change  in  the 
pressure  force  in  (3),  but  the  regulators  adjust  the 
pressures  to  the  set  point  values  and  xn  follows  the 
negative  of  x^+x^  resulting  in  a  constant  force  Fn  if  the 
disturbances  occur  at  low  enough  frequencies. 

The  adequacy  of  the  pneumatic  circuitry  of  the  CFD  in 
accommodating  robot  tracking  errors  was  pointed  out 
earlier.  The  mechanical  model  of  the  CFD  with  a 
resonance  at  14  Hz  will  also  not  cause  any  problems  at  the 
much  lower  robot  tracking  error  frequencies.  Hence,  good 
force  regulation  performance  in  polishing  is  expected 
from  the  CFD  constant  force  device. 

2.4.  Force  control  experiments 

The  normal  component  of  the  polishing  force  recorded 
during  a  polishing  pass  with  a  compliant  polishing  wheel 


is  displayed  in  Figure  8  on  a  magnified  scale.  There  are 
low  frequency  waviness  (0.06  Hz)  and  1-2  Hz  components 
of  0.2  lbf  and  0.06  lbf  p-p  magnitude,  respectively.  The 
higher  frequency  component  is  at  the  same  frequency  as 
the  robot  tracking  error.  These  normal  force  fluctuations 
suggest  that  the  CFD  is  not  capable  of  compensating  for 
robot  tracking  errors  during  polishing.  This  contradicts 
earlier  results  since  the  CFD  device  does  a  poorer  job  of 
regulating  the  polishing  force  than  would  be  expected  by 
the  performance  of  the  single  cylinder,  regulator 
combination  and  the  large  bandwidth  of  its  mechanical 
structure. 

After  carefully  considering  several  possibilities  in 
order  to  explain  the  presence  of  unexpected  polishing 
force  fluctuations,  it  has  been  determined  experimentally 
that  the  problem  results  from  clearances  and  backlash  in 
the  hinged  and  ball  and  socket  joints  of  the  rotary 
mechanical  structure  of  the  CFD  giving  rise  to  residual 
tool  motion  of  up  to  0.55  mils  that  stays  as  a  net  tool- 
workpiece  interference  but  is  not  transmitted  to  the 
pistons.  A  residual  too  displacement  of  0.55  mils  during 
polishing  corresponds  to  a  0.22  lbf  change  in  force  for  a 
compliant  polishing  wheel  with  a  400  lbtfin  stiffness, 
explaining  the  reason  for  the  force  fluctuations  observed 
during  polishing  tests.  This  is  mainly  due  to  the  rotary 
construction  of  the  CFD,  necessitating  the  presence  of 
several  joints.  Due  to  clearances  and  backlash  in  these 
joints,  displacements  of  up  to  0.55  mils  at  the  tool  plate 
are  not  transmitted  back  to  the  pistons  and  remain  as 
polishing  wheel-workpiece  interference. 

The  disturbance  rejection  frequency  response  of  the 
CFD  shown  in  Figure  9  is  determined  next,  based  on  its 
quasilinear  mathematical  model  which  is  linearized  by 
assuming  that  the  charging  and  discharging  coefficients 
are  the  same.  A  comparison  of  Figure  9  with  the 
disturbance  rejection  characteristic  of  through-the-arm 
force  control  for  the  GMF  S-400  robot  used  here  (see  [3]) 
shows  the  enhanced  disturbance  rejection  obtained  when 
using  the  CFD.  In  theory,  this  nonprogrammable  active 
end  effector  will  effectively  compensate  for  robot  tracking 
errors  in  the  frequency  range  of  0  to  5  Hz  and  achieve 
tight  force  control.  This  is  not  seen  in  practice  since  the 
joint  clearance  effects  are  not  present  in  the  model  used  to 
obtain  Figure  9.  This  is  still  a  useful  result  showing  what 
could  be  achieved  in  the  limiting  case  of  a  typical 
nonprogrammable,  pneumatically  powered  active  end 
effector  if  the  design  of  its  mechanical  structure  could  be 
improved. 

3.  The  robot  polisher 

The  robot  polisher  SMR-100H  is  a  three-axis  die 
polishing  machine  which  uses  a  pneumatically  powered, 
nonprogrammable  active  end  effector  to  achieve  force 
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control  indirectly  through  cylinder  pressure  regulation.  Its 
force  control  system  is  shown  schematically  in  Figure  10. 
The  robot  polisher  uses  a  double  acting  cylinder  and  a 
pressure  regulator  to  keep  cylinder  pressure  constant. 
Although  the  connection  between  the  piston  and  movable 
polishing  head  on  the  robot  polisher  end  effector  is  a 
direct  one,  there  are  still  some  clearances  in  the  linear 
drive  rail  mechanism.  Also,  a  high  amount  of  compliance 
is  introduced  by  the  belt  drive  utilized  in  powering  the 
reciprocation  mechanism.  So,  low  frequency  force 
fluctuations  can  be  expected  with  this  device  as  well.  This 
is  seen  in  the  polishing  force  plot  of  Figure  11,  where  a 
reciprocating  stone  is  moved  over  the  workpiece  by  the 
robot  polisher.  The  higher  frequency  force  fluctuations  in 
Figure  11  are  mainly  due  to  the  reciprocating  motion  of 
the  stone  used.  Note  that  the  effect  of  polishing  force 
fluctuations  above  0.2  Hz  will  not  be  reflected  on  the 
workpiece  surface  as  uneven  material  removal  rate  as  they 
are  averaged  out  by  the  reciprocating  stone. 

4.  Comparison  of  the  CFD  and  the  robot 
polisher 

First,  it  should  be  pointed  out  that  well  polished 
workpiece  surfaces  were  obtained  experimentally  using 
both  the  CFD  held  by  the  GMF  S-400  robot  and  the  robot 
polisher.  Even  though  both  devices  achieve  results 
superior  to  the  through-the-arm  technique,  they  both  suffer 
from  disadvantages  some  of  which  could  be  corrected  by 
appropriate  modifications.  This  will  enable  both  devices 
to  be  used  satisfactorily  in  automated  polishing. 

As  for  the  robot  polisher,  since  the  force  plot  obtained 
during  polishing  with  the  robot  polisher  does  show  force 
fluctuations  at  lower  frequencies,  the  material  removal 
rate  achieved  will  not  be  uniform.  Moreover,  the  end 
effector  mechanism  of  the  robot  polisher  does  not  have 
any  tool  weight  cancellation  mechanism  like  the  one  in  the 
CFD.  Hence,  the  tool  orientation  cannot  be  changed 
during  polishing,  a  major  limitation  in  the  polishing  of 
three  dimensional  sculptured  die  surfaces.  This  is  an 
advantage  of  using  a  six  degrees  of  freedom  industrial 
robot,  with  the  ability  to  position  the  tool  at  a  desired 
orientation  on  any  point  on  the  workpiece  surface.  The 
major  advantage  of  this  device  is  that  large  flat  surfaces 
can  be  polished  very  fast  with  little  programming. 

The  responses  of  the  force  control  loop  obtained  using 
the  CFD  nonprogrammable  active  end  effector  are  better 
than  the  corresponding  responses  achieved  by  through- 
the-arm  force  control.  Moreover,  the  CFD  is  capable  of 
performing  polishing  force  regulation  with  hard  polishing 
tools  like  grinding  wheels  and  reciprocating  stones,  an 
impossibility  with  through-the-arm  robot  force  control. 
These  two  points  clearly  show  the  benefits  achieved 


through  the  use  of  an  independently  controlled  active  end 
effector.  A  further  benefit  of  using  the  CFD  device  in 
comparison  to  the  robot  polisher  is  not  having  to  deal  with 
orientation  dependent  force  sensor  offsets  due  to  the 
presence  of  its  mechanical  tool  weight  cancellation 
mechanism.  In  short,  a  rotary  pneumatic  constant  force 
device  like  the  CFD  can  be  used  satisfactorily  in  robot 
assisted  die  polishing,  provided  that  its  design  could  be 
improved. 

A  major  disadvantage  of  nonprogrammable  active  end 
effectors  is  their  nonadaptability  to  large  changes  in 
polishing  process  -  requirements  due  to  their 
nonprogrammability  and  their  inability  to  follow 
commanded  force  trajectories.  The  use  of  a  programmable 
active  end  effector  becomes  a  necessity  in  such  cases. 

5.  Conclusions 

Nonprogrammable  active  end  effectors  have  been 
evaluated  here  as  an  alternative  to  through-the-arm  and 
programmable  active  end  effector  control  strategies  in 
robot  assisted  polishing.  Through-the-arm  robot  force 
control  offers  the  easiest  implementation  but  suffers  from 
its  reliance  on  the  dynamic  response  of  the  massive  robot 
arm  for  fine  motion  manipulation  in  tool  coordinates. 
Programmable  active  end  effector  usage  offers  the  most 
versatile  and  adaptable  solution  at  the  sake  of  increasing 
the  overall  cost. 

Nonprogrammable  active  end  effector  usage  has  been 
shown  to  be  a  less  adaptable  but  cheaper  solution.  Two 
examples  of  nonprogrammable  active  end  effectors  have 
been  analyzed  in  detail  here  regarding  their  performance 
in  robot  assisted  polishing.  It  has  been  shown  that 
satisfactory  polishing  performance  can  be  achieved  using 
both  devices  for  operating  conditions  that  do  not  show 
wide  variations.  Improvements  to  both  devices  have  been 
suggested. 
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Figure  5.  CFD  mechanical  structure 


Figure  3.  Robot  tracking  error  profile 


Figure  6.  Simplified  CFD  mechanical  model 
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Figure  7.  Overall  CFD  model 
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Figure  9.  CFD  disturbance  rejection 


Figure  11.  Normal  force  during  polishing  with  the 
robot  polisher 
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Abstract 

The  current  dynamic  and  turbulent  business  environment 
and  the  approaching  21st  century  have  put  pressure  on 
business  operations  all  around  the  world  to  change 
traditional  methods  of  conducting  business  to  Internet- 
Based  Electronic  Commerce  (or  Internet  Commerce). 
Electronic  Commerce  has  been  an  idealised  trading 
concept  for  many  years.  But  the  lack  of  integrated 
applications  and  the  unavailability  of  universally 
accepted  methods  of  communication  have  been  a 
constant  plague.  As  the  Internet  develops  there  is  every 
possibility  that  e-mail  and  applications-based 
technologies  will  take  over  from  traditional  telephony  as 
the  common  method  of  communications.  Furthermore , 
the  Internet  provides  an  unprecedented  infrastructure  for 
moving  information.  This  will  have  immense 
repercussions  within  the  commercial  world.  This  paper 
analyses  the  key  drivers  and  economic  values  of 
integration  Internet  with  business  operations  in  smart 
manufacturing  management. 


1.  Introduction 

The  market  place  of  the  twenty-first  century  is  evolving 
into  one  of  merging  national  markets  fragmented 
consumer  market,  and  rapidly  changing  product/service 
technologies.  These  changes  are  driving  firms  to  compete, 
simultaneously,  along  several  different  dimensions: 
design,  manufacturing,  distribution,  communication, 
marketing,  sales,  advertisement,  and  others.  To  effectively 
compete  in  global  markets,  firms  must  be  quick  and 
flexible  in  their  response  to  customer  needs.  Businesses 
are  beginning  to  use  the  Internet  for  such  things  as 
shortening  the  development  cycle  of  new  products, 
communicating  with  experts  from  around  the  world, 
receiving  customer  feedback  on  software,  and  accessing 
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supercomputers  for  industrial  research  and  development 
(Gide  and  Soliman,  1997). 

The  Internet  will  be  one  of  the  most  important  business 
tools  that  will  drive  the  formation  of  integrated  supply 
chains  that  concepts  are  already  evident  in  many 
initiatives  for  just-in-time  manufacturing,  continuous 
replenishment,  vendor-managed  inventoiy  and  electronic 
commerce.  A  successful  supply-chain  strategy  depends  to 
a  large  extent  on  the  information  system  backbone  that 
supports  it.  The  Internet  is  a  key  part  of  that 
infrastructure,  promoting  business-to-business  integration, 
ultimately  resulting  in  global  market  expansion  and 
increased  control  over  working. 

On  the  other  hand,  in  a  recent  survey  conducted  by 
Arthur  Andersen’s  Enterprise  Group  and  National  Small 
Business  United  in  the  US  to  determine  what  small 
companies  (entrepreneurs)  really  do  on  the  Internet,  it 
was  revealed  that  they  were  engaged  in  global  business 
(Table  1). 


Table  1.  Percent  of  respondents  who  indicated  the 
following  Internet  uses  apply  to  them. 


•;;SnjalJ  : 
'souths 

m  \ 

impodiers: 

(%} 

lipiiil 

..  m  j 

Use  of  the 

Internet 

23 

35 

39 

Conduct 

onihe.fn 

research 

left 

Hill 

WMmMM 

liiilil  pi 

Host/have  a  Web 
page 

8 

14 

17 

In  industiy,  the  efficient  and  effective  management, 
manipulation  and  use  of  information  is  essential  to 
economic  vitality  and  growth.  According  to  some 
experts,  the  use  the  integration  of  information 
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technologies,  in  an  infrastructure  of  communication 
networks,  hardware  and  software  applications,  databases, 
bulletin  boards  and  other  services,  is  a  critical  ingredients 
in  production  operations  management  success.  Staying 
competitive  in  the  next  decade  will  require  the 
development  of,  and  commitment  to,  a  strategy  focused 
on  improving  manufacturing  (or  service)  cost  and  quality 
(Gide  &  Soliman,  1997). 

2.  What  does  Internet  Commerce  Mean  for 
Manufacturing? 

Electronic  Commerce,  Electronic  Trading  and 
Electronic  Business  are  often  used  interchangeably  and 
many  times  there  is  a  perception  that  these  terms 
principally  refer  to  the  procurement  cycle  -  the  ordering 
and  paying  for  goods  or  services  either  via  electronic 
commerce  technologies  such  as  EDI  or,  more  recently 
and  growing  in  popularity,  on-line  Internet  shopping. 

Internet-Based  E-Commerce  is  not  an  extension  of  EDI 
(Electronic  Data  Interchange)  which  has  been  primarily 
limited  to  computer-to-computer  transactions,  and  has 
not  been  associated  with  major  transformations  of  firms. 
There  is  no  formal  definition  of  the  Internet-Based  E- 
Commerce.  Since,  Internet  commerce  is  still  immature, 
so  is  the  definition.  However,  one  definition  made  by 
Kalakota  (1996),  as  " the  process  of  converting  digital 
inputs  into  value-added  outputs ”.  Basically,  this  process 
involves  taking  information  as  raw  material  and 
producing  value  added  information-based  products  or 
services  out  of  the  original  raw  information. 

So,  electronic  commerce  refers  to  an  on-line  production 
process  owned  by  intermediaries.  Producers  of 
information  interact  with  services  and  other  processed 
information,  such  as  orders,  payments  or  instructions.  In 
reality,  Internet  Commerce  is  about  businesses  and 
consumers  adopting  a  new  process  or  methodology  in 
dealing  with  each  other.  These  processes  are  in  essence 
supported  by  electronic  interactions  that  replace  close 
physical  presence  requirements  or  other  traditional 
means  (Soliman  &  Gide.  1997). 

Internet-Based  E-Commerce  is  giving  a  new  way  to 
electronic  commerce,  with  different  characteristics  than 
traditional  EDI.  Internet  Commerce  is  not  a  repeat  of 
EDI,  but  rather  is  an  evolution  from  the  EDI.  The 
Internet  offers  the  grealcst  potential  for  Electronic 
Commerce  known  to  dale.  According  to  Steel  (1996) 
"there  are  less  than  100.000  EDI  (Electronic  Data 
Interchange)  users  world-wide  after  40  years  or  so  of 
endeavour",  but  Nielsen  Media  Research  and 


CommerceNet  survey  shows  that  78  million  people  used 
the  Web  during  the  first  six  months  of  1998,  and  20 
million  of  those  users  made  purchases  via  the  Web. 
Businesses  are  aggressively  adopting  inter-company  trade 
over  the  Internet  because  they  want  to  cut  costs,  reduce 
order-processing  time,  and  improve  information  flow. 
According  to  the  Internet  marketing  research  firm 
ActivMedia  (1997)  projections  indicate  that  global  Web 
sales  through  2002  could  total  $1.5  trillion,  or  about  3% 
of  combined  Gross  Domestic  Product  (GDP)  for  all 
countries  worldwide.  * 

The  phenomenal  predictions  of  the  size  of  the  Internet 
market  should  be  interpreted  with  some  other  factors  in 
mind.  The  Internet  is  beginning  to  generate  new  sales 
channels,  especially  for  products  and  services,  which  can 
be  delivered  digitally  over  the  net.  And  there  is  no  doubt 
that  bank-assumed  risk  from  credit  card  transactions 
through  SET  processes  will  accelerate  traditional  retail 
sales  over  the  Internet.  But  these  sales  are  still  generally 
no  more  than  sales  substitution,  or  sales  that  would 
previously  have  been  made  by  personal  visits,  mail  order 
or  the  like.  An  interesting  report  on  Web  shopping  and 
purchasing  is  new  data  taken  from  a  Nielsen  Media 
Research/CommerceNet  study  on  Internet  demographics 
released  to  researchers  in  August  1998  (Table:  2). 


'  Table  2.  Top  items  purchased  on  the  Web,  June,  1998 
vs.  September,  1997 


:  Mm  Purchased  • 

June,  1998  j 
(ttiiliioiipeplc); 

Books 

5.6 

2.3 

.  Computet  Ha^ 

2.0- 

Computer  Software 

4.0 

2.8 

•  Travel  ;  (airline ; :  tickets, 
mm  &cari  reservations) 

1.2 

Clothing 

2.7 

0.9 

3.  The  Key  Economic  Values  of  E-Commerce 

There  are  various  types  of  key  measurements  that  must 
be  tracked  prior  to  embarking  on  a  full  implementation. 
Some  of  the  important  key  elements  to  measure  business 
value  are: 

•  Improving  customer  service:  Providing  customers 
self-access  to  their  accounts,  transactions  and  orders, 
is  a  valuable  service  (Mougayar,  1997).  The  level  of 
satisfaction  for  those  customers  interacting 
electronically  will  undoubtedly  rise. 
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•  Reducing  costs:  The  most  basic  cost  reductions 
could  be  related  to  publishing  costs,  which  include 
the  cost  of  production,  printing  and  distribution 
(Mougayar,  1997).  Furthermore,  marketing  and 
selling  costs  are  also  lower  in  an  electronically 
enabled  commerce  environment. 

•  Providing  Business  Intelligence:  In  the  electronic 
commerce  world  businesses  need  to  know  much 
more  about  their  clients.  Electronic  commerce  makes 
it  possible  to  market  to  specific  individuals  based  on 
their  patterns  of  (purchasing  and  browsing) 
behaviour.  Hence  they  need  to  capture,  and  to 
analyse,  as  much  information  as  possible  about  each 
individual  purchase  (or  cancelled  purchase)  in  order 
to  build  up  customer's  profiles.  This  is  achieved  in 
much  the  same  way  that  neighbourhood  stores  once 
did,  through  personal  acquaintance  with  the 
consumer  and  continuous  contact  The  use  of  this 
analysed  data  leads  to  what  is  being  called  "market 
response"  systems  or  "adaptive  marketing". 

•  Process  simplification:  Instead  of  using  paper, 
using  the  World  Wide  Web  (WWW)  simplifies  and 
speeds  the  approval  process  (Mougayar,  1997). 

•  Generating  new  revenue:  The  new  Internet-Based 
Electronic  Marketplace  generates  new  revenue  by 
selling  new  products  and  services  specifically 
designed  for  the  electronic  marketplace  (Mougayar, 
1997).  Existing  product  or  services  can  also  be  sold 
on  the  Internet. 

•  Taking  faster  decisions:  By  receiving  information 
about  competition  through  an  Intranet  information 
retrieval  database,  it  would  be  possible  to  develop  a 
competitive  strategy  much  faster  than  otherwise.  The 
drivers  for  manufacturing  are  customer’s  needs  and 
time.  Time  is  a  major  source  of  competitive 
advantage  and  competitive  pressures  requiring 
production  schedules  to  be  shortened. 

4.  Manufacturing  Value  of  Internet 
Commerce 

The  cost-benefit  justification  of  Internet  access  and 
Information  Systems  in  general  is.  and  always  be,  a 
difficult  one  to  prove  but  due  to  access  of  real  time  data, 
this  would  provide  long  term  benefit  of  immeasurable 
value.  To  be  able  to  analyze  the  value  of  Internet 
Commerce  usage,  it  is  helpful  to  have  a  measuring  ruler. 
For  individuals,  one  can  compare  the  cost  of  an  Internet 


connection  to  the  cost  of  using  the  telephone.  It  has  been 
demonstrated  that  e-mail  is  cheaper  than  phone  to 
communicate  long  distance  with  a  number  of  people. 

The  Internet  Commerce  usage  can  make  it  possible  to 
reduce  the  amount  of  time  or  effort  required  to  perform 
certain  tasks:  cost  savings  and  benefits  from  providing 
sales  and  customer  support  online  and  increase  the 
potential  of  collaborative  partnerships  established  over 
the  Internet.  The  primaiy  reasons  for  implementation  of 
Internet  Commerce  is  illustrated  below  in  Table  3. 


Table  3.  The  Main  Reasons  for  Implementation 
of  Internet  Commerce  (Source:  Computerworld 
1998) 


Reason  fear  Using  Internet 
Commerce' 

Percentage^)  • 

Cost  Savings 

35 

Qv&oxmSwniX 

m  • 

Revenue  Generation 

18 

Marketing  •  • 

Others 

2 

Total 

100 

Many  businesses  are  using  the  Internet  to  contain  long¬ 
distance  telephone  and  mailing  costs.  Recent  studies 
have  shown  that  businesses  can  save  thousands 
of  dollars  using  e-mail,  in  lieu  of  some  long  distance 
phone  calls  and  postal  deliveries.  For  example,  with  first- 
class  letters  costing  45  cents  each,  a  mailing  of  1,000 
pieces  to  customers  would  cost  $450  for  postage  alone, 
whereas  the  same  information  sent  by  e-mail  would  cost 
2  to  3  cents  each  -  and  the  messages  would  arrive  in 
seconds  as  opposed  to  days  or  weeks.  Overnight  mail 
(which  typically  costs  at  least  $4-$6  for  each  delivery) 
can  not  compete  with  e-mail  for  speed  or  cost.  Long 
distance  telephone  charges,  particularly  international 
charges,  are  reduced  by  use  of  e-mail. 

Businesses  spend  much  more  money  than  individuals 
on  their  phone.  As  an  example,  one  provider’s  estimated 
cost  of  an  Internet  connection  for  a  site  of  15,000  users 
was  about  8  cents  per  month  per  user.  The  same  Internet 
connection  for  100  users  would  cost  about  $12.50  per 
month  which  is  smaller  than  the  cost  of  $15  per  month 
for  a  phone.  In  most  cases,  the  cost  of  an  Internet  will  be 
far  less  than  an  equivalent  telephone  connection  and  far 
more  valuable  in  the  long  run.  An  average  individual 
user  spends  $20  to  $30  monthly  on  an  Internet 
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connection  which  compares  favourably  with  the  cost  of  a 
telephone  line  (Estrada,  1993). 

According  to  Mougayar  (1997)  there  are  various  types 
of  key  measurements  that  must  be  tracked  prior  to 
embarking  on  a  full  implementation.  The  old  adage  “if 
you  can’t  measure  it,  you  can’t  tract  it”  applies.  Some  of 
the  important  key  elements  to  measure  business  value 
are:  reducing  costs,  process  simplification,  improving 
customer  service,  generating  new  revenue  and  taking 
faster  decisions. 

5.  Current  Challenges  to  Internet-Based  E- 
Commerce 

There  are  two  main  drawbacks  or  challenges  in  using 
Internet-Based  E-commerce,  these  are:  security  issues 
and  payment  tools  (Soliman  &  Gide,  1997).  These  two 
issues  are  receiving  the  highest  priority  and  the  best 
attention  they  deserve,  both  from  vendors  and  users  and 
implementers. 

5.1.  Security  and  Privacy  Issues 

While  EDI  users  enjoy  a  high  level  of  reliability  and 
security,  they  are  often  restricted  to  the  exchange  of  data 
with  users  of  the  same  Value  Added  Network  (VAN).  For 
Internet  commerce  to  really  transform  the  way  we  do 
business  a  secure  solution  that  works  globally  is  required. 
To  achieve  this,  a  series  of  international  standards  needs 
to  be  agreed  and  vendors  need  to  carry  out  a  rigorous 
program  of  interoperability  tests.  Moreover,  as  trade 
moves  beyond  national  boundaries,  a  common  legal 
infrastructure  must  be  agreed.  For  example,  a  contract 
that  has  been  digitally  signed  in  one  country  needs  be 
recognised  in  other  countries.  Recent  researches  show 
that  that  there  is  a  growing  confidence  in  solving  the 
Internet  security  issues  that  have  been  very  widely 
publicised.  Even  though  security  is  a  challenge  it  is  not 
the  main  barrier  to  E-Commerce  adaptation. 

5.2,  Payment  Methods 

There  is  confusion  over  the  availability  and  choice  of 
Internet  payments  tools.  In  addition,  there  are  no 
interoperability  standards  to  make  one  work  with 
another.  Over  the  past  two  years,  new  payment  tools  from 
small  companies  have  emerged.  The  difficulty  with  E- 
Commerce-at  the  moment -is  that  all  on  line  payment 
methods  are  new  and  relatively  untested.  It  is  vital  that 
the  business  obtains  payment  at  no  less  favourable  terms 
than  it  currently  does.  Similarly  the  banks  involved  in  the 


process  want  to  be  sure  about  security  and  costs  of 
processing  the  transaction  within  their  existing  security 
and  operating  restraints.  Some  of  the  new  E-Commerce 
payment  tools  that  can  be  used  in  manufacturing  and 
business  operations  are: 

Electronic  Cash  (Digital  Cash)-  It  is  a  token-based 
currency  which  translates  into  equivalent  real  currency 
units  that  are  guaranteed  by  a  bank,  (eg  DigiCash). 

Smart  Cards-  Smarf  Cards  can  be  used  with  or  without  a 
stored  value.  If  they  have  a  stored  value  which  contains 
“real  digital  cash”,  they  are  known  as  “Cash  Cards” 
because  they  replace  carrying  cash  (eg  Mondex). 

Electronic  Cheques-  These  are  the  equivalent  of  paper 
based  cheques.  Authentication  and  verification  are 
usually  performed  instantaneously  by  using  digital 
signatures  and  time-stamping  controls  during  the 
transaction  (eg  CheckFree). 

Encrypted  Credit  Cards-  There  are  varying  degrees  of 
encryption  implementations  credit  of  credit  cards  over 
the  Internet,  with  the  SET  (Secure  Electronic 
Transactions)  holding  the  most  promise  (eg  CyberCash). 

6.  Integrating  Internet  with  Business 
Operations: 

At  the  moment,  according  to  many  E-Commerce 
experts  generally  there  are  3  types  of  Internet-based  E- 
Commerce  applications: 


■  Business  to  Business 

■  Business  to  Consumer,  and 

■  Business  to  Employee. 


Business.'::::;::!: 
V  . '3 

JL 

Figure  1.  Electronic  commerce  application  types  in 
the  virtual  market. 


109 


6.1.  Business-to-Business  E-Commerce 


Business-to-Business  E-Commerce  is  complementary 
to  EDI  in  that  it  is  beginning  to  be  used  for  non¬ 
production,  non-replenishment  applications.  The  widely 
used  current  terms  used  to  describe  the  function  of  E- 
Commerce  are  "Business  to  Business"  and  "Business  to 
Consumer".  The  expression  "business-to-business"  is 
inexact  and  sometimes  misleading.  In  E-Commerce 
systems  it  is  not  always  possible  to  tell  who  is  accessing 
the  automated  point  of  salc/point  of  contact.  It  could  be  a 
retail  consumer  buying  in  wholesale  quantities;  it  could 
be  a  business  buying  in  retail  quantities-and  many  other 
variants.  Business-to-Business  electronic  ordering 
processes  are  generally  designed  to  empower  business 
managers.  The  business  server  can  normally  be  accessed 
through  the  corporate  Intranet,  or  an  Extranet.  Once  the 
log  on  process  is  completed  then  the  appropriate 
catalogue  is  selected.  Ultimately  all  authorised  suppliers' 
catalogues  will  be  accessible  by  this  business  server. 
Suppliers'  sites  are  not  normally  publicly  promoted  or 
advertised;  they  are  trade  sites,  limited  to  their  own 
closed  user  group  of  clients.  Payment  is  made  using  a 
bank  purchase  card  selected  from  an  electronic  wallet. 

6.2.  The  Business-to-Consumer  E-Commerce 

The  Business-to-Consunicr  E-Commerce  is  designed  to 
complement  normal  retail  shopping,  mail  order  and 
direct  marketing.  It  can  accommodate  delivery  of  soft,  or 
digital  goods,  such  as  published  material,  software,  audio 
and  video  products. 

6.3.  Business-to-Emplovee  E-Commerce 

Business-to-Employee  E -Commerce  is  beginning  to 
develop  a  new  market  place.  Many  companies  now  allow 
employee  to  buy  using  the  corporate  Intranet.  A  variant 
from  an  emulated  business-to-consumer  application  is 
where  employees  may  have  purchases  deducted  from  the 
payroll,  or  from  allowances.  Allowances  or  entitlements 
for  clothes  or  equipment  are  often  the  norm  in  the  armed 
services,  police,  fire  services,  airlines,  banks,  health 
services  and  so  on.  Not  only  do  these  systems  have  to 
keep  track  of  entitlements  and  usage,  but  accounts,  sizes 
and  up  to  date  measurements,  location  and  other 
variables.  This  application  promises  to  be  an  important 
hybrid  of  business-to-consumer  and  business-to-business. 


7.  Economic  Benefits  of  E-Commerce  for 
Business  Operations 


To  date  the  major  benefits  from  the  Internet  include 
improved  internal  and  external  communications.  The 
Web  has  specifically  brought  a  new  marketing  medium 
and  enhanced  information  resource.  Innovative 
applications  are  starting  to  appear  which  allow  for  sales 
and  database  interrogation.  Other  benefits  such  as  e-mail 
and  file  transfer  functionality,  Web  utilisation  gave  many 
companies  [ Internet  presence '  and  provided  them  with 
opportunities  to  develop  and  expand  new  services.  A 
framework  for  realising  the  optimum  level  of  benefits 
from  using  the  E-Commerce  over  the  Internet  is 
illustrated  below  in  Figure  2. 


High  level1 

Medium  to; :  : 
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business  lls-i 
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(Intermediate) 
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Optimum  Benefits 
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Management 
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Returns 
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•  Management  .  Ji; 
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e-banking 
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i  □  if  On-Line1  Catalogues  arid 
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□  .Secure  Transactions 
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;Qv ;  iOn-LiriP  Gpirimiihicatibri 
'  □ . .  ’ .  Qn/Off-Line  Delivery 
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Information 
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Figure  2.  Framework  for  optimum  level  of 
adaptation  and  benefits  of  e-commerce  over  the 
Internet. 
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According  to  the  Cisco  Systems  Inc.,  the  leading 
maker  of  Internet  equipment,  more  than  $1  trillion  to  $2 
trillion  worth  of  goods  and  services  will  be  sold  on  the 
Net  by  2002.  On  the  other  hand,  Gartner,  a  top  industiy 
consulting  firm,  has  estimated  business-to-business 
electronic  commerce  will  be  12  to  15  times  larger  than 
consumer  markets  for  the  next  few  years,  with  consumer 
sales  only  catching  up  with  business  markets  midway 
into  the  next  decade.  Businesses-to-business  e-commerce 
involves  companies  and  their  suppliers  while  consumer 
markets  include  home  shopping,  banking,  health  care 
and  broadband  communications  to  the  home.  Although 
sales  between  businesses  will  dominate  electronic 
commerce  in  the  near  term.  Cisco  estimates  that  by  2002 
consumer-oriented  business  would  represent  50  percent 
of  the  Internet  economy. 

8.  Conclusions 

This  paper  analysed  the  key  drivers  and  economic 
values  of  integration  Internet  with  business  operations  in 
smart  manufacturing  management. 

Even  though  E-Commcrce  makes  sense  theoretically 
for  business  activities,  the  reality  is  that  it  has  to  integrate 
with  internal  and  externa!  processes  that  are  already  in 
place.  Sometimes,  this  imegration  is  a  challenge  linked 
to  a  major  re-engineering  exercise  accompanied  by 
resistance  to  change. 

Despite  the  benefits  and  success  stories  a  number  of 
issues  remain  to  be  resolved  such  as  security,  privacy, 
and  payment  tools.  It  is  believed  that  this  concern 
regarding  security  will  be  lessened  due  to  a  series  of 
international  developments  (Gide  &  Soliman,  1997). 
Other  issues  regarding  the  growth  of  the  Internet  are  the 
lack  of:  a  public  key  infrastructure  (particularly  for 
international  trade),  governmental  stance,  access, 
reliability  (service  levels),  integrated  applications  and 
understanding/awareness  of  the  Internet-Based  E- 
Commerce  capabilities,  and  finally  the  relative  cost  of 
required  technologies. 
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Abstract 

This  paper  describes  the  architecture 
and  base  concepts  of  building  the  real  time 
decision  support  expert  systems  on  the  example 
of  a  prototype  for  monitoring  and  management 
of  some  complex  objects  (the  main  circulating 
pump,  capacitor  plant,  ejector  plant)  of  a  nuclear 
power  block,  implemented  on  the  basis  of  the 
tools  G2+GDA. 

1.  Introduction 

Real  time  decision  support  systems 
(RTDSS)  are  hardware-software  complexes, 
intended  for  the  help  to  the  decision  making 
persons  (DMP)  at  management  of  complex  objects 
and  processes  of  a  various  nature  in  conditions  of 
rigid  temporary  restrictions.  When  searching  the 
decisions,  expert  models,  constructed  on  the  basis 
of  expert  knowledge,  and  heuristic  methods  of 
decision  search,  are  used.  According  to  a  modern 
classification  of  software,  RTDSS  are  a  class  of 
integrated  intelligent  (expert)  systems  of  a 
semiotic  type,  combining  strict  mathematical 
decision  search  methods  with  nonstrict,  heuristic 
methods,  based  on  expert  knowledge  [I-3J. 

The  necessity  of  creating  RTDSS  is 
caused  by  continuously  growing  complexity  of 
controlled  objects  and  processes  with  simultaneous 
time  reduction,  yielded  by  DMP  on  problem 
situation  analysis  and  acceptance  of  necessary 
managing  actions. 


Conceptually  joining  the  approaches  and 
methods  of  the  decision  support  theory,  theory  of 
information  systems,  artificial  intelligence  and 
using  the  objective  and  subjective  information. 
RTDSS  provides  to  DMP  with  the  analysis  of  a 
soluble  problem  and  directs  him  during  decision 
search  for  increasing  of  a  decision  efficiency. 

One  of  the  basic  problems  at  designing 
RTDSS  is  a  choice  of  the  suitable  formal 
apparatus  for  a  description  of  the  decision  support 
process  and  a  construction  on  its  base  of  an 
adequate  (correct)  decision  making  model 

(DMM).  As  such  apparatus  the  production  systems 
are  usually  used.  However,  available  design  tools 
of  expert  systems  are  oriented  towards  static 
problem  domains,  i.e.  on  situations,  not  requiring 
the  corrections  of  DMM  and  decision  support 
strategies  during  decision  search  [2], 

2.  Specifity  of  RTDSS 

Peculiarities  of  problems,  solved  by 

RTDSS,  are: 

•  the  necessity  of  the  temporary  factor  account 

at  the  problem  situation  description  and 

during  decision  search; 

•  the  necessity  of  decision  making  in  conditions 
of  temporaiy  restrictions,  determined  by  a  real 
controlled  process; 

•  impossibility  of  obtaining  all  objective 

information,  necessary  for  the  decision,  and. 
iri  this  connection,  use  of  subjective,  expert 
information; 

•  complexity  of  a  search,  the  necessity  of  an 
active  participation  of  DMP; 

•  presence  of  nondeterminism,  the  necessity  of 
a  correction  and  an  introduction  of  additional 
information  during  decision  search. 

A  basic  purpose  of  RTDSS  is  to  help  to 
DMP  at  the  control  of  complex  objects  and 
processes,  revealing  and  preventioning  of  dangers, 
development  of  the  recommendations,  i.e.  to  help 
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in  the  support  of  an  abnormal  regime  before  it  will 
become  irreversible.  Developed  RTDSS  for 
operative-dispatching  monitoring  and 
management  of  a  nuclear  station  power  block  and, 
in  particular,  its  subsystems  for  monitoring  and 
management  of  the  main  circulating  pump  (MCP). 
capacitor  (CAP)  and  ejector  plant  (EJ),  concern 
just  such  class  [4]. 

The  choice  of  the  tools  G2  for 
implementing  RTDSS  is  caused  by  integration  in 
it  of  basic  high-effective  technologies  of  complex 
program  product  development:  object-oriented 
programming;  open  system  technology  and  client- 
server  one;  the  active  object  graphics;  structured 
natural  language  and  hypertext  for  information 
representation;  decision  search,  based  on 
production  rules,  procedures,  dynamic  (imitative) 
models;  parallel  fulfillment  in  real  time  of 
independent  processes;  the  friendly  interface  with 
various  types  of  the  users  (DMP,  system  manager, 
expert,  knowledge  engineer,  programmer);  a 
combination  of  technology  of  intelligent  (expert) 
systems,  based  on  knowledge,  with  technology  of 
traditional  programming. 

The  structure  of  base  tools  G2+GDA, 
necessary  for  RTDSS  designing,  consists  of  the 
interactive  editor,  tools  of  the  graphic  interface 
with  the  user,  object-oriented  graphics,  graphic 
real  time  monitoring  windows  and  animation, 
tools  for  display  of  connections  between  objects, 
interaction  with  external  environment,  imitative 
modeling  and  processing  of  complex  rules  and 
procedures,  tools  for  messages  and  explanations. 

3.  The  Architecture  of  RTDSS 

Such  objects  as  MCP,  CAP  and  EJ  of  a 
nuclear  station  power  block  are  not  made  serially. 
Each  object  is  unique  and,  hence,  RTDSS  for 
object  management  is  also  unique.  But  at 
designing  RTDSS  for  various  objects  it  is  possible 
to  use  the  same  hardware  platform  and  tools. 
Moreover,  within  the  framework  of  G2  class  tools  - 
it  is  possible  to  design  a  tool  environment  of  the 
same  type  of  RTDSS.  Such  tools  should  give 
limited,  but  rather  complete  set  of  primitives  for 
knowledge  representation  about  an  allocated  class 
of  objects  and  processes  and  about  methods  of 
management  by  them.  Naturally,  the  tool 
environment,  oriented  towards  dynamic  RTDSS, 
should  be  open  for  updating  by  new  constructive 
elements. 

The  generalized  architecture  RTDSS  is 
given  in  fig.  1.  In  contrast  with  traditional  expert 
systems  in  RTDSS  it  is  necessary  to  include 
additional  modeling  blocks  and  forecasting  blocks 
for  analysis  and  estimation  of  accepted  decision 


consequences  and  choice  of  the  best 
recommendations.  These  blocks  are  implemented 
on  the  basis  of  the  imitative  modeling  system  of 
G2. 

We  shall  briefly  view  the  basic  concepts 
of  RTDSS  base  module  realization  at  the  example 
of  the  prototype  for  monitoring  and  management 
of  MCP  of  a  power  block  [4]. 

Data  base  (DB)  is  a  storehouse  of  object 
management  information,  acting  with  gauges  from 
a  subsystem  of  modeling  and  from  DMP.  The  data 
in  DB  represent  a  set  of  continuous  and  discrete 
parameters  of  MCP  (CAP,  EJ).  For  database 
representation,  the  class  hierarchy  is  realized.  The 
class  definition  is  given  by  means  of  the  special 
tables,  including  such  attributes  as:  a  name  of  a 
class  (Class  name),  a  name  of  a  parent  class 
(Superior  class),  description  of  class  specific 
attributes  (Class  specific  attributes). 

A  base  class  of  MCP  parameters  is  a  class 
of  gcn-parameter  -  a  successor  of  a  class  "object”. 
Each  MCP  parameter  has  a  developed  name 
represented  by  DMP,  what  is  reflected  by  attribute 
"fullname".  For  displaying  continuous  and  discrete 
parameters  two  subclasses  of  a  class  "gcn- 
parameter”  -  continuous-parameter  and  discrete- 
parameter  -  are  defined.  All  parameters  are 
displayed  as  icons.  The  color  of  an  icon  of 
continuous  parameters  is  changed  in  depending  on 
a  position  of  a  current  parameter  value.  In  a 
normal  condition  the  icon  color  is  green,  at  an 
abnormal  condition  -  red  color.  The  change  of 
color  occurs  according  to  production  rules  as 
whenever  at  obtaining  the  value  by  attribute 
"state",  for  example: 

whenever  the  state  of  any  continuous- 
parameter  CP  receives  a  value  and  when  not  (the 
state  of  CP  is  normal)  then  change  the  body  icon- 
color  of  CP  to  red . 

The  value  is  also  given  to  attribute  "state" 
on  a  rule  "whenever"  at  obtaining  of  a  current 
value  of  a  continuous  parameter.  For  example: 

whenever  the  cur-value  of  any 
continuous-parameter  CP  receives  a  value  and 
when  the  cur-value  of  CP  the  top-set  of  CP  then 
conclude  that  the  state  of  CP  is  upper. 

DMP  can  change  the  current  value  of  any 
parameter,  if  for  any  reasons,  a  source  of 
parameter  data  (gauge)  has  ceased  to  function. 

Knowledge  base  (KB)  contains  expert 
knowledge,  on  the  basis  of  which  analysis  of  object 
state  will  be  carried  out.  The  kind,  in  which 
knowledge  in  KB  is  presented,  is  determined  by  a 
knowledge  representation  model.  The  greatest 
spread  in  intelligent  systems  has  received  a 
production  model  with  rules  of  the  type: 

If  antecedent  then  consecventl  f, 
else  consecvent2  J. 
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Fig.  1.  The  Generalized  Architecture  of  RTDSS 


In  RTDSS  an  antecedent  represents  a 
logic  expression  concerning  parameters  of  an 
object  state.  A  consequent  describes  some  action, 
which  should  be  undertaken  in  case  of  the  validity 
(consecventl)  or  not  validity  (consecvent2)  of  an 
antecedent.  These  actions  can  consist  in 
distribution  of  a  command  to  the  information 
block,  in  a  change  of  information  about  object 
state,  in  activation  of  some  knowledge  from  KB. 
The  state  estimation  and  the  result  distribution  are 
a  total  of  such  action  sequence.  KB  of  RTDSS 
should  satisfy  to  a  number  of  the  requirements, 
among  which:  presence  of  knowledge,  allowing  to 
cany  out  the  analysis  state  of  an  object  at 
incomplete  information;  presence  for  DMP  of  an 
opportunity  of  updating  KB  in  a  working  mode, 
and.  hence,  presence  of  advanced  means  of  the 
checking  up  entered  data  on  correctness,  when  the 


faulty  changes  in  KB  at  functioning  an  object  are 
inadmissible. 

The  instruction  block  directs  actions  of 
DMP  in  planned  transitive  modes.  It  works 
automatically  (on  a  situation)  at  switching  on  the 
appropriate  mode.  The  information  on  a  mode  of 
functioning  of  object  acts  from  DB.  The  above 
requirements  cause  a  necessity  of  information 
representation  in  KB  in  the  most  convenient  for 
DMP  recognition  a  graphic  form.  The  example  of 
a  fragment  of  external  representation  of  KB  for 
DMP  as  the  decision  tree  with  the  necessary 
explanatory  is  presented  in  fig. 2,  where  the  poly¬ 
screen  for  DMP  is  given. 

A  solver  is  a  procedure  which  implement 
the  algorithms  (strategies)  of  application  of 
knowledge  from  KB  to  data  from  DB.  Solver  as 
well  as  KB  is  a  component  of  DMM. 
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Fig.  2.  The  example  of  poly-screen  for  DMP  (prototype  for  MCP) 


Under  designing  RTDSS  for  monitoring 
and  controlling  the  sophisticated  objects,  in 
particularly,  the  semiotic  systems  of  operative¬ 
dispatching  control  by  nuclear  stations,  traditional 
methods  of  information  search  are  impossible  in 
principle  due  to  huge  amount  of  interrelated  and 
sometimes  inconsistent  facts  and  laws  describing 
these  objects.  Therefore  for  manipulating  and 
processing  knowledge,  representing  by  a  collection 
of  production  rules,  it  is  necessary  to  have  an 
inference  engine  which  implements  decision¬ 
making  procedures  and  forms  a  control 
recommendation  list  for  such  systems. 

We  have  focused  on  parallel  inference 
methods  and  procedures  because  only  parallelism 
of  inference  procedures  allows  to  increase  the 
speed  of  inference  processes  and  to  treat  a  huge 
amount  of  productions.  As  the  inference  method 
we  have  chosen  the  parallel  inference  method  on  a 
special  type  semantic  networks,  possessing  several 
types  of  parallelism  (or-parallelism,  and- 
parallelism,  dcdp-parallelism). 

As  semiotic  systems  are  open  ones,  they 
must  take  into  account  semantic  and  pragmatic 
aspects  of  knowledge  processing.  Moreover  they 
assume  the  support  of  commonsense  reasoning, 
the  methods  of  plausible  inference  based  on  the 
apparatus  of  non-traditional  logics  (nonmonotonic. 


modal,  fuzzy  and  abductive  ones).  To  operate  with 
exceptions  of  knowledge  base,  it  is  suggested  to 
apply  default  logic  by  R.  Reiter  [5], 

The  forecasting  block  carries  out 
functions  of  forecasting  of  abnormal  situations  and 
consequences  of  managing  actions.  The 
forecasting  is  made  on  a  DMP  command  on  the 
basis  of  data  on  a  current  state  of  controlled  object, 
acting  from  DB,  and  on  the  basis  of  knowledge, 
stored  in  KB.  The  managing  action  is  either  a  set 
by  DMP,  or  the  recommendation  given  by  G2 
solver. 

The  information  mapping  block  carries 
out  functions  of  information  representation  to 
DMP.  Initial  data  for  it  are  data  from  DB,  results 
of  an  estimation  of  a  object  state,  received  by  a 
solver,  results  of  the  forecasts,  made  by  the 
forecasting  block,  and  instructions,  given  out  by 
the  instruction  block. 

The  information,  on  the  one  hand,  should 
be  displayed  in  the  form  convenient  for  a  fast 
recognition  by  DMP  and,  on  the  other  hand, 
should  be  as  it  possible  more  complete.  These 
requirements  contradict  each  other,  as  the 
increasing  of  volume  of  the  sign  information 
decreases  an  ability  of  the  person  to  perceive  it. 
The  problem  is  decided  by  means  of  the  multilevel 
circuit  of  information  mapping  with  application  of 
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a  hypertext  technology  and  cognitive  graphics. 
The  information  mapping  unit  for  a  prototype  is 
the  display  of  a  workstation.  The  motionless  image 
on  the  screen  corresponds  to  a  static  object  state, 
and  the  movement  displays  transition  of  an  object 
in  a  new  condition. 

For  mapping  the  information  a  number  of 
working  spaces  is  entered  (fig.2): 

1.  Working  space  with  the  scheme  of 
MCP  (CAP.  EJ).  its  auxiliary  systems  and  gauges. 
MCP  is  submitted  by  an  icon  of  a  class  gen.  The 
given  class  has  a  few  ports  for  connection  with 
elements  of  the  pump  auxiliary  subsystems:  an 
independent  contour,  circulation  of  oil  in  bearings, 
locking  water,  cooling  liquids.  The  gauges  are 
represented  on  groups  by  graphic  images  of 
parameters  fixed  by  them  MCP.  The  working 
space  of  parameter  becomes  visible  at  pressing  of  a 
mouse  key  on  an  icon  of  parameter.  Dynamics  of 
processes,  occurring  in  MCP.  is  displayed  by 
change  of  graphic  images  color  of  parameters. 
Looking  on  the  scheme,  DMP  can  qualitatively 
estimate  a  state  of  MCP  and  define,  which 
parameters  are  outside  of  a  range  of  allowable 
values  (these  parameters  are  allocated  by  red 
color).  For  more  detailed  acquainting  with  a  state 
of  MCP.  the  operator  has  access  to  the  parameter 
workspaces. 

2.  Working  space  of  the  urgent  messages. 
At  a  normal  condition  in  the  working  space  of  the 
urgent  messages  there  is  only  one  message 
"System  MCP  is  in  norm".  This  message  has  a 
green  background  and  does  not  signal  about  any 
anomaly.  At  occurrence  of  abnormal  situations  in 
working  space  there  are  the  appropriate  messages 
on  a  red  background. 

3.  Working  space  of  KB.  In  it  the 
decision  tree  of  DMM  is  located.  In  basic  this 
space  is  intended  for  the  expert  and  knowledge 
engineer  for  creating  and  testing  KB.  In  a  decision 
making  mode  the  means  for  a  choice, 
concealment,  moving  and  change  of  the  size  of  the 
specified  working  spaces  are  given  to  DMP.  With 
the  help  of  these  means  he  can  design  project 
interface  with  the  application.  In  addition  to  DB 
the  messages  base  is  realized  which  contains  all 
diagnostic  messages,  that  can  be  given  to  the 
operator  in  a  decision  making  mode.  The  message 
base  places  in  separate  working  space  and  consists 
of  copies  of  a  class  "gen-message",  being  a 
subclasses  of  a  built-in  class  G2  message  with 
attribute  “text”.  The  class  "gen-message"  is 
complemented  by  integer  attribute  "message- 
number"  and  logic  attribute  "message  actuality”. 

Modeling  block  simulates  behavior  of  an 
object.  It  can  act  as  the  agent  of  data  about  an 
object  state  at  a  stage  of  system  testing  and  at  a 
decision  making  stage  for  comparison  with  data, 
acting  from  gauges.  The  modeling  block  can  also 


be  used  together  with  the  forecasting  block  for  the 
forecast  of  abnormal  situations  and  consequences 
of  managing  actions. 

4.  Conclusion 

During  implementation  of  the  RTDSS 
prototype  for  monitoring  and  management  of  a 
nuclear  station  power  block  [4],  31  classes  of 
objects.  12  subclasses  of  variables,  2  subclasses  of 
connections  between  objects,  7  relations  between 
objects  were  defined;  38  generalized  rules,  45 
procedures  and  4  functions  were  written.  DB  about 
MCP  includes  33  parameters,  about  CAP  -  21 
parameters,  about  EJ  -  32  parameters.  For  each 
continuous  parameter,  a  subworkspace  and  objects 
for  graphic  display  of  dynamics  of  changing  its 
values  are  defined.  The  internal  representation  of 
the  prototypes  occupies  860K. 

This  work  was  supported  by  the  Russian 
Fund  of  Fundamental  Researches  (project  99-01- 
00049). 
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Abstract 

This  paper  announces  the  world’s  first  binocular 
stereovision  system  for  robots  that  solves  the  problem  of 
the  virtual  image  which  occurs  when  the  viewed  object 
consists  of  repeated  patterns.  This  new  technique,  which 
was  invented  by  the  principal  author,  is  called  the 
rotation  disparity  mechanism.  The  technique  also 
addresses  occlusion,  a  problem  unique  to  stereovision 
systems,  making  it  possible  to  extract  occlusion  areas 
from  images.  The  new  binocular  vision  system 
comprises  four  CCD  cameras,  two  for  each  eye. 

1.  Virtual  Image  Problem  and  Rotation  Disparity 
Mechanism 

The  virtual  image  problem  has  long  been  considered  a 
critical  defect  of  usual  binocular  stereovision  systems  and 
has  posed  difficulties  for  researchers  trying  to  create 
highly  functional  artificial  vision  systems.  Some 
researchers  have  noted  that  virtual  images  seldom  occur 
in  typical  working  environments  while  others  have 
declared  that,  since  even  human  vision  cannot  overcome 
the  virtual  image  problem,  it  is  an  acceptable  defect  of 
robot  vision.  The  former  statement  simply  admits  the 
total  absence  of  a  safety  mechanism  should  a  virtual 
image  occur.  The  latter  statement  is  utterly  unacceptable 
considering  the  fact  that  humans  are  in  fact  not  annoyed 
by  virtual  images. 

Two  methods  are  currently  available  to  solve  the 
virtual  image  problem. 

One  is  to  discard  the  idea  of  usual  binocular 
stereovision  and  instead  develop  a  stereovision  system 
with  three  or  more  eyes.  The  other  is  to  improve  on  the 
concept  of  a  binocular  system,  and  attempt  to  enable  a 
binocular  vision  robot  to  detect  by  its  own  movement 
any  virtual  images  that  may  occur. 

With  the  former  method,  researchers  typically  study 
techniques  to  discriminate  virtual  and  real  images  with 
the  third  vision  component.  Since  the  positional 
information  on  the  detected  object  is  used  to  distinguish 
between  virtual  and  real  images,  such  a  system  does  not 
guarantee,  in  principle,  that  all  virtual 


images  will  be  eliminated.  With  the  latter  method, 
safe  robot  motion  cannot  be  guaranteed  unless  virtual 
images  are  detected  before  the  robot  starts  to  move. 

The  principal  author  therefore  concluded  that  when 
using  a  binocular  system  similar  to  human  vision,  a 
technique  to  detect  virtual  images  would  be  required  to 
be  built  in  to  the  binocular  stereovision  system  itself  and 
called  this  technique  the  rotation  disparity  mechanism. 

The  rotation  disparity  mechanism  is  essentially  based 
on  the  high-speed  movement  of  at  least  one  of  the  two 
eyes  of  the  system,  rather  than  increasing  the  number  of 
eyes  from  two  to  three  or  more  (a  multi-eye  system).  The 
motion  of  this  one  eye  will  bring  about  the  motion  of 
virtual  images  and  by  detecting  this  motion,  virtual 
images  are  discriminated.  Importantly,  real  images  do 
not  move  when  a  single  eye  is  moved  at  high  speed 
(Figure  1).  Any  form  of  high-speed  motion  generates 
similar  effects.  The  authors  selected  rotational  movement 
because  rotation  can  be  controlled  more  easily  and  both 
horizontal  and  vertical  parallax  information  can  be 
evaluated  simultaneously. 

The  authors'  technique  is  different  from  multi-eye 
vision,  a  conventional  research  theme,  because  the 
motion  of  at  least  one  eye  is  controlled.  The  new 
technique  further  does  not  physically  require  a  number  of 
imaging  cameras.  It  is  also  different  from  vision  research 
that  uses  saccadic  movement  because  the  vibration  of 
saccadic  movement  is  fixed  in  one  particular  direction, 
and  for  this  reason  it  is  not  actively  used  to  eliminate 
virtual  images. 

2.  Robot  Vision  System  Featuring  the  Proposed 
Rotation  Disparity  Mechanism 

The  robot  vision  system  proposed  by  the  authors 
comprises  four  CCD  cameras,  two  DC  motors,  two 
rotary  encoders,  and  six  DC  servo  motors. 

Two  CCD  cameras  are  paired  to  create  two  eyes  in  the 
system.  One  camera  in  each  pair,  with  a  zoom  function 
and  a  narrow  focal  field,  is  used  to  recognize  objects.  The 
other  camera  in  each  pair,  with  a  wide-angle  lens, 
captures  3 -dimensional  space  in  the  line  of  sight  using 
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the  rotation  disparity  mechanism.  The  first  camera  thus 
has  a  function  equivalent  to  the  central  vision  of  human 
eyesight,  and  the  latter  is  equivalent  to  peripheral  vision 
(Figure  2).  This  vision  system  has  a  three-tier  structure. 
The  bottom  tier  accommodates  one  DC  motor  and  one 
rotary  encoder  which  are  used  to  move  the  cameras 
installed  horizontally  on  the  top  tier.  The  middle  tier 
holds  one  DC  servo  motor  to  precisely  move  the  cameras 
in  synch  with  the  DC  motors  on  the  bottom  tier.  The 
top  tier  also  holds  the  pan  and  tilt  mechanism  for  the 
cameras  and  measures  rotation  disparity  with  the 
peripheral  vision  camera.  Specifically,  the  peripheral 
vision  camera  is  rotated  about  the  center  axis  of  the 
central  vision  camera  while  the  direction  of  the  central 
line  of  sight  is  kept  synchronized  with  that  of  the  central 
vision  camera.  The  peripheral  vision  camera  is  initially 
positioned  directly  above  the  central  vision  camera.  The 
direction  of  rotation  is  such  that  the  base  distance 
between  the  right  and  left  cameras  increases,  or  the 
peripheral  vision  cameras  rotate  in  opposite  directions. 
The  cameras  start  rotating  when  a  virtual  image  is 
detected  in  an  image. 


Fig.l  Principle  of  virtual  image  cancellation 
by  the  rotation  disparity  mechanism 


R 1  ,R2,R3:  Real  Objects 

VI  ,v2,v3:  Virtual  Images 

C1,C2:  Peripheral  Cameras  of  New  Stereovision 


Fig.  2  A  Prototype  of  New  Binocular  Stereovision 

3  Virtual  Image  Cancellation  Techniques 

This  section  defines  the  meaning  of  virtual  images  and 
describes  a  technique  to  cancel  virtual  images  using  our 
new  mechanism. 

3,1  Occurrence  of  Virtual  Images 

A  virtual  image  occurs  when  one  attempts  to  calculate 
the  distance  in  a  3-dimensional  space  using  two  or  more 
image  data  captured  from  different  positions. 

A  distance  calculation  technique  is  comprised  of 
finding  a  certain  same  image  which  is  only  partially 
included  in  two  or  more  image  data  and  performing  the 
necessary  calculations  based  on  the  position  of  the 
detected  image  and  the  position  of  the  imaging 
equipment  used  (Figure  3). 

Virtual  images  can  occur  for  the  following  reasons: 

1 .  Noise  in  image  information 

2.  Ambiguous  interpretation  caused  by  quantization 
errors 

3.  Repetitive  information  on  an  image 

Virtual  images  can  occur  due  to  1),  however,  noise 
information  can  never  be  permanent.  Noise  can  be 
eliminated  within  a  time  period  that  is  small  enough  not 
to  obstruct  real-time  observation. 

As  for  2),  two  or  more  image  data,  which  are  actually 
different  from  each  other,  can  be  interpreted  as  the  same 
information  due  to  quantization  errors  during  image 
capture.  Such  errors  occur  due  to  the  quantization 
characteristics  of  the  imaging  equipment  and  variables  of 
the  external  environment.  Thus,  like  1),  the  ambiguous 
information  cannot  be  permanent  and  can  therefore  be 
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(b) 


unreal  image  below  the  corridor 
real  image  on  the  corridor 
unreal  image  above  the  corridor 

Fig.  3  Binocular  stereo  vision  and  the  virtual  images 


119 


K  * 


r  \  \  »  * 

\  \  \  \  ' 
\  \  \  \  ; 
\  Q  6  4 

N  \  \  \  | 

N  n  \  V  ; 


nq  'o  t>  b  Q  V 

n  \  '  \  ; 

^  q  b  6  i 

Qbb'  1 


JUU  '  » 

T?A  \  ■ 

\\\  \ ; 
\W  \> 


\s  \'  1 
'Vi 
44 


Fig4.  Candidates  of  the  Virtual  Image 

eliminated.  Quantization  errors  can  also  be  reduced  by 
combining  data  when  searching  for  the  same  information. 

As  for  3),  virtual  images  cannot  be  avoided  if  the 
image  information  is  repetitive. 

Thus,  points  1  and  2  relate  to  noise  rather  than  the 
occurrence  of  virtual  images,  and  point  3  is  the  true 
virtual  image  issue. 

3.2  Techniques  for  Canceling  Virtual  Images 

This  section  describes  three  techniques  for  canceling 
virtual  images  using  the  binocular  stereovision  system 
incorporating  the  rotational  disparity  mechanism 
proposed  by  the  authors.  The  first  technique  is  used  for 
observing  a  static  3-dimensional  space.  Static  space  here 
means  that  the  object  space  does  not  change 
substantially.  The  second  technique  is  used  for  observing 
a  changing  space.  The  last  technique  involves  the  use  of 
a  neutral  network. 


Fig.  5  Principle  of  virtual  image 
cancel  lationby  the  rotation 
disparity  mechanism 
C1,C2:  Peripheral  Cameras 
CL,CR:  Central  Cameras 


The  basic  concept  for  identifying  virtual  images  was 
described  in  section  1  using  an  example  of  a 

2- dimensional  plane.  Specifically,  this  technique  detects 
positional  information  in  the  3-dimensional  space  using 
a  vision  system,  then  detects  positional  changes  in  the 

3- dimensional  space  using  the  rotation  disparity 
mechanism. 

Candidate  virtual  images  can  easily  be  identified  when 
two  or  more  objects  are  in  the  same  line  of  sight  (Figure 
4).  In  Figure  4,  three  objects  are  present  on  line  of  sight 
L.  It  is  unknown  which  is  the  real  image.  It  is  obvious 
that  at  least  one  real  image  exists. 

3.2.2  Observing  Dynamic  Space 


3.2.1  Observing  Static  Space 

A  moving  vision  system  may  be  classified  into  this 
category  provided  the  movement  information  is  clearly 
known.  Static  space  observation  includes  cases  where  the 
motion  of  the  object  is  very  slow  compared  with  the 
motion  of  the  camera  by  the  rotation  disparity 
mechanism. 


The  dynamic  space  problem  can  be  reduced  to  the 
3.2.1  problem  if  changes  in  the  object  space  are 
detectable.  Such  changes  can  easily  be  detected  using  the 
central  vision  camera  of  the  new  vision  system  proposed 
by  the  authors  in  combination  with  a  technique  for  detect 
floating  points  or  by  other  techniques.  When  the  object 
space  changes,  information  derived  from  both  central  and 
peripheral  vision  cameras  is  integrated  to  identify  virtual 
images  (Figure  5).  This,  however,  applies  only  when  the 
object  is  moving  at  a  relatively  high  speed.  One  may 
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Fig.6  General  Desighn  of  the  Stereovision 


thus  believe  that  it  is  more  natural  for  the  vision 
system  to  track  the  object  with  the  central  vision  camera 
rather  than  attempting  to  capture  solid  information.  In 
this  figure,  Cl  and  C2,  and  CL  and  CR  are  the 
peripheral  and  central  vision  cameras,  respectively.  Cl' 
and  C2?,  representing  the  rotation  disparity  mechanism, 
show  the  result  of  the  individual  motion  of  Cl  and  C2 
cameras.  In  this  example,  the  motion  of  the  rotation 
disparity  mechanism  from  Cl  to  Cl',  or  from  C2  to  C2? 
is  slow  compared  with  the  motion  of  the  object  from  R1 
to  RF,  so  it  is  necessary  to  measure  the  motion  of  the 
object  using  the  CL  and  CR  central  vision  cameras. 

3.2.3  Using  a  Neutral  Network 

This  technique  is  described  only  for  an  object  which  is 
moving  veiy  slowly  compared  with  the  motion  of  the 
camera  by  the  rotation  disparity  mechanism,  or  when  the 
object  space  is  static. 

The  technique  devised  by  the  author  comprises  the 
following  steps: 

(a)  Detect  solid  information  using  binocular  parallax 

(b)  Find  virtual  images  using  the  rotation  disparity 


mechanism 

(c)  Delete  the  virtual  image  portions  of  the  solid 
information  of  step  (a)  using  the  information  derived  in 
step  (b) 

The  technique  adopted  here  is  similar  to  the  technique 
described  in  3.2.1  except  that  the  former  employs  a 
sequential  calculation  technique  while  this  technique  is  a 
kind  of  conditioned  reflex  technique  using  a  parallel 
processing  neutral  network.  In  step  (a),  solid  information 
is  calculated  using  image  information  derived  from  the 
right  and  left  eyes.  The  basic  portion  of  this  technique 
resembles  the  famous  research  by  Prof.  Paul  M. 
Churchland  at  the  University  of  California,  San  Diego. 
However,  the  authors'  research  is  different  in  that  the 
solid  information  has  the  same  resolution  as  the  number 
of  picture  elements.  Furthermore,  different  objects  in  the 
same  line  of  sight  can  be  managed  separately.  In  other 
words,  our  system  can  measure,  so  far  as  the 
functionality  is  concerned,  the  solid  information  for  all 
objects  in  the  nearest  measurable  position  to  those 
located  at  the  point  of  infinity.  In  reality,  however, 
distant  solid  information  contains  quantization  errors.  In 
step  (b),  the  peripheral  vision  cameras  are  turned  at  high 
speed  using  the  rotation  disparity  mechanism  to  identify 
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Direction  of  Sight 


Space  Celles 

Fig.7  Virtual  and  Real  Images  on  the  Spece  Celles 


any  virtual  images.  The  virtual  images  are  then 
canceled  at  step  (c).  To  be  specific,  the  processing 
system  comprises  a  right  and  a  left  retina,  pattern  fusion 
cell  and  space  cell  (neutral  network).  The  pattern  fusion 
cell  Pi  determines  the  coincidence  of  an  image  projected 
on  the  two  retinas  with  the  horizontal  parallax  value  of 
"i."  The  result  is  transmitted  to  the  space  cell  Si.  Si 
records  the  position  of  the  object  located  at  a  distance 
corresponding  to  the  parallax  value  "i"  in  the  relevant 
vision  system.  For  i  =0,  the  Si  information  suggests 
that  the  object  is  located  at  the  point  at  infinity.  All 
information  from  SO  to  Sn  can  be  calculated.  We  can 
calculate  the  direction  of  the  line  of  sight  and  distance  of 
all  objects  visible  between  the  point  at  infinity  in  the 
environment  viewed  by  the  vision  system  and  the 
distance  equivalent  to  the  parallax  value  "n."  If  in  this 
information,  two  or  more  objects  are  observed  in  the 
same  line  of  sight,  we  then  have  to  determine  if  they  are 
real  or  virtual  images  (Figure  7).  In  Figure  7,  objects  R 
and  V  are  observed  on  the  space  cells  Sk-1  and  Sk+1, 
respectively.  The  rotation  disparity  mechanism  is  now 
identifying  whether  they  are  real  or  virtual.  If,  for 
example,  the  object  V  on  Sk+1  is  virtual,  and  R  real,  the 
object  V  is  moving  in  the  Sk  side  while  R  is  static. 
Suppression  signals  are  sent  to  object  V  on  Sk+1  of  the 
space  cell,  which  deletes  object  V  in  step  (c). 

4  Future  Development  Prospects 

The  basic  concept  described  in  this  paper  was 
announced  at  the  annual  academic  lecture  meeting  of  the 
RoboticsSociety  od  Japan  in  September  1998.  [1]  [2]  At 
this  time,  however,  the  authors  have  not  yet  completed 
this  study.  We  have  so  far  manufactured  a  prototype  of 
the  hardware  section  of  the  binocular  stereovision  system 
with  a  rotation  disparity  mechanism.  The  relevant 
software  is  still  under  development.  The  neutral  network 


is  in  the  concept  stage,  and  we  are  currently 
manufacturing  a  prototype  for  the  basic  portion  of  the 
network. 

Afterword 

This  paper  reported  on  a  new-type  binocular 
stereovision  system  to  solve  the  problem  of  virtual 
images.  It  introduced  the  principle  of  the  rotation 
disparity  mechanism,  the  theoretical  principle  of 
detection  and  elimination  of  virtual  images,  and  a 
technique  to  detect  occlusion.  An  experimental  prototype 
system  was  introduced  to  demonstrate  these  principles. 
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Abstract 

This  paper  presents  a  new  approach  to  visual  servoing 
with  the  stereo  vision.  In  order  to  control  the  position  and 
orientation  of  a  robot  with  respect  to  an  object ,  a  new 
technique  is  proposed  using  a  binocular  stereo  vision. 
The  stereo  vision  enables  us  to  calculate  an  exact  image 
Jacobian  not  only  at  around  a  desired  location  but  also  at 
the  other  locations.  The  suggested  technique  can  guide  a 
robot  manipulator  to  the  desired  location  without  giving 
such  priori  knowledge  as  the  relative  distance  to  the 
desired  location  or  the  model  of  an  object  even  if  the 
initial  positioning  error  is  large.  This  paper  describes  a 
model  of  stereo  vision  and  how  to  generate  feedback 
commands.  The  performance  of  the  proposed  visual 
servoing  system  is  illustrated  by  the  simulation  and 
experimental  results  and  compared  with  the  case  of 
conventional  method  for  a  SCARA  robot. 

The  abstract  is  to  be  in  fully-justified  italicized  text,  at  the 
top  of  the  left-hand  column  as  it  is  here,  below  the  author 
information.  Use  the  word  “ Abstract ”  as  the  title,  in  12- 
point  Times,  boldface  type,  centered  relative  to  the 
column,  initially  capitalized.  The  abstract  is  to  be  in  10- 
point,  single-spaced  type,  and  may  be  up  to  3  in.  (7.62 
cm)  long.  Leave  two  blank  lines  after  the  abstract,  then 
begin  the  main  text.  All  manuscripts  must  be  in  English. 


1.  Introduction 

There  are  mainly  two  ways  to  put  the  visual  feedback 
into  practice.  One  is  called  look-and-move  and  the  other 
is  visual  servoing.  The  former  is  the  method  which 
transforms  the  position  and  orientation  of  an  object 


obtained  by  a  visual  sensor  into  those  in  the  world  frame 
fixed  to  an  environment  and  guides  the  arm  of  the 
manipulator  to  a  desired  location  in  the  world  frame  [1,2]. 
In  this  method,  precise  calibration  of  a  manipulator  and 
camera  system  is  needed.  On  the  contrary,  visual  servoing 
uses  the  Jacobian  matrix  which  relates  the  displacement  of 
an  image  feature  to  the  displacement  of  a  camera  motion 
and  performs  a  closed-loop  control  regarding  the  feature 
as  a  scale  of  the  state.  Therefore,  we  can  construct  a  servo 
system  based  only  on  the  image  and  can  have  a  robust 
control  against  the  calibration  error  because  there  is  no 
need  to  calculate  the  corresponding  location  in  the  world 
frame  [2,3,4].  A  hand  eye  system  is  often  used  in  visual 
feedback  and  there  are  two  ways  of  arranging  the  system. 
One  is  placing  a  camera  and  a  manipulator  separately;  the 
other  is  placing  the  camera  at  the  end-tip  of  the 
manipulator.  The  former  motion  strategy  of  the 
manipulator  becomes  more  complicated  than  the  latter.  In 
the  latter,  it  is  easy  to  control  the  manipulator  using  a 
visual  information  because  the  camera  is  mounted  on  the 
manipulator  end-tip.  In  this  paper,  we  deal  with  the  latter 
method.  In  the  conventional  works,  some  researches  have 
presented  methods  to  control  the  manipulator  position 
with  respect  to  the  object  or  to  track  the  feature  points  on 
an  object  using  a  hand  eye  system  as  the  application  of 
visual  servoing  [3,4].  These  methods  maintain  or 
accomplish  a  desired  relative  position  between  the  camera 
and  the  object  by  monitoring  feature  points  on  the  object 
from  the  camera  [5,6], 

However,  these  have  been  all  done  by  the  hand  eye 
system  with  monocular  visions  and  it  is  necessary  to 
compensate  for  the  loss  of  information  because  the 
original  three-dimensional  information  of  the  scene 
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is  reduced  to  two-dimension  information  on  the 
image.  For  instance,  we  must  add  an  information  of 
the  three-dimension  distance  between  the  feature  point 
and  the  camera  in  advance  or  use  a  model  of  object 
stored  in  the  memory.  Besides,  a  problem  that  the 
manipulator  position  fails  to  converge  to  a  desired 
value  arises  depending  on  the  way  of  selecting  feature 
points  or  when  the  initial  positioning  error  is  not 
small.  It  is  because  some  elements  of  the  image 
Jacobian  cannot  be  computed  with  only  the 
information  of  the  image  and  substituting  approximate 
values  at  the  desired  location  for  them  may  result  in 
large  errors  at  the  other  locations. 

This  paper  presents  a  method  to  solve  this  problem 
by  using  a  binocular  stereo  vision.  The  use  of  stereo 
vision  can  lead  to  an  exact  image  Jacobian  not  only 
at  around  a  desired  location  but  also  at  the  other 
locations.  The  suggested  technique  places  a  robot 
manipulator  to  the  desired  location  without  giving 
such  priori  knowledge  as  the  relative  distance  to  the 
desired  location  or  the  model  of  an  object  even  if  the 
initial  positioning  error  is  large.  This  paper  deals  with 
modeling  of  stereo  vision  and  how  to  generate 
feedback  commands.  The  performance  of  the  proposed 
visual  servoing  system  was  evaluated  by  the 
simulations  and  experiments  and  obtained  results  were 
compared  with  the  conventional  case  for  a  SCARA 
robot. 

2.  Stereo  Vision  Model 

We  define  the  frame  of  a  hand-eye  system  with  the 
stereo  vision  and  use  a  standard  model  of  the  stereo 
camera  whose  optical  axes  are  set  parallel  each  other 
and  perpendicular  to  the  baseline.  The  focal  points  of 
two  cameras  are  apart  at  distance  d  on  the  baseline 
and  the  origin  of  the  camera  frame  2C  is  located  at 

the  center  of  these  cameras.  Fig.  1  represents  the 
schematic  diagram  of  a  suggested  visual  servoing 
system.  In  Fig.  1  two  DSP  vision  boards  (MVB03) 
are  used,  which  were  made  Samsung  Electronics 
Company  in  Korea  based-on  the  TMS320C30  chips. 

An  image  plane  is  orthogonal  to  the  optical  axis  and 
apart  at  distance  /  from  the  focal  point  of  a  camera 
and  the  origins  of  frame  of  the  left  and  right  images 
,  2  /  and  En  are  located  at  the  intersecting  point 
of  the  two  optical  axes  and  the  image  planes.  The 


origin  of  the  world  frame  2  w  ls  located  at  a 

certain  point  in  the  world.  The  x,  y ,  and  ^  axes  of 
the  coordinate  frames  are  shown  in  Fig.  2. 

Now  let  lp=(  lxy  ly)  and  rp=(rxt  ry)  be  the 
projections  onto  the  left  and  right  images  of  a  point 
p  in  the  environment,  which  is  expressed  as 
cp=(cxcycz)T  in  the  camera  frame.  Then  the 
following  equation  is  obtained  (see  Fig.  2). 


'x  cz  =  f(cx  +  0.5flO  (1-a) 

rx  CZ  =  f(cx-  0.5 d)  (1-b) 

ly  cz  =  fc  y  (1-c) 

ry  c2  =  fc  y  (l-d) 


Suppose  that  the  stereo  correspondence  of  feature 
points  between  the  left  and  right  images  are  found. 
In  the  visual  servoing,  we  need  to  know  the  precise 
relation  between  the  moving  velocity  of  camera  and 
the  velocity  of  feature  points  in  the  image,  because 
we  generate  a  feedback  command  of  the  manipulator 
based  on  the  velocity  of  feature  points  in  the  image. 


Fig.  1  Schematic  diagram  of  visual  servoing  system. 


Fig.  2  The  coordinates  system  of  stereo  vision  model. 
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This  relation  can  be  expressed  in  a  matrix  form 
which  is  called  the  image  Jacobian.  Let  us  consider 
n  feature  points  pk{k~\,  •••,  n)  on  the  object  and 
the  coordinates  in  the  left  and  right  images  are 
'/>*(  xk,  !yk)  and  rpk{  rxk,  ryk),  respectively.  Also 
define  the  current  location  of  the  feature  points  in  the 
image  lp  as 

‘p  =  Cx,  rXl  'y,  'y,  %  -  'yn  ryn)T  (2) 


CP  =  cRk{~  Wwcx  (  *p-  ")>,)}-  ‘Ru*pc 

=  -  cwcx  cp-epc  (6) 

—  wy  cz+wxcy—  vz' 

=  —W2CX+WxCZ—Vy 
—  wxcy+ wycx— vz 

Therefore,  substituting  Eq.  (6)  into  Eq.  (3),  we 
have  the  following  equation. 


where  each  element  is  expressed  with  respect  to  the 
virtual  image  frame 

First,  to  make  it  simple,  let  us  consider  a  case 
when  the  number  of  the  feature  points  is  one.  The 
relation  between  the  velocity  of  feature  point  in 
image  rp  and  the  velocity  of  camera  frame  cp  is 
given  as 

!P  =  7,  cp  (3) 

where  PJC  is  the  Jacobian  matrix  which  relates  the 
two  frames.  Now  let  the  translational  velocity 
components  of  camera  be  a  r,  a  y,  and  a  z  and 
the  rotational  velocity  components  be  wx,  toy,  wz 
then  we  can  express  the  camera  velocity  V  as 

V  —  [ ox  oy  az  wx  Wy  wz  ]T 

(4) 

=  [CVCCWC]T 

Then  the  velocity  of  the  feature  point  seen  from  the 
camera  frame  cp  can  be  written 


n  _  It  cl 

P  ~  Jc  P 
=  JV 


(7) 


In  Eq.  (7)  matrix  J  which  expresses  the  relation 

between  velocity  !p  of  the  feature  point  in  the  image 

and  moving  velocity  V  of  the  camera  is  called  the 

image  jacobian. 

From  the  model  of  the  stereo  vision  Eq.  (1),  the 

following  equation  can  be  obtained. 


cx  (2  lx  —  rx)  —  dlx  + 

(8) 
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Above  discussion  is  based  on  the  case  of  one 
feature  point.  In  practical  situation,  however,  the 
visual  servoing  is  realized  by  using  plural  feature 
points.  When  we  use  n  feature  points,  image 
Jacobian  are  given  from  the  coordinates  of 

feature  points  in  the  image.  By  combining  them,  we 
express  the  image  Jacobian  as 


Jm=lJl . JnV  01) 


p  dt 

=  -%cRAu'P~KPc)  (5) 

=f7?,{-  Vx(  7-  WP<- )}+  cRu(  wi>-  wPc) 

where  CRW  is  the  rotation  matrix  from  the  camera  frame 
to  the  world  frame  and  wpc  is  the  location  of  the  origin 

of  the  camera  frame  written  in  the  world  frame.  As  the 
object  is  assumed  to  be  fixed  into  the  world  frame, 
wp=  0.  The  relation  between  cp  and  V  is 


Then,  it  is  possible  to  express  the  relation  of  the 
moving  velocity  of  the  camera  and  the  velocity  of  the 
feature  points  even  in  the  case  of  plural  feature 
points,  that  is, 

'P  =  Inn  V  (12) 

where  we  suppose  that  the  stereo  and  temporal 
correspondence  of  the  feature  points  are  found. 

In  the  case  of  the  monocular,  the  image  Jacobian  J 
has  the  following  form. 


125 


We  now  introduce  the  positional  vector  of  the 
feature  point  in  the  image  of  monocular  vision  using 
the  symbol  mP  =  (mx,  my) .  This  is  the  projection 
of  the  point  expressed  as  CP  =  (cx  cy  cz)T  in  the 
camera  frame  into  the  image  frame  of  the  monocular 
vision,  and  has  the  following  relation. 

mx  =  f  cx  V1  (14-a) 

mx  =  f  cy  V1  ( 1 4-b) 

Substituting  Eq.s  (14-a)  and  (14-b)  into  Eq.  (13) 
yields  another  expression  of  the  image  Jacobian  for 
the  monocular  vision. 


A  disparity  which  corresponds  to  the  depth  of  the 
feature  point,  is  included  in  J  in  the  case  of  the 
stereo  vision,  but  s-term  expressed  in  the  camera 
frame  cz  is  included  in  J  in  the  case  of  the 
monocular  vision. 

3.  Feedback  Command 

In  the  visual  servoing,  the  manipulator  is  controlled 
so  that  the  feature  points  in  the  image  reach  their 
respective  desired  locations. 

We  define  an  error  function  between  the  current 
location  of  the  feature  points  in  image  !p  and  the 
desired  location  !pd  as 

E=Q{Ip-‘pd)  (16) 

where  Q  is  a  matrix  which  stabilizes  the  system. 
Then  the  feedback  law  is  defined  as  following 
equation 


where  G  corresponds  to  a  feedback  gain. 

To  realize  the  visual  servoing,  we  must  choose  Q 
so  that  convergence  is  satisfied  with  the  error  system 
can  be  satisfied  with 


=  Q  'p  (IB) 

=  QJmV 

=  ~G  Q  J im  E 

We  use  pseudo-inverse  matrix  of  the  image 
Jacobian  J im  for  Q  to  make  Q  J im  positive  and 
not  to  make  an  input  extremely  large,  that  is, 

Q=rim=uT,m},myljT,m  (i9) 

Therefore,  the  feedback  command  is  given  as 

V=-Grfm  (  'p-  %)  (20) 

Fig.  3  shows  a  block  diagram  of  the  control  scheme 
described  by  Eq.  (20).  Note  that  the  feedback 
command  u  is  sent  to  the  robot  controller  and  both  the 
transformation  of  u  to  the  desired  velocity  of  each 
joint  angle  qd  and  its  velocity  servo  are  accomplished 
in  the  robot  controller  as  show  in  Fig.  3. 

Futhermore,  as  J im  is  a  4n*6  matrix  and  pseudo¬ 
inverse  matrix  /|m  is  a  6x4 n  matrix,  a  feedback 

command  Eq.  (20)  of  6  degrees  of  freedom  is 
obtained. 


V  —  —  G  E  (17)  Fig.  3  Block  diagram  of  visual  feedback  system. 
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4.  Simulation  and  Experiments 
4.1.  Simulation 

We  have  compared  the  visual  servoing  using  the 
monocular  vision  with  that  using  the  stereo  vision  by 
the  simulation.  In  the  simulation,  feature  points  of  an 
object  are  the  four  corners  of  a  square  whose  side 
dimension  is  300  mm.  In  the  same  condition,  we 
used  four  feature  points  even  in  the  stereo  vision. 
Parameters  used  the  focal  length,  f=16  mm,  baseline 
d=130  mm,  sampling  time  of  50  msec,  gain  A — 1 , 
desired  location  cPd  =  (100  100  500)  rmm  desired 
orientation  in  Euler  angle  (p,0,^)  =(0,0,0)  rad, 
initial  error  (-50  -50  -50)rmm  in  the  translation 
and  (<p  ,  <9  ,  <£)  =  (20,20,20)  rad  in  the  orientation, 
error  image.  We  select  the  four  corners  of  a  rectangle 
whose  size  is  200  x200mm  as  the  feature  points  and 
set  the  translational  error  as  (-150  -150  -450)mm  and 
the  other  values  are  the  same  as  before  (Fig.  4). 

In  Fig.  4,  we  can  see  that  the  result  diverges  in 
the  case  of  the  monocular  vision,  but  converges  in 
the  case  of  the  stereo  vision.  This  is  because  the 
image  Jacobian  is  fixed  at  the  desired  location  in 
case  of  the  monocular  vision.  Therefore,  a  correct 
feedback  command  can  not  be  generated  when  the 
initial  error  is  large.  On  the  other  hand,  the  image 
Jacobian  can  be  updated  at  every  in  the  case  of  the 
stereo  vision,  thus  it  is  possible  to  generate  a  correct 
feedback  command  which  assures  the  stability  visual 
servoing. 


x(pixel)  x(pixel) 

Left  Image  Right  Image 


Fig.  4  Trajectories  of  the  feature  points  on  the  images. 


4.2.  Experiments 

In  experiments,  we  used  a  four-axis  SCARA  Robot 
(SM5  Model)  made  in  Korea  with  a  stereo  camera 
attached  to  the  end  tip  of  the  arm.  The  feature 
points  are  three  circular  planes  of  20  mm  radius  on 
three  corners  of  a  equilateral  triangle,  one  side  87 
mm  and  are  placed  on  the  board.  Precise  calibration 
had  not  been  done  for  the  stereo  camera  attached  to 
the  end-tips. 

Two  stereo  images  were  taken  and  transformed  to 
the  binary  images  in  the  real  time  and  in  parallel  by 
two  image  input  devices  and  the  coordinate  of  the 
gravitational  center  of  each  feature  point  was 
calculated  in  parallel  by  two  transputers.  We  gave  the 
stereo  correspondence  of  the  feature  point  in  the  first 
sampling.  However,  the  stereo  and  temporal 
correspondence  of  the  feature  points  in  the  succeeding 
sampling  were  found  automatically  by  searching  a 
nearby  area  where  there  were  the  feature  points  in 
the  previous  sampling  frame.  The  coordinates  of  the 
feature  points  were  sent  to  a  transputer  for  motion 
control  and  it  calculated  a  feedback  command  for  the 
robot.  The  result  was  sent  to  the  robot  controller  by 
using  RS-232C,  and  the  robot  was  controlled  by  a 
velocity  servo  system  in  the  controller. 

The  sampling  period  of  visual  servoing  was  about 
50  msec.  Details  were  16  msec  for  taking  a  stereo 
images,  about  1  msec  for  calculating  the  coordinates 
of  the  feature  points,  3  msec  for  calculating  feedback 
command,  about  16  msec  for  communicating  with  the 
robot  controller.  If  we  send  a  feedback  input  to  the 
robot  controller  without  using  RS-232C,  the  faster 
visual  servoing  can  be  realized. 


Fig.  5  The  Experimental  equipment  set-up. 
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The  desired  location  was  (0,  0,  500)T  mm  and  the 
desired  orientation  in  Euler  angle,  (<£,  Qy  (p)=  (0,  0,  0) 
degree  and  the  initial  error  was  (  50 ,  50 ,  50  )rmm 
for  translation.  The  other  parameters  were  the  same 
as  in  the  simulation.  The  error  of  current  and  desired 
location  of  the  feature  points  are  shown  in  Fig.  7. 
From  these  experimental  results,  we  can  see  that  the 
manipulator  converges  toward  a  desired  location  even 
if  the  calibration  is  not  precise. 


To  use  this  visual  servoing  in  practical  tasks,  there 
still  exist  many  problems  such  as  the  number  of 
feature  points  to  reduce  noise  or  the  quantization 
error  and  the  way  to  choose  feature  points. 
Nevertheless,  this  method  overcomes  the  several 
problems  in  visual  servoing  with  the  monocular 
vision. 
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5.  Conclusion 

This  paper  proposes  a  new  method  of  visual 
servoing  with  the  stereo  vision  to  control  the  position 
and  orientation  of  a  SCARA  robot  with  respect  to  an 
object.  The  method  overcomes  the  several  problems 
associated  with  the  visual  servoing  with  the 
monocular  vision.  By  using  the  stereo  vision,  the 
image  Jacobian  can  be  calculated  at  any  position.  So 
neither  shape  information  nor  desired  distance  of  the 
target  object  is  required.  Also  the  stability  of  visual 
servoing  is  assured  even  when  the  initial  error  is  very 
large.  We  have  shown  the  effectiveness  of  this 
method  by  simulation  and  experiments. 


(b)  Right  image 

Fig.  6  Position  Error  in  #  and  y  axes. 
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Abstract 

This  paper  proposes  a  new  navigation  system  for  an 
autonomous  mobile  robot.  An  earlier  version  of  the  robot 
navigated  by  recognizing  a  single  landmark  at  a  time . 
But,  the  landmark  was  often  lost  sight  of  and  as  a  result, 
the  robot's  self-position  was  lost.  The  authors  propose  a 
method  that  enables  the  robot  to  recover  from  such  an 
error.  The  method  determines  the  self-position  by  using 
information  on  the  comer,  windows  and  doors  of  the 
room  simultaneously  if  the  single  landmark  is  lost. 


1.  Introduction 

Autonomous  mobile  robot  taht  has  vision  sensor  is 
very  useful  in  many  industry  fields.  The  good  navigation 
method  is  to  recognize  the  landmarks  by  the  vision 
sensor  to  detect  a  self-location [1].  However,  it  was 
impossible  that  the  robot  recognizes  the  landmarks 
perfectly  in  the  actual  world  for  the  various  obstruction. 
Therefore,  it  is  necessary  that  the  robot  can  make 
recovery  the  error  that  the  robot  itself  occurred  in.  It  is 
very  important  that  the  robot  detects  the  error,  repairs  the 
condition  and  prevents  the  recurrence  of  the  error  to 
make  recovery.  Therefore,  authors  propose  the  control 
architecture  HALAS.  In  this  paper,  the  Global  Matching 
method  that  is  located  at  the  highest  rank  in  the  sensing 
behavior  included  into  the  HALAS  is  stated.  The  Global 
Matching  is  how  to  recognize  its  self-position,  it  does  not 
depend  on  the  landmark.  When  the  landmark  is  lost  sight 
of  during  the  work,  the  Global  Matching  can  navigate  the 
robot  to  the  set  course  by  using  the  position  information 
such  as  corner  in  the  room,  the  doors,  the  windows,  and 
the  walls.  The  robot  combines  a  wide-ranging 
recognition  by  the  Global  Matching  and  the  local 
recognition  by  the  landmark  with  the  HALAS.  Then,  it 
can  move  with  making  recovery  the  error. 

Wang  proposed  how  to  estimate  the  self-position  at 
the  indoor  from  the  matching  rate  of  the  model  and  the 
edge  screen  gotten  by  the  image  processing[2].  However, 


it  is  impossible  to  decide  the  self-position  of  the  robot 
only  using  the  result  of  the  edge  detection,  because  there 
are  few  geometrical  characteristics  in  the  big  square 
room  where  the  robot  works  easily.  Tonaki  proposed  the 
method  which  the  position  was  decided  by  using  two 
laser  luminous  units[3].  This  method  can  be  expected 
precise  measurement.  But,  the  operator  needs  to  work 
more  because  the  measurement  unit  must  be  installed 
precisely  every  time.  Furthermore,  Ando[4]  and 
Oriolo[5]  proposed  how  to  detect  the  self-position  by 
using  the  ultrasonic  sensor  to  measure  distance  from  the 
wall.  But,  the  measurement  range  of  the  ultrasonic  sensor 
is  so  short  that  the  method  can  not  measure  the  distance 
directly  in  a  large  room.  These  techniques  can  not  give 
full  play  to  its  ability  regarding  self-position 
measurement  in  the  large  office  room.  So,  the  authors 
propose  this  technique.  It  is  stated  about  the  algorithm, 
the  position  measurement  precision  and  the  result  of  the 
movement  experiment  in  this  paper. 

2.  Autonomous  mobile  robot 

We  have  developed  the  robot  that  had  the  navigation 
system  using  the  air  diffusers  as  the  landmarks  and  it  was 
able  to  inspect  the  air  flow  rate  and  temperature[6].  The 
ceiling  air  diffuser  is  called  Anemo  and  it  is  shown  in 
Fig.l.  The  robot  was  equipped  two  CPUs  and  two  CCD 
cameras.  The  robot  consists  of  four  submodules;  the 
vision  system,  the  vehicle  system,  the  sensor  system  and 
the  lift-up  system.  One  CPU  manages  the  vision  system 
that  recognizes  the  landmark  and  calculates  the  robot’s 
position,  and  the  other  one  manages  the  vehicle  system, 
the  sensor  system  and  the  lift-up  system.  The  vision 
system  has  two  CCD  cameras.  One  is  main  CCD  camera, 
and  the  other  is  small  CCD  camera.  The  main  CCD 
camera,  which  is  installed  on  a  pan-tilt  table,  collects  the 
image  of  an  Anemo  on  the  ceiling.  This  image  is 
processed,  and  the  robot  calculates  the  distance  to  under 
the  Anemo.  The  small  CCD  camera  is  installed  for 
adjustment  of  self-location[7].  The  robot  moves 
according  to  the  instruction  that  is  transferred  from  the 
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vision  system  to  the  vehicle  system.  The  vehicle  system  is 
composed  of  two  driving  wheels  and  two  caster  wheels. 
The  driving  wheels  have  two  DC  servo  motors  and  the 
left  wheel  and  the  right  wheel  move  independently.  This 
robot  is  operated  under  point  to  point  control  of  motion. 
The  pattern  of  the  robot  motion  is  that  it  takes  spin  turns 
and  faces  in  the  direction  of  the  Anemo,  then  it  moves 
straight  forwards  until  right  under  the  Anemo  and, 
finally  it  adjusts  the  position  with  the  spin  turn  again. 
The  lift-up  system  lifts  the  sensor  system  before 
measuring  the  rate  of  air  flow.  While  the  robot  is  moving, 
the  hood  is  500mm  below  the  ceiling  in  order  to  avoid 
any  obstacle.  As  the  system  changes  to  the  measurement 
mode,  the  hood  is  lifted  until  the  distance  between  the 
ceiling  and  the  hood  is  in  the  range  of  70mm  to  200mm. 
Then  the  photoelectric  switch  turns  on,  and  the  lift-up 
speed  slows  down  to  avoid  a  shock  when  hitting  the 
ceiling.  As  the  hood  touches  the  ceiling,  the  touch  switch 
on  the  attachment  hood  is  activated  and  the  lift-up 
operation  is  ended.  This  whole  process  takes  about  30 
seconds.  The  robot  does  these  measurements  over  again 
unless  all  measurements  in  same  floor  are  finished.  We 
show  the  autonomous  mobile  robot  in  Fig.2. 


Fig.2  Autonomous  mobile  robot 


3.  Error  recovery 

The  error  of  the  autonomous  robot  is  not  the  same  as 
the  trouble  of  the  mere  machine.  It  is  the  same  kind  of 
the  error  of  human  behavior  based  on  the  recognition. 

There  are  these  six  kinds  of  errors;  (1)  Commission 
error,  (2)  Omission  error,  (3)  Repetition  error,  (4) 
Imperfect,  (5)  Sequential  error  and  (6)  Extraneous  act. 

In  this  research,  we  are  going  to  deal  with  from  (1)  to  (3). 
When  the  error  recovery  is  carried  out,  three  points  of 
the  following  becomes  important . 

(1)  How  does  the  robot  discover  the  error  ? 

(2)  How  does  the  robot  restore  the  error  ? 

(3)  How  does  the  robot  prevent  the  recurrence  of  the 
error  ? 

The  purpose  of  this  research  finds  an  answer  in  these 
problems. 

4.  HALAS 

The  authors  propose  HALAS  (Hierarchical  Adaptive 
and  Learning  Architecture  System)  as  the  control 
architecture  of  the  navigation  system  of  the  autonomous 
mobile  robot  in  consideration  of  the  error  recovery 
(Fig.2)[8]-[10j.  The  structure  of  the  HALAS  has  the 
voluntary  movement  and  the  involuntary  movement  of 
vertebrate  animal.  The  HALAS  prevents  the  recurrence 
of  the  error  by  the  voluntary  movement.  And  the 
involuntary  movement  can  discover  the  error  and  repair 
it.  The  involuntary  movement  consists  of  four  action  sets 
by  the  sensing  level. 

They  are  (1)  Confident  Action  Set,  (2)  Discreet  Action 
Set,  (3)  Reconsideration  Action  Set,  and  (4) 
Reconstruction  Action  Set. 

The  voluntary  movement  can  choose  the  action  set 
which  is  suitable  for  the  error  recovery  by  the 
reinforcement  learning.  The  wind  that  the  robot  can  get 
when  it  reaches  an  Anemo  is  being  used  as  the  reward  of 
the  reinforcement  learning.  Then  the  action  set  that  has 
the  highest  evaluation  function  value  is  chosen  at  that 
time. 

5.  Global  matching 

The  robot  taht  lost  sight  of  an  Anemo  gets  lost  the  path  it 
should  go  along.  So,  this  robot  does  the  Global  Matching, 
and  detects  its  position,  and  reverts  in  the  course  of  the 
base.  This  Global  Matching  Method  is  one  of  the  action 
sets  including  above  mentioned  (4)  Reconstruction 
Action  Set  in  HALAS.  The  Global  Matching  is  the 
behavior  that  the  robot  can  recognize  the  self-location 
absolutely  by  the  coordinates  system  of  the  large  room  at 
the  any  positions.  The  self-location  is  shown  in  the  two- 
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dimensional  coordinates  x  and  y,  and  the  posture  angle 
0 .  In  this  research,  the  robot  searches  the  position  on  the 
comer  inside  the  room  by  the  main  camera.  Then,  the 
robot  calculates  its  position  with  the  corner. 


Sensor  Actuator 


Voluntary  Movement 


□ 


Involuntary  Movement 

Fig.3  HALAS 


5..1  Environmental  condition 


The  environment  that  the  Global  Matching  is  done, 
and  the  precondition  are  explained  here.  The 
experimental  environment  was  the  room  26m  by  11m  is 
shown  in  Fig.4.  This  is  the  room  that  generally  the  air 
conditioning  equipment  inspection  robot  is  used  for.  The 
robot  has  the  design  chart  at  this  room.  Therefore,  the 
dimension  of  the  room  is  known  well.  For  example,  the 
height  of  the  ceiling,  the  position  and  size  of  the  Anemo, 
the  windows  and  the  doors  and  so  on  are  known  well. 
The  route  of  inspection  was  determined  by  human  in 
advance.  Furthermore,  other  environment  conditions  and 
preconditions  in  this  technique  are  shown  in  the 
following. 

(1)  The  room  is  large  rectangle  office  room  and  there  are 
system  lines  where  the  Anemo  and  so  on  are  arranged  on 
the  ceiling. 

(2)  There  is  at  least  one  wall  side  that  there  are  not  a 
pattern,  a  line,  and  a  structure  thing  in  the  room.  Then, 
there  is  at  least  one  corner  where  the  outline  is  clear  and 


it  touches  the  wall.  There  can  be  window,  door  and  so  on 
in  other  walls. 

(3)  The  condition  that  it  was  lost  sight  of  the  self-location 
was  as  the  following. 

When  the  Anemo  is  not  found  even  if  the  error  recovery 
techniques  included  the  involuntary  movement  were  used. 
When  the  number  of  the  Anemo  was  different  from  the 
map  information. 

Whdow  Window  Window 

Pilar  Pilar  Pilv  Pfllir 


26m 


Fig.4  Experimental  room 


5.2.  Orientation  modification 

To  begin  with  this  process,  the  robot  turns  to  the  wall 
for  the  comer  detection.  The  robot  detects  the  self¬ 
posture  angle  0  by  using  the  system  line  on  the  ceiling. 
Then,  the  robot  faces  in  the  system  line,  and  it  makes  the 
posture  parallel  to  the  system  line.  The  process  of  the 
orientation  modification  is  as  the  following.  The  robot 
takes  pictures  of  the  top  by  using  the  small  camera  which 
is  set  upright.  Then,  it  confirms  whether  a  system  line 
exists.  It  is  known  from  the  past  experiment  that  the 
small  camera  can  find  out  the  goal  object  if  it  is  inside 
the  range  in  every  direction  300mm  from  the  center  of 
the  robot.  But,  when  the  robot  judges  that  a  system  line 
does  not  exist  in  this  range  because  a  straight  line  is  not 
extracted  as  a  result  of  the  image  processing,  it  looks  up 
at  the  ceiling  with  the  other  vision  sensor,  the  main 
camera  which  is  set  on  the  pan-tilt  table.  Then,  it  does 
the  search  of  a  system  line  again  turning  the  table.  After 
that,  the  robot  calculates  the  distance  to  the  system  line 
and  the  direction,  and  it  moves  to  right  under  the  system 
line.  The  robot  detects  the  angle  of  the  posture  of  the 
system  line  with  the  small  camera  after  the  arrival,  and  it 
rotates  to  become  parallel  in  the  system  line. 

Then,  the  authors  did  the  posture  compensation 
experiment  of  the  robot,  and  confirmed  the  error  of 
posture  matching.  The  initial  position  of  the  robot  in  this 
experiment  was  divided  into  two  cases.  One  is  Case  (a) 
Right  under  the  system  line,  the  other  one  is  Case  (b) 
Between  the  system  line  and  the  system  line.  Then,  each 
initial  posture  it  was  inclined  to  0°  ,45°  and  90°  . 
Then,  the  robot  paralleled  the  system  line  from  these 
position  and  postures.  The  result  is  shown  in  Table  1. 
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The  orientation  modification  movement  could  be  finally 
completed  in  less  than  ±1°  . 


Table  1  Results  of  Orientation  Modification 
(degree) 


Case  (a)  Result 

Case  (b) 

Result 

Initial  Small  Camera 

Main  Camera 

Small  Camera 

o 

o 

5.3 

0.4 

45  0.6 

7.7 

-0.1 

90  0.5 

6.6 

-0.1 

5.3  Comer  Detection 

The  robot  detects  the  comer  to  calculate  its  position 
expressed  by  x,  y  coordinates.  The  robot  can  use  the 
comer  of  the  ceiling  side  and  the  wall  or  the  corner  of  the 
floor  side  and  the  wall.  The  comer  of  the  ceiling  side  was 
used  by  this  research.  The  images  taht  the  main  camera 
took  by  the  corner  detection  treatment  are  shown  in  order 
by  Fig.5.  This  method  is  explained  concretely  in  the 
following.  As  the  authors  explained  so  far,  the  robot  faces 
to  the  wall  taht  a  pattern  is  not  in.  First,  the  robot 
controls  the  pan-tilt  table  and  the  zooming  of  the  lens  to 
make  the  main  camera  face  to  the  wall  straightly  at  this 
position,  Fig.5(a).  The  histogram  that  image  pixel  was 
projected  on  the  Y  axis  is  made  in  the  taken  image  at  this 
time.  This  histogram  shows  the  straight  line  that  crosses 
the  inside  of  the  screen  horizontally.  The  straight  line 
which  was  done  like  is  the  boundary  line  of  the  wall  and 
the  ceiling  side.  Next,  the  robot  controls  the  pan  angle  of 
the  main  camera  so  that  this  straight  line  comes  in  the 
center  of  the  screen,  Fig.5(b).  Thirdly,  the  robot  extracts 
the  boundary  line  that  crosses  the  center  of  the  screen  by 
Hugh  change.  Then,  it  calculates  the  center  of  gravity 
and  adjusts  the  pan -tilt  table  so  that  the  center  of  gravity 
comes  in  the  center  of  the  screen  of  the  Fig.5(c).  Fourthly, 
the  robot  detects  the  left  end  point  of  this  straight  line  in 
this  image.  Then,  it  chases  the  line  in  the  left  with 
(xmtrolling  pan -tilt  table  so  that  the  end  point  becoming 
in  the  center  of  the  camera  again,  Fig.5(d).  This 
operation  is  repeated.  Fifthly,  the  robot  recognizes  the 
straight  terminal.  The  movement  of  the  camera  is  to  stop 
when  the  left  end  point  on  the  boundary  line  comes  in  the 
center  of  the  image,  Fig.5(e).  Finally,  the  robot  confirms 
that  there  is  a  straight  line  that  extends  in  the  vertical 
bottom  from  this  end  point.  This  shows  the  boundary  line 
of  the  wall  and  the  wall.  The  angle  of  the  pan  and  tilt  are 
the  directions  of  the  corner  seen  from  the  robot. 

Then,  the  coordinate  of  the  corner  is  shown  by  the 
equations  (1)  and  (2)  in  the  coordinates  system  of  Fig.6. 


xc,yc:  The  x,y  coordinate  of  the  comer  in  the  room 
xr,yr:  The  x,y  coordinate  of  the  robot  in  the  room 
0  r:  The  posture  of  the  robot  in  the  room 
Ot,  <t>  t:The  pan  and  tilt  angle  of  the  camera 
xi,yi:  The  x,y  coordinate  of  the  comer  in  the  image 
f:  The  focal  distance 

xv,yv,zv:The  x,y,z  coordinate  of  the  viewpoint 
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Fig.6  Coordinates  System  of  Corner  Detection 
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5.4  Evaluation  of  Position  Error 

When  the  self-location  measurement  is  done  by  using 
this  technique,  five  of  the  next  are  thought  as  factors  of 
the  error. 

(a)  Inclination  toward  the  vertical  angle 

(b)  The  orientation  modification  error  of  the  robot 

(c)  The  error  of  the  corner  detection  by  the  image 
processing 

(d)  The  control  error  of  the  pan-tilt  table  of  the  main 
camera 

(e)  Architectural  error 

The  influence  which  the  error  of  (a)  -  (d)  gives  to  the 
self-position  measuring  was  thought  and  the  result  of  (a) 
is  shown  as  a  representative.  It  is  shown  in  the  maximum 
error  circle  of  the  self-position  detection  in  each  position 
inside  the  room  in  Fig.7.  But,  the  error  circles  change 
magnification,  and  it  is  expanded  to  put  emphasis  on  the 
result.  The  data  shows  the  diameter  of  the  maximum 
error  circle.  The  corner  in  the  coordinate  origin  (0,0)  was 
detected,  and  the  self-position  was  computed  with  the 
robot  in  each  position. 

The  purpose  of  the  Global  Matching  is  that  it  makes 
the  robot  revert  in  the  movement  course  when  the  robot 
deviated  the  course  because  an  Anemo  was  lost  sight  of 
once.  Therefore,  the  robot  should  have  moved  from  the 
point  where  the  comer  detection  had  been  done  to  the 
nearest  Anemo  that  was  in  the  course.  However,  as  for 
the  self-position  measuring  value,  it  was  found  out  that  it 
had  the  big  error  in  the  far  point  from  the  comer  as  a 
result  of  the  error  analysis.  Therefore,  there  is  area  where 
the  robot  can  not  reach  the  nearby  Anemo  directly,  when 
the  robot  measures  the  position  by  only  the  corner 
detection.  So,  the  robot  moves  once  to  the  range  that  the 
precision  of  position  measuring  by  the  corner  detection  is 
within  ±  300mm.  Then,  it  does  the  self-position 

measuring  again  at  the  point.  The  Anemo  to  approach  at 
this  time  is  called  an  Anemo  Origin,  and  the  robot 
reverts  from  this  Anemo  Origin  to  the  movement  course. 

The  robot  was  put  on  the  same  places  where  the  error 
analysis  was  done  on  the  actual  office  floor.  Then,  the 
self-position  measuring  was  done  at  the  12  places.  The 
results  are  shown  as  the  maximum  error  circles  in  Fig.8. 
The  experimental  results  showed  the  tendency  which  was 
the  same  as  the  error  analysis.  The  error  became  as  big  as 
a  distance  from  the  corner  was  far. 

.  From  these  results,  it  becomes  clear  that  an  Anemo 
Origin  should  be  a  (5000,  5000)  neighborhood  Anemo. 


0  10000  20000  x  (mm) 

Fig.7  Error  caused  by  Inclination  of  Robot 


0  10000  20000  x  (mm) 

Fig.8  Experimental  Result 

5.5  Topological  Matching 

The  coordinate  of  the  robot  based  on  the  comer  was 
calculated,  and  then  the  robot  was  moved  to  the  Anemo 
Origin.  However,  it  has  not  been  confirmed  yet  if  the 
robot  was  moved  to  the  Anemo  Origin  or  in  the  reverse 
direction.  So,  the  robot  does  a  final  decision  on  the  self¬ 
position  by  this  technique.  If  the  robot  is  in  the  position 
of  Fig.9  (a),  it  can  decide  the  self-position  using  the  map 
information  that  describes  where  the  door  and  windows 
are.  But,  the  door  and  the  windows  shown  in  the  map  can 
not  be  discovered  if  the  robot  faces  an  opposition 
direction  in  Fig.9  (b).  Therefore,  modification  on  the 
position  is  necessary.  The  existence  of  the  window  and 
the  door  is  judged  by  histogram  on  the  edge  in  the 
vertical  direction  in  the  image.  The  robot  recognized  the 
window,  door  and  wall. 


00  (b) 

Fig.9  Topological  Matching 
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6.  Locomotive  experiment 


it  was  shown  by  this  technique  that  the  robot  could  revert 
in  the  movement  course  from  the  Anemo  Origin. 


The  proposed  technique  was  applicable  to  this  robot, 
and  the  error  recovery  movement  experiments  to  revert  to 
the  Anemo  Origin  from  four  positions  ware  done.  The 
movement  path  at  that  time  is  shown  in  Fig.  10.  The  robot 
was  moved  first  with  Case  1  and  Case  2  in  the  y  direction. 
This  shows  that  the  robot  moved  under  the  system  line 
during  the  orientation  modification  was  done.  It  had  the 
self-position  measurement  error  of  246mm  with  Case  2 
by  the  first  Global  Matching  movement  after  the 
orientation  modification  was  done.  The  robot  moved  to 
the  Anemo  Origin  (4400, 4480)  based  on  this  value.  But, 
it  gained  the  error  of  the  dead  reckoning.  The  error 
therefore  to  arrive  was  1435mm.  But,  because  it  could 
approach  the  corner  enough,  the  robot  reduced  the  error 
by  1177mm  by  the  2nd  Global  Matching.  When  the  robot 
came  to  this  area,  the  Anemo  Origin  could  be  caught  in 
the  small  camera,  and  the  position  error  could  be  finally 
under  20mm. 


y  (mm) 


O  Case  1 
O  Case  2 
□  case  3 
0  Case  4 


x  (mm) 


Fig.10  Trajectory  of  Global  Matching 
7.  Conclusion 


It  was  proposed  how  to  recognize  the  self-position  by 
the  Global  Matching  that  did  not  depend  on  the 
landmark,  Anemo.  And  it  was  an  error  recovery  method 
when  an  Anemo  was  lost  sight  of.  After  the  robot  did  the 
orientation  modification  first  and  faced  in  the  wall,  the 
boundary  line  of  the  wall  and  the  ceiling  side  was 
extracted.  Then,  the  robot  found  the  end  of  this  straight 
line,  and  confirmed  that  there  was  other  straight  line  that 
extended  in  the  vertical  bottom.  This  was  confirmed  as  a 
corner  and  the  self-position  was  calculated  with  the 
corner.  It  was  confirmed  that  the  robot  could  revert  to  the 
Anemo  Origin  set  up  around  the  neighborhood 
(5000,5000)  of  the  coordinates  of  the  corner  as  a  result  of 
the  experiment.  Even  if  an  Anemo  was  lost  sight  of 
during  the  work  and  the  robot  deviated  from  the  course, 
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Abstract 

In  the  future,  mobile  robots  will  most  probably  navigate 
through  the  fields  autonomously  to  perform  different  kind 
of  agricultural  operations.  As  most  crops  are  cultivated  in 
rows,  an  important  step  towards  this  long-term  goal  is  the 
development  of  a  row-recognition  system,  which  will 
allow  a  robot  to  accurately  follow  a  row  of  plants.  In  this 
paper  we  describe  a  new  method  for  robust  recognition  of 
plant  rows  based  on  the  Hough  transform.  Our  method 
adapts  to  the  size  of  plants,  is  able  to  fuse  information 
coming  from  two  rows  or  more  and  is  very  robust  against 
the  presence  of  many  weeds.  The  accuracy  of  the  position 
estimation  relative  to  the  row  proved  to  be  good  with  an 
error  variance  between  0.6  and  1.2  cm  depending  on  the 
plant  size.  Extensive  field  tests  also  showed  that  the 
system  is  sufficiently  fast,  about  10  images  per  second,  to 
be  used  in  a  closed-loop  control  system. 


1.  Introduction 

Increasing  cost  of  chemicals  and  the  soil  pollution 
caused  by  the  herbicide  residues  ask  for  alternative 
methods  of  crop  protection.  A  potential  way  of  reduction 
of  chemicals  is  the  use  of  precision  techniques  for  various 
types  of  agricultural  operations,  so  that  the  chemicals  can 
be  placed  where  they  have  an  optimal  effect  with 
minimum  quantity.  For  some  operations  it  will  even  be 
possible  to  abandon  the  use  of  chemicals  and  apply  other 
methods,  e.g.  mechanical  weed-control.  Our  vision  is  that 
in  the  future,  autonomous  mobile  robots  will  navigate 
through  the  fields  to  perform  these  operations.  As  most 
crops  are  cultivated  in  rows,  an  important  step  towards 
this  long-term  goal  is  the  development  of  a  row- 
recognition  system,  which  will  allow  a  robot  to  accurately 
follow  a  row  of  plants. 

To  derive  guidance  information  from  planting 
structures  has  been  investigated  by  others.  Marchant  [1] 
and  Marchant  and  Brivot  [2]  perform  several  steps  of 
image  processing  based  on  near-infrared  images  to 


discriminate  weeds  from  plants.  They  calculate  the  center 
of  each  object  in  the  image  and  use  a  Hough  transform  to 
calculate  the  offset  and  orientation  relative  to  the  row 
structures.  They  combine  information  coming  from  three 
row  segments  to  increase  performance. 

Billingsley  and  Schoenfish  [3]  use  the  chrominance 
signal  from  a  colour  camera  to  acquire  an  image  based  on 
the  ’’greenness”  in  the  image.  To  threshold  the  image  they 
use  an  adaptive  method  based  on  the  expected  plant 
material  in  the  image.  To  find  line  structures  they  used 
linear  regression  based  on  information  from  three  row 
segments  to  find  the  best  fit.  Slaughter,  Chen  and  Curley 
[4]  use  a  Bayesian  classifier  based  on  information  from  a 
color  camera  to  discriminate  weed  from  plants.  The 
camera  was  centered  above  the  row  and  they  sum  up  all 
pixels  in  every  colum  thus  forming  a  histogram  of  the  row 
structure.  They  use  the  median  value  of  the  histogram  to 
estimate  the  position  of  the  row. 

Common  for  the  above  described  methods  is  that  they 
work  for  larger  plants  (>5  cm  in  diameter)  or  small  plants 
with  low  weed  density  (3  weed/plant).  In  our  case  the 
system  must  be  able  to  work  even  when  the  size  of  the 
plants  and  weeds  are  the  same  and  the  weed-density  is 
high  (>3  weed/plant).  In  this  paper  we  present  a  new 
robust  image-processing  algorithm  to  detect  the  position 
of  a  row  of  plants.  Our  method  is  able  to  adapt  to  the  size 
of  the  plants  and  is  not  disturbed  by  the  presence  of  many 
weeds. 

2.  Row-recognition  system 

A  problem  for  the  development  of  a  row  recognition 
system  is  the  fact  that  there  might  be  a  lot  of  weed  which 
disturbs  the  appearance  of  a  nice  clear  row  structure  of 
green  plant  material  coming  from  the  crops.  One  approach 
is  to  first  classify  every  plant  as  crop  or  weed  and  then 
find  a  row  structure  based  on  the  crops  only.  Another 
approach  is  to  consider  weed  as  noise  and  to  find  a  row 
structure  based  on  both  crops  and  weeds.  We  use  the  last 
described  method  and  to  get  a  good  signal  to  noise  ratio, 
which  means  the  ability  to  handle  large  weed  pressure,  we 
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look  at  a  row  segment  of  about  four  meter  long.  To  be 
able  to  deal  with  missing  crops  in  the  rows  and  to  increase 
robustness  and  accuracy  we  look  at  two  rows  at  the  same 
time. 

We  use  a  gray-scale  camera  with  a  near-infrared  filter 
to  detect  plants.  Using  the  filter  gives  a  high-contrast 
image,  where  living  plant  material  is  bright  and  the  soil  is 
dark  [5].  However  due  to  disturbances,  in  form  of 
different  light  conditions  and  due  to  the  fact  that  all  white 
pixels  do  not  correspond  to  crops,  some  additional  image- 
processing  has  to  be  done. 


Figure  1.  Original  and  threshold  image 

To  decrease  the  effect  of  different  light  conditions  we 
perform  an  opening  operation  on  the  image  [7]  and 
subtract  the  result  from  the  original  image.  This  results  in 
an  intensity-independent  gray-level  image  from  which  a 
binary  image  can  be  derived  with  a  fixed  threshold.  In  the 
resulting  binary  image,  plant  material  from  both  weeds 
and  crops  is  white  and  the  rest,  coming  from  soil,  stones 
and  residues  is  black  (figure  1). 


2.1  Geometric  model 


Based  on  the  binary  image  the  next  step  is  to  detect  the 
row  of  crops  in  the  image.  To  control  for  example  a 
mobile  robot  or  other  tool,  we  must  know  the  offset  and 
the  heading  angle  of  the  camera  relative  to  the  row 
structure.  To  be  able  to  extract  this  information  from  the 
image  we  must  have  a  geometric  model  of  the  row 
structures  and  know  the  transformations  between  the 
world  coordinate  system  and  the  image  coordinate  system. 
To  start  with  we  assume  that  a  line  can  approximate  the 
row  of  crops.  The  world  coordinates  of  the  row  can  first 
be  transformed  to  camera  coordinates  and  finally  to  image 
coordinates  through  a  perspective  transformation.  Figure  2 
shows  the  geometry  of  the  world  and  camera  coordinate 
system.  The  heading  angle,  a,  and  the  offset,  5,  are  the 
unknown  parameters  which  need  to  be  found.  It  is 
assumed  that  the  field  can  be  approximated  by  a  plane  and 
the  world  coordinate  system  is  chosen  in  such  a  way  that 
this  plane  can  be  described  as  zw  =  0  and  that  the  rows  are 
lying  in  parallel  with  the  yw-axis.  The  camera  angle,  </>, 
and  its  height  above  the  ground,  h,  are  fixed. 

The  transformation  from  a  position  expressed  in  the 
camera  coordinate  system  k  to  a  position  expressed  in  the 
world  coordinate  system  w  is  in  general  [6] : 
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Image  intensity ~  Image  opening  ~  Image  original 

(1) 


Where  R  is  the  rotation-matrix  between  world 
coordinate  and  the  camera  coordinate  system.  In  our  case 
R  is  a  function  of  </>  and  a: 


Figure  2.  World-Camera  Coordinate  System 
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In  our  case  the  translation-vector  is  a  function  of  a,  s 
and  h: 


tz=h 

tx-  S'  cos  (a) 
ty  =  s-  sin  (a) 


The  inverse  transformation  is: 
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where: 


assuming  that: 
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As  said  before,  the  row  structure  can  be  regarded  as  a 
line  lying  in  the  plane  z*,  =  0,  which  gives  us  the  three 
following  equations  from  (7): 

xk  =  cosCa)*^  +  sin^))^  _  s 

yk  =  -cos^sinCa)^  +cos(a)cos(^)yw  -hsin(<fi)  (8) 
zk  =sm(a)sin(0)xw-cos(a)sin(<f>)yw-hcos(<f>) 

The  camera  can  be  simply  modeled  as  a  pinhole 
camera  where /is  the  so-called  focal  length  [7].  The  xi}yi 
coordinate  system  is  the  image-sensor  coordinate  system 
of  which  the  axis  are  parallel  to  xk  and  yk  respectively. 

X;  =  —f  •  —  *=-/■?*-  (9) 

zk  zk 

Equations  8  and  9  together  give: 


sin(a)  =  a  and  cos(a)=l.  (13) 

which  gives: 

a(-  hy:  cos(^)  -  fhs,\n(<j>))  +  s(f  cos(^)  -  y,  sin(^)) 

-xw{f  cos(^)  -  y- sin(^))  +  hxt  =  0 

This  can  also  be  written  as: 

Aa  +  Bs  +  C  =  0  (15) 

where:  A  =  (-  hyf  cos(^)  -  /isin(^)) 

B  =  (f  cos(^)  -  yt  sin(^)) 

C  =  -XWB  +  hx{ 

Finally,  a  straightforward  transformation  from  image¬ 
sensor  coordinates  to  image-pixel  coordinates  is  needed: 

Xj  =  -{xp  -  xc  ^dpixelx  ,  y,  =  ~(yp  -  y^dpixely  ( 1 6) 

Where  dpixelx  and  dpixely  is  the  size  of  the  sensor 
elements  in  the  x-  and  y-  direction  of  the  CCD-array.  xc 
and  yc  are  the  coordinates  of  the  optical  center  of  the 
camera  in  pixels  and  set  to  half  the  size  of  the  image. 

This  gives  the  following  equation: 
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Given  a  known  position  in  the  field  described  as  xw,yw 
equation  10  and  11  give  the  position  in  the  image 
described  as  Xi,y*  assuming  we  know  the  position  and 
orientation  of  the  camera  described  by  a,  s,  h,  (j> ».  In  our 
case  we  know  that  every  pixel  belonging  to  a  plant  row 
has  a  certain  fixed  position  xWi  which  depends  on  the  row 
distance  but  generally  we  do  not  know  the  value  yw. 
Therefore  we  eliminate  yw  from  equation  10  and  1 1  which 
gives: 

s  •  cos(a)  •  (-  /  cos(0)  +  y,  sin(^))  = 

-  fxw  cos  {(/>)  +  hxi  cos(a)  -  hy{  cos(^)sin(a)  (12) 

+  xwy{  sin(^)  -  fh  sin(a)  sin(^) 

For  small  values  of  a  we  can  simplify  the  equation  by 


This  equation  plays  a  central  role  to  find  the  heading 
angle,  a,  and  the  offset,  s ,  which  is  described  in  the  next 
section.  Please  note  that  xw  is  known  as  it  is  the  position 
of  the  row  in  world  coordinates. 

2.2  Finding  row-structures 

The  Hough  transform  is  a  well-known  and  robust 
method  to  find  lines,  especially  if  the  lines  cover  the 
whole  image  as  in  our  case  [7].  Normally  the  lines  are 
found  with  their  equation  in  the  image  space  e.g.  yp  =  axp 
+  b,  where  the  coefficients  a  and  b  are  found  with  the 
Hough  transform.  We  could  also  do  this  based  on  the 
binary  image  of  plant  material.  All  pixels  coming  from  the 
crops  contribute  to  the  line  and  all  pixels  from  the  weeds 
are  just  noise.  By  using  equation  17  for  two  points  on  this 
line  we  are  able  to  calculate  the  heading  angle,  a,  and  the 
offset,  s. 

However,  as  shown  in  equation  17,  there  is  a  linear 
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Figure  3.  Distribution  of  s  for  a  specific  a 

relation  between  s  and  a  for  a  given  pixel  (xp,  yp).  This 
means  that  we  directly  can  perform  the  Hough  transform 
for  s  and  a  based  on  equation  17. 

The  Houghspace,  H(s,a),  is  then  an  accumulator  where 
each  pixel  coordinate  (xp,yp)  in  the  binary  image,  which  is 
on,  contributes  even  if  it  is  belonging  to  a  weed  plant.  The 
recognition  of  the  row  of  crops  among  the  weeds  is 
achieved  due  the  fact  that  weeds  are  uniformly  distributed 
in  the  field,  whereas  all  the  crops  grow  exactly  in  a  row, 
thus  leading  to  a  peak  in  the  Houghspace. 


s 


Figure  4.  Houghspace  accumulator 

In  the  world  model  a  plant  row  can  better  be  described 
by  a  rectangular  box  rather  then  a  line.  The  width  of  the 
box  is  equal  to  the  average  width  of  the  plants  and  the 
length  of  the  box  is  ’’unlimited”  as  it  fills  the  whole  image. 
The  rectangular  box  can  be  described  by  a  set  of  parallel 
adjacent  lines.  The  number  of  lines  is  the  width  of  the  box 
divided  by  the  line  thickness,  which  is  determined  by  the 
pixel  size  of  the  image.  In  our  Houghspace  this  means  that 
for  one  value  of  a  the  rectangular  box  corresponds  to  a 
number  of  adjacent  s-cells.  By  summing  up  the 
contributions  of  the  adjacent  s-cells  we  obtain  the  support 
for  a  rectangular  box,  i.e.  for  the  row  of  plants.  The  best 
estimate  of  s  and  a  is  found  by  searching  for  the 
maximum  of  the  sum  of  adjacent  s-cells  in  our 


Houghspace.  We  can  easily  adapt  to  different  size  of  the 
plants,  by  reducing  or  increasing  the  number  of  adjacent  s- 
cells. 

Figure  4  shows  an  example  of  the  Houghspace  and 
figure  3  shows  the  corresponding  distribution  of  s  for  the 
correct  value  of  a.  In  this  example  we  look  at  the  sum  of 
three  adjacent  s-cells  to  find  the  most  likely  values  of  s 
and  a. 

If  more  than  one  row  is  used  each  row  has  then  its  own 
corresponding  Houghspace.  Information  from  the 
different  rows  can  be  fused  together  by  calculating  the 
average  of  s  and  a  derived  from  the  individual 
Houghspaces.  Another  possibility  is  to  sum  up  the 
contributions  from  all  Houghspaces  for  each  cell  (s,a), 
thus  forming  a  common  Houghspace  and  extract  the  most 
likely  value  of  s  and  a  from  this  one.  The  resolution  of  the 
Houghspace,  i.e.  the  size  of  the  cell  (s,a)  must  be  chosen 
carefully,  where  the  resolution  of  the  camera  plays  a 
major  role.  Figure  5  shows  the  sensitivity  of  s  and  a  as 
function  of  the  top  and  bottom  pixels.  The  top  pixel  is  the 
x-pixei  coordinate  of  the  row  at  the  top  of  the  image,  and 
the  bottom  pixel  is  the  x-pixel  coordinate  at  the  bottom. 
The  size  of  s  and  a  are  chosen  so  that  this  corresponds  to 
at  least  one  pixel  difference.  In  our  case  the  resolution  of  s 
was  set  to  1  cm  and  of  the  heading  angle,  a ,  to  0.2 
degrees. 

3.  Evaluation  of  the  row-recognition  system 

The  row  recognition  system  was  implemented  on  an 
industrial  PC,  equipped  with  a  Pentium  II  333  MHz 
processor  and  a  Matrox  Meteor  II  framegrabber.  The 
vision  system  is  a  COHU  CCD  camera  equipped  with  a 
near-infrared  filter  and  a  lens  with  8.5  mm  focal  length. 
The  camera  was  placed  about  800mm  above  the  ground 
and  the  camera  angle  was  68°.  This  means  that  we  look  at 
a  segment  of  the  rows  from  1  to  5  meter  in  front  of  the 
camera. 


Figure  5.  Resolution  in  image 


138 


Figure  6.  Four  sets  of  real  images 

To  evaluate  the  row-recognition  system  a  number  of 
real  images  were  used  to  verify  the  system.  Three  sets  of 
images  from  sugar-beet  plants  at  three  different  stage  of 
growth  were  included.  Also  one  set  of  images  from  a  rape 
field  was  used.  From  each  set  of  images  a  subset  of  70 
spatially  distributed  images  was  chosen.  The  most 
relevant  properties  of  each  set  are  given  in  table  1  and  one 
example  image  of  each  set  is  shown  in  figure  6.  Ten  of  the 
images  were  used  to  calibrate  the  camera  angle,  and  the 
camera  height  above  the  ground,  h.  The  number  of 
adjacent  s-cells  was  set  to  a  value  smaller  than  the  plant 
size.  Simulations  have  shown  that  the  best  performance  is 
achieved  when  the  number  of  adjacent  s-cells  is  set 
smaller  than  the  plant  diameter.  The  opening  kernel  was 
also  adjusted  based  on  the  plant  size.  We  look  at  two  rows 
simultaneously  to  increase  the  performance  and 
robustness  against  missing  crops,  outliers  and  high  weed 
densities  ( >3  weed/plant). 


Table  1.  Four  sets  of  real-images 


Set 

Stage 

0  cm 

Weed 

Plant 

1 

first  true  leaf  (rape) 

7-10 

1 

2 

first-second  true  leaf 

7 

3.5 

3 

cotyledon 

5 

<  1 

4 

second  true  leaf 

10-13 

<  1 

has  its  own  corresponding  Houghspace,  leading  to  two 
different  observations  for  s  and  a.  The  Houghspaces  of 
the  left  and  right  row  are  also  fused  together  in  a  common 
Houghspace  by  summing  up  the  entry  of  each 
corresponding  cell.  In  table  2  the  variance  of  the  error  of 
the  measured  offset  s  based  on  the  left  and  right  row 
separately,  and  based  on  the  common  Houghspace  are 
shown.  Another  method  to  combine  the  result  form  the 
two  rows  is  to  take  the  average  result  from  the  left  and 
right  row.  The  result  of  this  is  also  shown  in  table  2. 
Based  on  the  result  reported  in  table  2  we  may  conclude 
that  the  row-recognition  system  has  a  good  performance 
ranging  from  0.6  cm  variance  of  error  for  set  2  to  1.2  cm 
for  set  1.  Moreover,  it  is  shown  that  by  using  two  rows 
instead  of  one  the  accuracy  is  improved  significantly. 
Finally,  it  seems  that  fusion  of  the  information  from  the 
two  rows  should  be  done  in  the  Houghspace  directly 
rather  than  by  averaging  the  final  results,  as  this  leads  to 
slightly  less  error  variance. 

To  evaluate  the  robustness  against  calibration  errors, 
errors  of  the  values  of  the  camera  height  and  camera  angle 
were  introduced.  To  evaluate  the  robustness  in  plant  size 
variations,  the  number  of  adjacent  s-cells  has  been  varied. 
The  results  are  presented  in  form  of  extreme  errors  of  s, 
shown  in  figure  7-9,  and  of  the  variance  of  the  error  of  s  in 
figure  10-12.  All  these  results  are  based  on  the  output 
from  the  common  Houghspace.  From  the  figures  we  may 
conclude  that  the  system  shows  a  satisfying  robustness 
against  calibration  errors  and  that  the  number  of  adjacent 
s-cells  must  be  chosen  rather  small  compared  to  the 
expected  plant  size  as  a  to  large  number  of  adjacent  s-cell 
result  in  a  poor  performance. 


Table  2.  Variance  of  error  of  S  in  cm 


Set 

bright 

Sfeft 

e 

^common 

$  right  +  Sleft 

2 

1 

1.6 

2.3 

1.2 

1.4 

2 

1.0 

0.8 

0.6 

0.7 

3 

1.1 

1.4 

0.9 

0.9 

4 

1.0 

1.4 

0.8 

0.9 

4.  Field  tests 


For  all  images  the  real  position  of  the  camera  relative 
to  the  rows  was  estimated  as  follows.  A  human-operator 
marked  the  rows  with  a  mouse  based  on  which  the 
estimation  of  the  real  heading  a  and  offset  5  could  be 
calculated.  All  images  were  treated  twice  and  the  average 
value  from  the  two  observations  was  taken  as  the  final 
estimate  of  the  real  values  and  used  to  evaluate  the 
performance  of  the  row  recognition  system.  As  we  look  at 
two  rows  simultaneously,  a  left  and  right  one,  each  row 


The  row-recognition  system  has  been  tested  on  a  so- 
called  inter-row  cultivator,  able  to  perform  mechanical 
weed  control  between  rows  of  plants.  The  system  consists 
of  a  tractor  that  drives  along  the  rows  with  the  cultivator 
mounted  at  the  rear  of  the  tractor.  The  accuracy  of  the 
driver  is  not  sufficient  to  come  close  to  the  plant  rows.  So 
a  row-recognition  system  is  desired  which  is  able  to  guide 
the  cultivator  closer  to  the  plants  without  damaging  them. 
A  steering-unit  based  on  a  very  thin  steering  wheel,  which 
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cuts  through  the  soil,  is  used  to  control  the  position  of  the 
cultivator.  A  regulator  has  been  implemented  which  gets 
its  position  feed-back  from  our  row-recognition  system 
and  gives  a  control  signal  to  the  steering-unit. 

The  system  was  tested  on  two  types  of  crops:  sugar 
beet  plants  and  rape.  The  row  distance  was  typically  48 
cm  and  the  distance  between  the  plants  in  the  row  was 
about  12  cm  for  the  sugar  beet  plants  and  15  cm  for  the 
rape.  The  weed  density  varied  from  field  to  field  and 
could  reach  a  level  up  to  12  weed/plants. 

A  number  of  field  tests  have  been  done  to  verify  the 
row-recognition  system,  including  tests  under  different 
light  conditions,  weed  density  and  size  of  plants.  The 
system  was  able  to  process  8-12  images/s  (depending  on 
plant  size  and  weed  density)  and  to  cultivate  at  a  speed  of 
2  km/h.  The  cultivator,  cultivating  several  rows 
simultaneously,  had  a  spacing  left  between  the  tools  of  9 
cm  for  the  crops.  As  no  crops  were  damaged,  we  may 
conclude  that  we  were  able  to  control  the  cultivator  within 
±  4.5  cm.  These  results  confirm  very  well  the  results  from 
the  evaluation  described  in  section  3,  where  the  extreme 
errors  were  +3  and  -  4cm  (figure  8). 

5.  Conclusions  and  Outlook 

In  this  paper  we  have  presented  a  new  method  for  the 
recognition  of  plant  rows.  The  system  proved  to  be  able  to 
effectively  fuse  information  coming  from  two  rows,  and 
proved  to  be  robust  against  calibration  errors  and  to  be 
robust  against  high  weed  densities.  The  accuracy  of  the 
position  estimation  relative  to  the  row  proved  to  be  good 
with  an  error  variance  between  0.6  and  1.2  cm  depending 
on  the  plant  size  and  with  a  maximum  error  of  ±  4  cm. 
Extensive  field  tests  also  showed  that  the  system  is 
sufficiently  fast,  about  10  images  per  second,  to  be  used 
in  a  closed-loop  control  system.  In  future  work  we  will 
develop  an  automatic  calibration  procedure  and  an  end  of 
row  detection  function.  Moreover  we  plan  to  use  the  row- 
recognition  system  to  guide  a  mobile  robot  through  the 
field. 
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Figure  9.  Error  of  camera  angle  in  degrees 
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(variance) 


-  Set  1 - Set  2  .  Set  3 - Set  4 


Calibration  error  of  camera  angle  in  degrees 


Figure  10.  Error  of  s-cells  in  cm  (variance) 


Figure  12.  Error  of  camera  angle  in  cm(variance) 
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Abstract 

To  achieve  high  performance  robot  control  system,  the 
inclusion  of  actuator  dynamics  in  the  controller  design 
will  be  inevitable .  Nonetheless,  the  consideration  of  ac¬ 
tuator  dynamics  in  the  robot  control  system  may  compli¬ 
cate  the  controller  structure  and  its  corresponding  stabil¬ 
ity  analysis.  Moreover,  on-line  computation  of  the  re¬ 
gressor  matrix  in  conventional  adaptation  mechanism  is 
always  time-comsuming  and  expensive .  Therefore,  this 
study  presents  a  novel  approach  to  circumvent  the  above- 
mentioned  drawbacks.  In  this  scheme,  the  regressor  ma¬ 
trix  merely  contains  the  desired  trajectories  which  can  be 
calculated  a  priori,  and  hence,  the  computational  burden 
is  largely  reduced.  The  so-called  semi-global  asymptotical 
stability  is  guaranteed  via  Liapunov  analysis  and  simula¬ 
tion  results  also  show  the  effectiveness  of  the  controller. 

i 

1  Introduction 

Intensive  studies  have  been  carried  out  in  the  literatures 
on  the  rigid  robot  represented  by  a  set  of  second-order 
differential  equations,  charactering  the  dynamic  behavior 
of  the  robot  and  assuming  the  inputs  to  be  the  torques  or 
the  forces  acting  on  on  the  joints.  In  reality,  however,  the 
actuator  dynamics  constitutes  an  important  part  in  the 
robot  dynamics,  especially  when  high  operation  speed  is 
required,  as  pointed  out  by  Good  et  al  [1].  Nonetheless, 
the  consideration  of  actuator  dynamics  in  the  robot  con¬ 
trol  system  may  complicate  the  controller  structure  and 
its  corresponding  stability  analsis. 

Recently  the  problem  of  controlling  the  motion  of 
robots  with  motor  dynamics  has  attracted  considerable 
attention  [2]-[13].  In  [2]  and  [3],  the  controller  is  designed 
to  compensate  for  the  uncertainty  in  the  robot  and  ac¬ 
tuator  dynamics  by  using  robust  control  schemes.  Non¬ 
linear  feedback  linearization  and  decoupling  is  studied  in 
[4]  for  robot  containing  motor  dynamics.  A  third-order 
model  is  considered  and  a  nonlinear  feedback  lineariza¬ 
tion  in  Cartesian  space  is  investigated  theoretically  and 


experimentally  in  [5],  and  [6]  proposes  an  adaptive  control 
scheme  to  account  for  the  parametric  uncertainty  with 
the  need  of  acceleration  signals.  Later  on  some  other 
adaptive  control  approaches  [7]-[8]  are  also  advocated 
without  the  acceleration  feedback.  In  [9]-[ll],  adaptive 
robust  control  schemes  are  designed  to  compensate  for 
the  parametric  and  nonparametric  uncertainties.  More¬ 
over,  the  adaptation  mechanism  always  require  the  on¬ 
line  computation  of  regressor  matrix,  which  would  be 
very  time-consuming  and  expensive.  Although  an  adap¬ 
tive  control  algorithm  is  proposed  [12]-[13]  to  avoid  the 
velocity  measurement  in  the  feedback,  the  on-line  posi¬ 
tion  measurement  is  still  involved  in  the  regressor  ma¬ 
trix  calculation.  Therefore,  this  study  presents  a  novel 
approach  to  circumvent  the  above-mentioned  drawbacks. 
In  this  scheme,  only  position  signal  is  needed  in  the  feed¬ 
back  law,  and  the  regressor  matrix  merely  contains  the 
desired  trajectories  which  can  be  calculated  a  priori,  and, 
hence,  the  computational  burden  is  largely  reduced.  The 
so-called  semiglobal  asymptotical  stability  is  guaranteed 
via  Lyapunov  analysis  and  simulation  results  also  show 
the  effectiveness  of  the  controller. 

This  paper  is  organized  as  follows  :  In  section  2,  brief 
description  of  the  dynamic  model  and  its  basic  properties 
are  given.  In  section  3,  an  adaptive  controller  is  provided 
for  the  model  described  in  section  2.  The  design  proce¬ 
dure  of  the  control  algorithm  is  splitted  into  two  stages, 
namely,  mechanical  and  electrical  stages.  Moreover,  Lya¬ 
punov  stability  proof  is  also  given  in  this  section.  In  sec¬ 
tion  4,  a  case  study  on  a  two-Inik  robot  arm  is  made,  and 
simulation  results  show  the  validity  of  the  proposed  con¬ 
troller.  Finally,  some  concluding  remarks  are  provided. 

2  Dynamic  Model 

In  this  section,  the  dynamic  model  of  a  robot  system  and 
some  of  its  intrinsic  properties  will  be  briefly  reviewed. 
All  the  properties  described  are  very  useful  in  adaptive 
controller  design  apd  can  be  found  in  the  literatures. 
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Consider  the  dynamics  of  an  n-link  robot  manipulator 
with  D.C.  motor  embedded  in  each  joint  to  be  : 

M(q)q  +  C(q}q)q  +  G(q)  =  KnI  (1) 

LI  +  RI  =  u  (2) 

where  q  €  Rn  represents  the  vector  of  joint  angle  and 
I  is  the  vector  of  armature  currents  in  D.C.  motors,  and 
M{q)  e  tfnxn,  C(q ,  q)q  6  Rn ,  and  G{q)  G  Rn  are  denoted 
as  the  moment  of  inertia  matrix,  Coriolis  and  centrifu¬ 
gal  terms,  and  gravitational  torques,  respectively,  Kn  == 
diag[Kni,  A\»2»  •  •  • ,  Knn],  L  =  diag[L\,  L2,  •  •  • ,  Ln]t  and 
R  =  diag[Ri1R2,  -  ,Rn]  are  diagonal  matrices  repre¬ 
senting  the  equivalent  torque  constants;  armature  induc¬ 
tion,  and  resistance,  respectively,  and  u  £  Rn  is  defined 
as  the  control  voltage  input  variable.  It  should  be  noted 
that  (1)  can  be  viewed  as  the  mechanical  part  of  the 
whole  system  while  (2)  can  be  treated  as  the  correspond¬ 
ing  electrical  part.  In  the  subsequent  controller  design, 
these  two  sub-systems  will  be  analyzed  seperately.  Fur¬ 
thermore,  the  following  statements  are  held  for  general 
robot  systems  and  are  very  useful  in  the  stability  analy¬ 
sis. 

Property  1  :  The  right-hand  side  of  dynamic  equation 
(1)  can  be  rewritten  in  a  regression  matrix  form  as  follows 

M(q)q  +  C(q,  q)q  +  G(q)  =  W(q ,  q,  q)B  (3) 

where  W(qfq}q)  G  Rnxmi  denoted  as  the  regressor,  is  a 
matrix  of  known  functions  Constituting  the  system  dy¬ 
namics,  and  9  G  Rm  consists  of  the  kinematic  and  dy¬ 
namic  parameters  of  the  manipulator  system. 

Property  2  :  M(q)  -2 C(q7q)  is  skew-symmetric,  i.e., 
xT(M-2C)x  =  0,VxE  A" 

Property  3  :  For  a  robot  system,  the  time  derivative 
of  the  inertia  martix  and  the  Coriolis  centrifugal  matrix 
can  be  bounded  by  the  following  inequalities  : 

l|^Af(g)||  <  «n||e||  +  <*i2 

||C'(g.?)ll  <  a2i||e||  +  a22 

where  e  =  q  —  qd  and  qd  is  some  prespecified  bounded  tra¬ 
jectory  function  vector  with  bounded  first-  and  second¬ 
time  derivatives,  and  ar,j,  i7j  =  1,2  are  some  positive 
constants. 

3  Adaptive  Controller  Design 

In  this  section,  the  adaptive  control  algorithm  is  devised 
under  the  assumption  that  all  the  system  parameters  are 
not  known  a  priori  and,  the  design  procedure  is  performed 
separately  on  the  basis  of  two  stages,  namely,  the  mechan¬ 
ical  stage  and  the  electrical  stage. 


First  Stage  Design  : 

Consider  the  mechanical  part  of  the  dynamic  model, 
as  shown  in  (1),  which  can  be  reformulated  as  : 

M(q)e  +  C(q,q)e  =  -M(q)qd  -  C(q,q)qd  -  G(q )  +  KnI 

(4) 

where  qd}  qdi  and  qd  have  been  defined  previously.  Since 
there  is  no  actual  control  input  involved  we  further  define 
a  virtual  control  input  Id  for  this  subsystem.  Thus,  if 
I  Id,  we  can  have  the  subsystem  to  behave  as  if  the 
virtual  input  Id  is  the  true  input.  Moreover,  the  on¬ 
line  computation  of  the  regressor  matrix  is  very  time- 
consuming  and  would  lead  to  large  computational  cost. 
To  remedy  this  problem,  all  the  measured  signalsin  the 
regressor  matrix  will  be  replaced  by  their  desired  forms, 
that  is,  let  the  virtual  control  input  Id  adaptive  control 
algorithm  be  devised  as  : 

Id  =  knv[M(qd)qd  +  C(qd,qd)qd  +  G(qd)-02k-P2e] 

=  KnAW(<id,qd,qd)0-02k-P2e] 

=  Knv[Wd9-p2k-02e ] 

=  KnvTd  (5) 

where  I\nv  is  the  estimate  of  K~ 1 ,  Wd  =  W(qd,qd,<id), 
Wd9  -  02k  -  02e  =  Td ,  and  0  is  the  design  parameter 
which  will  be  properly  selected  to  guarantee  the  asymp¬ 
totical  stability.  And  the  adaptive  parameter  6  and  k 
satisfy  the  following  equations  : 


k 

=  — 20k  -f  0?  e 

(6) 

9 

=  H^Wjk-^Wje- 

-Wje) 

(7) 

K  nv 

—  nvl  j  rui 2 1 

t 

K  nvi 

-  (jjki  -  ^e,-  -  ei)rdi, 

*’=1,2,” 

■  > 71  (8) 

where  T  is  some  positive-definite  matrix,  and  Xi  denotes 
the  ith  element  of  vector  x. 

Then,  substituting  the  above  control  law  (5)  into  the 
dynamics  (4)  along  with  the  adaptation  laws  (6)-(8),  we 
obtain 

M(q)e  +  C{q,q)e  +  02k  +  02e 

=  [M(qd)  -  M{q)]qd  +  [£(?<*,  qd)  ~  C{q,  q)]qd 
+  [< G{qd )  -  G(g)]  +  Knt,  +  I<n(knv  -  K-')Td 

=  Wd9  —  W  9  -f  I\n^I  +  Kn  KnvTd 
=  Wd9  +  (Wd  -  W)9  +  Knei  +  KnI<nvrd  (9) 

where 

WO  =  M(q)q  +  C(qtq)q  +  G(q) 
ej  =  I  —  Id 
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Before  proceeding  with  the  stability  analysis  we  have, 
based  on  the  properties  mentioned  in  section  2,  the  fol¬ 
lowing  proposition  : 

Proposition  1:  The  bounds  of  the  vector  norm  \\(Wd  ~ 
W)0\\,  ||^M||,  and  \\C(q,  <j)||  satisfy  the  following  in¬ 
equalities: 


(1) 

||(^-W^||<ai||e||  +  a2||e|| 

(10) 

(2) 

||^M||<o3||e||  +  o4 

(11) 

(3) 

l|C(9,?)||<a5||e||  +  a6 

(12) 

Proof:  The  proof  is  straighforward  and  omitted  due  to 
limitted  space. 

Q.E.D. 

Accordingly,  through  Lyapunov  stability  analysis,  the 
following  theorem  can  be  readily  derived  on  the  basis  of 
the  error  dynamics  (6)-(9)  by  incorporating  proposition 

1. 

Proposition  2:  Consider  the  closed-loop  system  dy¬ 
namics  (6)-(9),  then  the  position  error  e  and  the  velocity 
error  e  will  asymptotically  decay  to  zero  if  the  armature 
current  I  equals  the  virtual  control  input  Id  and  the  con¬ 
trol  gain  p  is  chosen  large  enough. 

Proof  :  The  evaluation  o£  the  closed-loop  system  sta¬ 
bility  will  be  made  on  the  basis  of  the  Lyapunov  analy¬ 
sis.  Before  proceeding  with  the  stability  analysis  of  the 
closed-loop  system,  we  first  define  a  Lyapunov  function 
candidate  as  follows  : 

y,  =  ^ 

Z  t  — 1 


i 

+ 0 

eT M(q)e  - 

-  ^kTM(q)e  + 

Uty  ~l0 

= 

Iv 

2 

Tpx  +  l 

n 

«=1 

(13) 

where 

X 

=  f 

eT  eT 

kT  9t  ]T 

'  02I 

M{q)/0 

0 

0  “ 

P 

M(q)/0 

M(q)  - 

M(q)/P 

0 

(  1  A  \ 

0 

-M{q)/P 

I 

0 

(14J 

0 

0 

0 

r 

Due  to  the  structural  symmetry  and  positive-definiteness 
of  M{q)  and  T,  it  can  be  verified  that,  if  the  control 
gain  p  is  chosen  large  enough  as  required,  the  Lyapunov 
function  candidate  V  is  a  positive-definite  function. 


Then,  by  taking  the  time  derivative  of  V\  along  the 
error  dynamics  (6)-(9),  we  obtain 

Vi  =  p2eTe  +  eT[-Ce-  {32k-  P2e  +  Wd6 
HWi  -W)0  +  Knei  +  Kn Knvrd ) 

+  \eTMe  +  kT{-‘2p  k  +  02e)  +  -UTMe  +  LTMi 

P  P 

+  ieT’[-Ce  -  /??*  -  02e  +  Wd9 
+(Wd  -  W)9  +  I<ne,  +  KnKnvrd] 

+—eTM  (—2/3k  +  02e)  -  i kTMi 
P  P 

-^kT[-Ce  -  02k  -  02e  +  Wd9 

+(Wd  -  W)9  +  Kne,  +  KnI<nvrd] 

-eTWj  dk  -  L  -  e)  +  £  I<niKnJ<nvi 

P  P  1  =  1 

Next,  we  can  have  the  upper  bound  for  V  by  using  the 
properties  described  in  section  2  and  the  inequalities 
derived  in  proposition  1  to  yield: 

Vi  <  -/3||e||2-(/?-i)Am(M)||e||T-^||2 

+l|e||(ailie||  +  »2||e||)  +  ^||e||(a3||e||  +  a4)||e|| 

+  ^llell(a5||e||  +  ar6)||e||  +  ^||e||(ai||e||  +  a2p||) 

+2ar||e||2||fc||+ip||(a3||e||  +  a4)||e|| 

+  ^ll^ll(a5||e||  +  a6)||e||  +  ^p||(ai||e||  +  a2||e||) 

<  -(0-  j)INI2-M2 

-  ||e|| 

-(2a7+^+i)P||]||e||2 

H*i  +  j  +  j  +  J)\\e\\\\e\\ 

+  jm\M  +  (j  +  J  +  ^)l|e||P|| 

-^k~~c-6)Kn(I-Id) 

<  -VilH|2-r;2||c||2-r/3pj|2 
+>?4lW|||e||  +  %||i||||e||  +  %||e||||fc|| 

-dkT -  l-cr -er)r<nr.t 
P  rp 
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(17) 


<  ~(m  -  \rn  -  ^/5)||e||2  -  {r]2  -  ^4  -  ^6)||e||2 

-0»  -  \n*  -  \^)\\kf 

~(^*T  -  j eT  -  er)I<nej 

where  A m(M)  denotes  the  smallest  eigenvalue  of  M ,  and 

=  0-^- 

p  p 

=  (/?-I)Am(M)-a2-i(a3  +  a5)||e|| 

-4(2a7/?  +  a5  +  l)||*|| 

=  P 

1, 

=  <*i  +  +  a4  +  a6) 

— 

P 

=  ~(a2  +  a4  +  a6) 

Obviously,  all  the  terms  in  rji ,  i  =  1, 2,  *,  6  involving  -  can 
be  made  negligible  if  0  is  chosen  large  enough.  Therefore, 
we  can  infer,  under  the  condition  that  /  =  Id,  that  all 
the  signals  in  the  closed-loop  system  are  bounded  and,  in 
addition, 

e  6  L>2  i  e  £E  L2 

As  a  result,  by  invoking  Barbalat’s  Lemma,  we  have 
lim  e  —  0 

t  — ►  oo 

lim  e  —  0 

t—*oo 

which  completes  the  proof. 

Q.E.D. 

Second  Stage  Design  : 

In  this  stage,  an  adaptive  control  algorithm  is  provided 
for  the  electrical  subsystem  (2).  It  is  noteworthy  that  in 
the  first  stage  we  have  define  a  virual  control  input  Id , 
and  hence,  an  error  term  e/  between  the  armature  current 
and  the  virtual  control  input  arises  at  the  right-hand  side 
of  (9).  Therefore,  we  are  trying,  in  this  stage,  to  design  an 
adaptive  control  approach  for  the  dynamics  of  the  error 
term  e/.  Before  doing  this,  we  first  construct  the  error 
dynamics  of  e/.  Consider  the  electrical  dynamics  (2), 

Li  +  RI  =  u,  (15) 

then  after  some  mathematical  arrangements  we  have 

Ler  +  Kjej  —  —Lid  —  RI  +  Kjej  +  u  (16) 

If  we  let  the  control  voltage  input  be 


m 

*73 

*74 

*?5 

*76 


it  =  LId  +  RI  —  Kitf  4*  v 


where 

L  —  dia(j[Lu  L2,  ■  •  ■ ,  Ln]  and  ft  =  diag[R\,  R2,  •  •  ■ ,  Rn] 
denote  the  estimates  of  L  and  L,  respectivley,  and  v  is 
the  auxiliary  control,  which  will  be  define  later.  Then, 
(17)  becomes 

Lcj  4-  A/ c /  =  Lid  +  RI  4-  v  (18) 

where  L  —  L  —  L  and  R  —  R  —  R  are  the  estimation 
errors.  Now,  let  the  parameter  estimates  be 

Li  —  yi^Ii  Idi  j  *  =  1 1  2,  •  ■  ■  ,  7Z  (19) 

Ri  =  72C/i/f,  t  =  (20) 

where  e/i,  /*,  and  Idi  denote  the  ith  element  of  e/,  I,  and 
Id ,  respectively,  and  71  and  y2  are  the  adaptation  gains 
which  are  positive.  Under  such  design  philosophy,  there 
exists  a  Lyapunov  function  V2  such  that  the  following 
proposition  hold: 

Proposition  3  :  Consider  the  electrical  subsystem  (17). 
If  the  control  input  u  is  design  as  (18)  incorporating  with 
the  adaptation  laws  (20)-(21)  and  Kj  is  selected  to  be 
postive-definite  then  a  Lyapunov  function 

=  r/i'Le,  +  I  T'Ll  +  T *'*?)  (21) 

exist  and  its  time  derivative  along  the  dynamics  (19)-(21) 
satisfies 

V2  <  —  ef  Kiei  4-  cjv 

Proof :  Obviously,  differentiating  U2  with  respect  to  time 
we  obtain 

n 

V2  =  ejLe,  +  J2(7xlLiLi  +  l2lRiki) 

1  =  1 

n 

=  -ej  Kiei  +  ^(e/.L./d,-  +  ej.-fl,/,) 

1  =  1 
71 

4-ef  v  4-  y:(7i  1LiLi  4-  72  1  RiRi) 

i- 1 

Upon  the  substitution  of  (20)-(21),  the  above  equation 
can  then  be  rewritten  as: 

l '2  <  —rj  KjCf  4*  ej  v 

which  completes  the  proof.  Q.E.D. 

In  the  sequel,  the  main  theorem  concerning  the  overall 
closed-loop  system  is  to  be  proposed  based  on  Proposi¬ 
tion  and  Proposition  3. 
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Theorem  1  Consider  the  overall  closed-loop  system  (1)- 

(2).  If  the  control  input  u  and  the  adaptation  laws  are 
elaborated  as  (18)  and  (6) -(8),  (20),  (21),  respectively, 
with  large  enough  (3  and  Kj,  and  the  auxiliary  input  v  is 
designed  as 

v  =  ^n(^--e-e)  (22) 

I<n  =  diag[KnliKn2r--J<nn\ 

I<ni  =  l3(pki  -  -a  -  ei)en,  i  —  1,  2,  •  • ,  n  (23) 

where  Kn  is  the  estimate  of  I<n,  then  all  the  signals  in  the 
closed-loop  system  will  be  stable  in  the  sense  of  Lyapunov. 
Moreover ,  the  tracking  errors  e  and  e  will  converge  to  zero 
asymptotically. 

Proof  :  Consider  a  composite  Lyapunov  function  V  ~ 
Vi  +  V2  +  |£"_i  73 lK*i  as  shown  in  (13)  and  (22),  then 
take  its  first  time  derivative  along  the  closed-loop  system 
dynamics  (9)  and  (19)  and  substitute  (23)  and  (24)  into 
V  to  yields  : 

V  <  ~(m  - -  ^5)||e||2 

“(*»  -  \ *14  -  ^6)||e||2 

~(V3  -  ^T)s  -  ^s)P||2  -  Am(A'/)||e/||2 

i 

where  \m{Kj)  denotes  the  smallest  eigenvalue  of  I<j. 
Again,  by  using  the  results  developed  in  Proposition 
2  and  Propsoition  3  and  invoking  Barbalat’s  Lemma, 
we  can  conclude  that  e,  e,  k ,  and  ej  will  decay  to  zero 
asymptotically,  which  ends  the  stability  proof. 

Q.E.D. 

Remark  :  The  stability  result  shown  above  is  also  called 
the  ”  semi-global  asymptotic  stability”  which  means  that 
the  domain  of  attraction  of  the  closed-loop  system  can 
be  made  infinitely  large  by  way  of  proper  selection  of  0, 
although  is  seems  that  only  local  result  is  guaranteed  in 
the  Lyapunov  analysis. 

4  Simulation  Studies 

In  this  section,  the  advocated  adaptive  tracking  control 
algorithm  is  applied  to  a  two-link  rigid  robot  manipula¬ 
tor  with  D.C.  motor  dynamics  as  an  illustrated  example. 
Several  simulations  are  studied  and  shown  below.  The 
desired  position  trajectories  are  qdx  =  ^(cos(3*)  -  1) 
degree  and  qd2  =  !M(Cos(2*)  -  1)  degree,  and  the  control 
gain  0  is  selected  to  be  10.  The  following  plots  demon¬ 
strate  the  simulation  results.  Fig.(l)  and  Fig.(2)  show  the 


position  tracking  errors.  It  noteworthy  that  if  the  initial 
conditions  on  the  position  errors  are  small  and  the  con¬ 
trol  gain  p  is  made  large  enough,  the  tracking  responses 
will  converge  to  zero  very  fast  within  0.8  second  which 
confirms  the  well-suited  performance.  However,  it  must 
be  paid  at  the  expense  of  large  control  force.  Unfortu¬ 
nately,  no  systematice  approach  can  be  used  to  select  the 
control  gain  0.  It  must  be  chosen  by  iterative  simula¬ 
tions,  and  a  trade-off  between  the  system  response  and 
the  control  gain  0  should  be  made.  Finally,  Fig. (3)  and 
Fig. (4)  depict  the  velocity  tracking  resposes. 

5  Conclusion 

In  this  paper,  we  have  proposed  an  adaptive  trajectory 
tracking  control  method  for  a  multi-link  rigid  manipula¬ 
tor  including  electrical  motor  dynamics.  No  prior  knowl¬ 
edge  on  the  system  parameters  is  required  in  the  control 
algorithm.  Moreover,  the  adaptive  controller  can  be  eas¬ 
ily  implemented,  and  hence,  save  a  large  amount  of  time 
from  the  parctical  point  of  view  since  the  regressor  matrix 
only  contains  the  desired  trajectory  which  can  be  calcu¬ 
lated  off-line.  Simulation  results  also  the  effectiveness  of 
the  devised  algorithm. 
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Abstract 

A  controller  based  on  neural  networks  (NN)  is  proposed 
to  achieve  trajectory  tracking  for  rigid  manipulators. 
The  NN  used  here  has  three  layers  with  one  hidden 
layer.  Elements  of  a  dynamics  model  of  manipulator 
are  approximated  by  this  neural  networks.  These  ap¬ 
proximation  form  a  stable  adaptive  control  law.  The 
NN  weights  are  tuned  on-line.  Performance  of  the  NN 
presented  is  compared  with  that  of  conventional  adap¬ 
tive  controller. 

1.  Adaptive  inverse  dynamics 

In  the  1980s  several  globally  convergent  adaptive  con¬ 
trol  schemes  for  rigid  link  manipulators  appeared  in 
the  literature  [1,  2].  These  schemes  belong  to  several 
distinct  categories  which  were  discussed  in  detail  in  [3], 
The  research  described  in  [4]  and  [5]  uses  the  adaptive 
inverse  dynamics  and  the  tracking  error  to  drive  the 
update  law. 

An  important  property  for  rigid  manipulators  is  that 
the  dynamics  are  linear  in  terms  of  a  suitably  selected 
set  of  robot  parameters,  sometimes  called  the  linear- 
in-parameters  property  [6],  such  that  the  equation  of 
motion  can  be  written  as 

r(t)  =  M  (y>,  P)p  +  C(<p,  p,  P)f>  +  G(v>,  P)  =  Y(<p,  p,  p  )P 

This  property  is  widely  used  in  adaptive  inverse  dynam¬ 
ics  algorithms  for  rigid  manipulators  and  other  adap¬ 
tive  schemes.  Here,  the  mxm  mass  matrix  M  is  a 
function  of  the  m-vector  of  degrees  of  freedom,  p,  and 
the  r  unknown  parameters  P;  Cp  is  the  m-vector  of  the 
centrifugal  and  Coriolis  forces  and  G  is  the  m-vector  of 
the  gravitational  force;  r  is  the  m-vector  of  controls; 
and  Y(p,  p,  ip)  is  an  mxr  regressor  matrix. 

For  controlling  the  motion  of  the  manipulator  so  as 
to  follow  a  given  path,  pd,  an  inverse  dynamics  adap¬ 
tive  controller  (model-based  adaptive  controller  [4]) 
shown  in  Figure  1,  was  used.  In  [4],  it  was  assumed 


Figure  1:  Inverse  dynamics  adaptive  controller. 


that  the  system  with  the  estimated  parameters  P  was 
driven  by  the  following  control  law 

r(t)  =  M(p,  P)pe  +  C{p,  p,  P)p  +  G{p,  P)  (1) 

where  M,  C  and  G  were  estimates  of  M ,  C  and  G 
respectively  and 

^eOO  =  fid  4-  Ave  +  Kpe  e{t)  =  <pd  —  ip  (2) 

where  Kv  and  Kp  were  mxm  positive  constant 
diagonal-gain  matrices  and  e(t)  was  the  tracking  error. 
The  adaptation  law  was  determined  as 

P(t)  =  r  Y'M-'ej  (3) 

where  rrXr  was  a  positive  constant  diagonal  matrix 
and  ej  was  the  filtered  tracking  error  given  as 

e/(0  =  e  -f  \Pe  (4) 

where  ^mxm  also  was  a  positive  constant  diagonal  ma¬ 
trix. 

The  adaptive  inverse  dynamics  approach  is  an 
efficient  means  for  trajectory  control  of  manipulators. 
However,  there  are  some  drawbacks  with  this  and  other 
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adaptive  methods.  Most  of  all,  the  dynamics  model  of 
the  controlled  plant  should  be  known.  Their  tracking 
performance  in  high  speed  operation  may  be  affected 
by  unmodeled  dynamics  if  they  are  not  taken  into 
account. 


2.  Neural  networks  adaptive 
controller 

In  recent  years,  neural  networks  (NN)  have  attracted 
the  attention  of  many  researchers  [7,  8,  9,  10,  11,  12, 
13,  14].  Some  of  the  recent  reviews  of  applications  of 
the  NN  in  robotics  can  be  found  in  [15,  16,  17].  The 
learning  capability  of  the  NN  can  be  used  for  learning 
the  nonlinear  functions  associated  with  the  dynamics 
of  rigid  manipulators.  Upon  learning,  the  NN  would 
be  able  to  generalize  and  respond  effectively  to  unfa¬ 
miliar  situations.  This  type  of  NN,  which  sometimes  is 
referred  to  as  back  propagation,  is  widely  used  for  off¬ 
line  learning  of  the  inverse  dynamics  of  manipulators. 
After  the  off-line  learning  period,  the  NN  controllers 
can  learn  uncertainties  of  manipulators  on-line.  Many 
of  the  NN  used  for  manipulators  have  three  layers  (in¬ 
put,  hidden,  output),  in  which  the  units  in  each  layer 
are  connected  through  weights.  The  weights  are  usually 
initialized  randomly  and  modified  by  error  back  prop¬ 
agation.  The  tracking  error  decreases  as  the  number  of 
learning  trials  increases.  In  comparison  with  adaptive 
controllers  some  researchers  say  that  NN  offer  better 
trajectory  tracking  performance. 

Some  of  the  drawbacks  of  NN  are  as  follows:  Ran¬ 
domly  initialized  weights  may  result  in  poor  control 
and  may  cause  stability  problem.  The  number  of  neu¬ 
rons  required  has  to  be  determined  experimentally.  It 
is  not  guaranteed  that  the  learning  algorithm  will  con¬ 
verge.  In  other  words,  the  networks  may  take  an  in¬ 
definite  amount  of  time  to  be  trained.  For  large  sys¬ 
tems,  with  high  degrees  of  freedom,  slow  adaptation 
will  arise.  The  solutions  of  NN  is  in  general  not  accu¬ 
rate  [16].  Retraining  is  needed  when  the  trajectory  is 
changed.  This  is  a  serious  disadvantage  compared  with 
adaptive  and  robust  controllers. 

An  adaptive  neural  based  controller  for  trajectory 
following  of  rigid  manipulators  was  studied  here  as 
shown  in  Figure  2.  One  of  the  advantages  of  this  NN 
controller  is  that  there  is  no  need  to  replace  the  NN 
when  the  rigid  link  manipulator  is  changed.  A  three 
layer  nonlinear  NN  with  arbitrary  activation  functions 
is  used.  The  neural  network  weights  are  tuned  on-line. 
The  global  asymptotic  stability  of  the  position  tracking 
error  as  well  as  the  boundedness  of  NN  weight  updates 
are  guaranteed. 


A  three  layer  NN  with  one  hidden  layer  has  an  out¬ 
put  Viyi=  1,  •  •  • ,  n3 


where  x  €  Rnl  is  an  input  vector,  a  is  the  acti¬ 
vation  function,  Vjk  the  lst-to-2nd  layer  interconnec¬ 
tion  weights,  Wjk  the  2nd-to-3rd  layer  interconnection 
weights;  0vj  and  0wj  are  threshold  offsets  of  the  second 
and  third  layers  respectively;  nl  and  n3  are  the  num¬ 
ber  of  neurons  in  input  and  output  layers  respectively, 
and  n2  is  the  number  of  neurons  in  hidden  layer. 

Equation  (5)  in  the  matrix  form  is 


X]  vik*k  +  (5) 


n  2  / 

w  =  £  [  Wij 

i= i  V 


y  =  W'viV'x)  (6) 

in  which 

x*  =  [  1  Xx  •  •  •  xnl  ]  y*  =  [  yx  ■■■  yn3  ] 
vi,i  ••• 


w*  = 


L  @vn2  Vn2,l  '  '  -  vn2,nl  ] 

9W1  Wx,x  ■■■  Wi'„2 


Vwn3  Wn3,l  ’  *  *  W>n3,r»2  J 

A  continuous  function  f(x)  can  be  constructed  as 

f(x)  =  Wt<r{Vtx)  +  €(x) 
where  c(a?)  is  a  NN  error  vector. 


(7) 


2.1.  Manipulator  dynamics 

As  before,  the  dynamics  an  m-link  rigid  manipulator 
with  revolute  joint  can  be  written  as 


r(t)  =  M{<p)<p  +  C(<p,  <p)<p  +  G(<p)  +  F(<p)  (8) 

where  the  mxm  mass  matrix  M  is  a  function  of  the  m- 
vector  of  degrees  of  freedom,  (p\  C<p  is  the  m-vector  of 
the  centripetal  and  Coriolis  forces;  G  is  the  m- vector  of 
the  gravitational  force;  F  is  the  m-vector  of  the  friction 
force;  and  r  is  the  m-vector  of  controls.  It  should  be 
noted  that  the  rigid  manipulators  hold  the  following 
properties. 

Property  1.  The  inertia  matrix  M(<p)  is  symmetric 
positive  definite,  and  bounded  by 

mi  I  <  M(<p)  <  mol  (9) 
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where  mi  and  m2  are  known  positive  constants  and  I 
is  an  unit  matrix. 

Property  2.  The  centripetal/Coriolis  matrix, 
C(<p,  ip),  may  be  chosen  such  that  the  matrix  M-2C  is 
skew-symmetric  and  consequently  the  following  holds 

ip\M  -  2C)<p  =  0  (10) 

where  M  is  time  derivative  of  the  inertia  matrix. 

Property  3.  The  following  three  layer  neural  net¬ 
work  was  used  to  approximate  a  nonlinear  function 

fd(<Pd,  'fid,  <Pd)  =  W'^V'x)  -(-  e(a:)  (11) 

<fid,  <Pd)  =  M(<pd)$d  4-  C(tfd ,  <fid)v>d  +  G(lfid)  +  F(lfid) 

where  x*  =  [  <p\  <p d  <pd  ]',  also  V,  W  are  constants 
with  nl  =  n  and  n3  =  m.  It  was  assumed  that  the 
desired  trajectory,  ,  <p*d,  is  bounded;  that  is, 

ll«(*)ll  <  ft  (12) 

in  which  ct  is  a  positive  constant. 

2.2.  The  controller 

The  objective  here  is  to  track  the  motion  of  the  manip¬ 
ulator  with  an  inexact  knowledge  of  the  manipulator 


dynamics.  Using  the  definitions  of  e  and  c j  which  were 
given  in  (2)  and  (4)  respectively,  the  dynamics  of  ma¬ 
nipulator,  (8),  can  be  written  as 

Me}  =  f(tp,<p,<pd,(pd,<pd)-Cef  -t  (13) 
/(v>,  <P,  V>d,  <Pd,  <Pd)  =  M(<f>)(tpd  +  tfe)  +  C(<pd  +  e)  +  G  +  F 

where  was  a  positive  constant  diagonal  matrix. 

The  control  input  was  selected  as 

r=Wta(Vtx)  +  Kvef +I<pe  +  Vr  (14) 

where  W ,  V  are  the  dynamic  estimates  of  W,  V  respec¬ 
tively,  Kv  and  Kp  areraxra  positive  constant  diagonal- 
gain  matrices,  and  Vr(t )  is  a  nonlinear  term  which  com¬ 
pensate  for  robustness  and  disturbance  given  by 

r~  (Sr))\\e/\\  +  6  (15) 

where 

6  =  -7  S  (16) 

with  5(0),  7  as  positive  constants.  The  vectors  S,  rj 
and  z  are  defined  as 

=  [  >7o  m  t)2  r/3  t)4  T)5  ] 

S  =  [l  11*11  IMP  \\W\\j  ||U||/  \\W\\s\\V\\t  ) 

z  =  [et  e)  } 
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where  ||W||j  is  Frobenius  norm  defined  as 

\\W\\)  =  trace(W‘W0  =  J2wh 

hj 

The  parameters  77*  ’s  are  positive  bounding  constants 
which  depend  on  the  desired  trajectory,  physical  prop¬ 
erties  of  the  manipulator,  the  disturbance  bound,  error 
bound  and  also  on  control  gain  matrix  The  param¬ 
eter  tuning  law  for  7 7  is 

fj  =  ASt\\eJ\\  =  -ii  (17) 

where  A  is  a  symmetric  positive  definite  matrix  and 
V  =  V  ~  1- 

As  assumed  in  (12),  the  desired  trajectory  is 
bounded  (||z||  <  Qd  in  which  Qd  is  a  positive  scalar 
constant).  With  the  control  input  given  by  (14),  (15) 
and  (17),  the  neural  network  weights  are  updated  by 

W  =  Acre)  (18) 

V  =  Bx(a'tWef)t  (19) 

where  A  E  An2xn2,  B  E  Rnxn  are  symmetric  posi¬ 
tive  definite  constant  matrices,  and  a '  =  \da(z) / dz\z=z 
and  a '  =  <rt{Vix).  It  was  assumed  here  that  the  ideal 
weights  are  bounded  such  that 

\\V\\f  <  Vm  \\W\\j  <  ||Z!|;  < 

where  Vm ,  Wm ,  Zm  are  positive  constants  and 


With  these  definitions  tracking  error  e,  e  goes  to  zero 
and  the  weight  estimates  V ,  W  are  bounded  ([7,  18]). 

The  controller  (14)  may  be  used  for  any  revolute 
joint  rigid  link  manipulator.  In  this  approach  no  weight 
initialization  problem  exists.  The  weights  1^(0),  W(0) 
are  initialized  as  zero  which  takes  the  NN  out  of  the 
circuit.  The  PD  terms  Kvej  and  Kpe  can  stabilize 
the  manipulator  until  the  NN  begins  to  learn.  This 
means  no  off-line  learning  phase  is  required  for  this  NN 
controller. 

3.  Conclusion 

Neural  network  controllers  have  distinct  adaptive  and 
learning  features  when  dealing  with  uncertainties  and 
unmodeled  dynamics.  With  sufficient  learning  a  small 
tracking  error  and  fast  error  convergence  can  be  ex¬ 
pected  from  the  networks.  Their  drawbacks  are  that 
there  are  no  guarantees  in  the  convergence  of  learning 


algorithms,  and  they  need  retraining  when  the  trajec¬ 
tory  is  changed. 

A  NN  adaptive  controller  for  the  motion  control  of 
rigid  manipulators  was  studied  here.  The  NN  was  used 
to  approximate  the  nonlinear  dynamics  of  the  manip¬ 
ulators.  This  means  that  the  robot  dynamics  does  not 
need  to  be  known  exactly.  Compared  to  adaptive  con¬ 
trol,  this  NN  needs  fewer  assumptions  and  fewer  calcu¬ 
lation  of  explicit  long  expressions. 
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Abstract. 

The  present  article  deals  with  simple  adaptive  motion 
control  of  mechatronic  systems.  The  direct  adaptive 
control  approach  is  used,  as  long  as  theoretically  it 
does  not  require  controller-embedded  model  of  the 
plant,  thus  yielding  simplified  control  realization.  The 
adaptation  is  based  on  adaptive  variation  of  signals 
rather  than  gains.  As  a  result,  fixed-gain  controllers 
can  be  readily  implemented,  which  is  an  unusual  issue 
in  adaptive  control.  Such  control  can  be  easily  imple¬ 
mented,  since  the  demands  on  computer  power  are 
significantly  reduced  and  the  sampling  rate  can  be  kept 
sufficiently  high. 

1.  Introduction 

Robots  are  typical  representatives  of  mechatronic 
systems.  One  aspect  that  will  not  be  treated  here 
concerns  the  design  of  the  mechanical  system,  as  well 
as  the  tools  and  techniques  for  type-dimensional 
synthesis,  analysis  and  optimization  of  robot 
mechanisms.  A  mechatronic  concept  for  structural  and 
functional  design  of  such  mechanisms  can  be  found  in 
[12]. 

There  are  certain  difficulties  in  realizing  of  fast 
motions  with  robots.  The  problems  come  from  the 
strong  couplings  between  individual  joint  motions. 
Non-adaptive  compensation  of  the  couplings  requires 
time-consuming  computations,  since  accurate  enough 
dynamic  models  are  bulky.  From  one  hand,  there  is 
some  decrease  of  the  sampling  rate  associated  with  the 
increased  amount  of  computations.  From  the  other 
hand,  too  slow  sampling  may  deteriorate  the 
performance,  especially  in  the  case  of  fast  motions. 
Actually,  decentralized  fixed-gain  control  is  used  with 
all  commercially  available  controllers.  Such  type  of 
control  results  in  an  inherently  parallel  modular 
structure  of  the  controller,  which  is  most  practical. 

Adaptive  control  is  a  label  assigned  to  a  wide  group 
of  approaches,  which  are  based  on  variations  of  the 


control  inputs  adequately  to  a  priori  unknown 
variations  of  the  plant’s  dynamics  [1].  There  are  two 
widely  recognized  groups  of  adaptive  control:  direct 
and  indirect  ones  [2]. 

Adaptive  control  schemes  based  on  the  indirect 
approach  use  a  controller-embedded  model  of  the  plant. 
The  model  is  with  a  known  structure  but  with  unknown 
parameters.  During  the  normal  operation  of  the  system 
the  model  parameters  are  updated  on-line  in  order  to 
minimize  the  mismatch  between  the  outputs  of  the 
model  and  the  plant  when  they  are  subjected  to  one  and 
the  same  control  input.  Subsequently,  the  control  is 
generated  according  to  the  model.  This  approach  is 
often  referred  to  as  identification-based  adaptive 
control \  as  well  as  explicit  adaptive  control.  However, 
explicit  adaptive  control  may  be  used  within  the  frame 
of  direct  adaptive  control  approach,  too. 

Direct  adaptive  control  schemes  (see  Fig.  2)  are 
based  on  controllers  with  previously  defined  structure 
but  with  unknown  parameters.  During  the  normal 
operation  the  controller’s  parameters  are  updated 
directly  with  the  goal  to  minimize  the  error,  or  some 
other  qualifier  of  the  closed-loop  system  performance. 

2.  Some  previous  results 

The  equation  of  motion  of  an  n  degree-of-freedom 
rigid-body  manipulator  is  described  by 

M(q)q  +  C(q,q)q  +  g(q)  =  u,  (1) 

where  q  is  the  nxl  vector  of  joint  (generalized) 
coordinates;  g(q)  eSR"  is  the  vector  of  gravity  forces; 
M(q)e^”x”  is  the  generalized  inertia  matrix; 
C(q,  q)  £  is  the  matrix  of  Coriolis  and  centrifugal 
effects;  u(f)  is  the  vector  of  generalized  forces;  it 
is  also  the  control. 
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Basic  properties.  The  generalized  inertia  matrix,  M(q), 
is  positive  definite  and  bounded  [3],  e.  3  m,  >  0  and 
3  m2  >  m]  such  that 


Theorem  1.  The  equilibrium  state  e  (t)  =  0  of  the 
system  (1)  is  globally  asymptotically  stable  under  the 
control 


Vq  e<2  =>  m  I„  <  M(q)  <  m  I„  ,  (2) 

where  Q  denotes  the  allowed  joint  space,  and  the  matrix 
inequalities  imply  positive  definiteness  rather  than 
component-wise  inequalities.  The  matrix  of  Coriolis 
and  centrifugal  effects  is  not  uniquely  defined  [5], 
However,  it  can  be  represented  in  a  unique  form  [3] 
such  that 

C(q,  q)  +  CT(q,  q)  =  ^-M  •  (3) 

dl 

Inverse  dynamics  control.  The  goal  of  gross  motion 
robot  control  is  to  achieve  closed-loop  performance  that 
meets  some  previously  defined  criteria.  A  widely  spread 
control  scheme  is  based  on  a  computer-embedded 
inverse  dynamics  model.  This  is  illustrated  on  the 
following  figure  (see  Fig.  1).  The  approach  requires 
precise  inverse  model  (denoted  by  P~')  of  the  plant  (P), 
as  well  as  a  properly  chosen  feedback  controller  ( C ), 
that  stabilizes  the  closed-loop  system. 


Fig.  1  .  Generalized  scheme  of  inverse  dynamics  control. 


The  control  comprises  two  components:  feedback  and 
feed-forward  ones.  The  reference  trajectory,  qr(t),  has  to 
be  available  together  with  its  first  and  second  order 
derivatives.  Unlike  the  computed  torque  method,  the 
generalized  inertia  matrix,  M(q),  is  not  used  as  a  gain 
matrix  here,  and  this  is  most  suitable  for  use  with  direct 
adaptive  control  schemes.  The  computation  of  the  feed¬ 
forward  component,  Ujf,  presents  certain  difficulties, 
since  inverse  models  of  complex  mechatronic  systems 
are  rather  bulky  and  uncertain  to  some  extent.  The  feed¬ 
back  component,  Un,,  may  be  realized  in  a  decentralized 
way.  The  following  theorem  [3,  5]  helps  to  find  such  a 
control. 


u  =M(q)  qr  +  C(q,  q)  qr  +  g(q)  +  Ufb  (4) 

where  uft:=Kpe  +  Kde  is  a  PD  feedback  control, 
e(0  :  =  <lr  0)  —  q  (t)  is  the  tracking  error,  and  the  gain 
matrices,  Kp  and  Kd  ,  are  constant  and  positive 
definite.  g 

The  gain  matrices  in  (4)  may  be  chosen  diagonal,  which 
is  an  important  issue  in  decentralized  control.  In 
addition,  the  closed-loop  system  is  exponentially  stable 
provided  the  feedback  gains  are  properly  chosen,  as 
stated  by  the  following  theorem. 

Theorem  2.  Let  A  >  0  be  a  constant  and  diagonal 
nxn  matrix.  Then  the  equilibrium  state  e  (t)  =  0  of  the 
system  (1)  is  exponentially  stable  under  the  control  (4), 
where  the  gain  matrices  are  related  by 
Kd  =  AKp 

if  there  exists  a>  0  such  that  the  inequality 

Kd  - }  (A  M(q)  +M(q)  A)  - 1  (C  K  d  C)  >  er  In  (5) 
verifies  along  the  actual  trajectory .  □ 

This  theorem  gives  sufficient  conditions  for  exponential 
stability  and  may  be  used  for  initial  (off-line)  setting  of 
the  gains.  For  lack  of  space  proofs  of  Theorem  1  and 
Theorem  2  are  not  quoted  here.  The  reader  is  kindly 
directed  to  references  [5,  6]  instead.  It  should  be  noted 
that  the  direct  method  of  Lypunov  is  used  to  the 
purpose.  Energy-like  Lyapunov  functions  prove  to  be 
very  suitable  for  stability  analysis  of  mechanical 
systems,  and  the  identity  (3)  is  essential  in  the  proofs.  In 
the  case  of  Theorem  2,  a  Lypunov  function  is  found  in 
the  form 

Vexp  =|(e+Ae)T M(q)  (e+Ae) 

(6) 

+  eT  (a  Kd  +  Kd  A  -  A  M(q)  a)  e 

This  function  is  positive  definite  if  inequality  (5)  holds. 

The  feed-forward  component  of  the  control  (4) 
cannot  be  computed  unless  the  structure  and  the 
parameters  of  the  inverse  model  are  known.  The  “true” 
values  of  the  process  parameters  are  often  not  known 
with  sufficient  accuracy.  In  addition,  some  of  them  may 
vary  during  operation  in  a  priori  unknown  manner. 
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3.  Adaptive  control  approaches 

Some  of  the  approaches  exploit  the  linear-in- 
parameters  representation  of  the  inverse  dynamics  [3] 

Y(qr>  qr,  q,q)  p  =Mqr  +  C  qr  +  g 

where  p  is  an  estimate  of  the  vector  of  dynamical 
parameters.  Subsequently,  the  closed-loop  system  error, 
e(t)  =  qr(t)  -  q(t),  is  a  function  of  the  parameters  error, 
p  =  p  -  p ,  and  this  is  used  to  tune  the  estimates  to 
their  “true”  values.  Although  this  is  essentially  an 
explicit  adaptive  control  approach,  it  falls  into  the 
group  of  direct  adaptive  control,  as  long  as  it  is  based 
on  computer-embedded  inverse  dynamics  model.  The 
generalized  scheme  of  direct  adaptive  control  is  shown 
in  Fig.  2.  The  goal  is  to  reduce  the  mismatch  between 
the  plant  output  and  the  output  of  a  reference  model 
(RM)  when  subjected  to  one  and  the  same  reference 
input  qr.  As  it  is  usually  assumed,  the  reference  model 
is  identity.  The  filter  (F)  is  essentially  of  differentiating 
character  and  it  is  designed  with  respect  to  closed-loop 
stability.  When  the  reference  model  is  identity,  then  the 
goal  ef  =  0  relates  actually  to  stable  reference  phase- 
plane  trajectories.  In  the  particular  case  under 
consideration,  these  trajectories  are  described  by  the 
equation  e  +  A  e  =  0.  The  guidance  of  the  closed-loop 
system  along  these  phase-plane  trajectories  is  per¬ 
formed  by  proper  variations  of  the  controller’s 
parameters,  pc,  according  to  the  rules  of  a  properly 
chosen  adaptation  algorithm  (AA). 


Fig.  2.  Direct  adaptive  control  scheme. 

Typically,  explicit  adaptive  control  is  used  within 
the  group  of  indirect  adaptive  control  approaches.  An 
example  is  the  work  of  Craig  et  al.  [1],  where  the 


control  synthesis  is  related  to  the  computed-torque 
method. 

The  common  advantage  of  the  methods  that  are 
based  on  structurally  perfect  dynamical  model  is  the 
opportunity  to  prove  the  asymptotical  stability  of  the 
closed-loop  system.  However,  the  amount  of  calcula¬ 
tions,  that  is  necessary  to  be  performed  in  real  time,  is 
too  large  to  allow  effective  implementations.  In 
addition,  the  synthesized  controller  is  robot-dedicated 
and  it  is  with  centralized  structure. 

Actually,  decentralized  controllers  are  used  with  all 
commercially  available  robots.  Therefore,  the  develop¬ 
ment  of  decentralized  adaptive  controllers  is  of  prac¬ 
tical  interest.  The  prevailing  part  of  adaptive  control 
approaches  make  use  of  simple  dynamical  models  of 
individual  joint  motions,  and  they  are  usually  models  of 
a  second  order  plant.  The  parameters  of  the  model  are 
estimated  on-line,  and  subsequently  the  control  is  syn¬ 
thesized  on  the  base  of  the  current  estimates.  Thus, 
most  of  these  adaptive  control  approaches  are  indirect 
ones. 

4.  A  simple  adaptive  control 

In  this  section  a  simple  direct  adaptive  control  is 
discussed.  The  approach  is  essentially  of  the  type 
shown  in  Fig.  2.  However,  instead  of  tuning  the 
controller  gains  in  compliance  with  the  widely  accepted 
practice,  here  the  adaptive  tuning  relates  to  the 
generation  and  tuning  of  adaptive  signals.  This  is 
obtained  through  some  simplifications  that  prove  to  be 
quite  adequate. 

As  a  clear  result,  the  gains  may  be  kept  constant,  so 
they  may  be  tuned  off-line.  Thus,  it  is  possible  to 
reduce  the  undesired  effects  during  the  transients,  and 
robust  controller  design  may  be  used. 

The  controller  in  a  direct  control  scheme  is  with  a 
previously  defined  structure,  but  with  unknown  (or  not 
a  priori  tuned)  parameters.  The  structure  of  the 
controller  is  adopted  using  the  available  information 
about  the  plant  or  the  experience  in  the  control  of 
similar  plants.  Concerning  mechatronic  sy stems, PD,  PI 
or  PID  controllers  are  used  quite  often.  Generally,  the 
feedback  control  is  obtained  in  the  form 

u  “  Y(s)  pc,  (7) 
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where  s  is  the  vector  of  output  signals,  Y(s)  is  a  known 
matrix-valued  function,  defined  by  the  adopted 
controller  structure,  and  pc  is  the  vector  of  controller 
parameters. 

In  case  of  decentralized  control,  which  is  of  major 
interest  in  this  paper,  the  individual  control  actions  are 
generated  in  accordance  with  the  generalized  law 

«/  =  g.TSi,  (8) 


where  ut  is  the  /- th  local  control  action,  gj  is  the  vector 
of  local  gains,  and  Sj  is  the  vector  of  local  signals 
(outputs)  used.  For  instance,  in  the  case  of  PD  control 
we  have 


=  kpt  e,  +  kdi  e{  =>  g,  = 


_ 

kpi 

;  Sj  = 

V 

.kdi  _ 

fi  _ 

(9) 


An  approach  to  adaptive  adjustment  of  the  PD  gains  is 
proposed  by  Dubowsky  and  DesForges  [9].  Similar 
issues  in  direct  adaptive  control  are  considered  in  other 
references,  too,  e.  g.  see  [5,  6].  Here  we  utilize  a 
different  approach,  which  is  aimed  at  the  use  of  the 
rigorous  results  for  inverse  dynamics  control,  presented 
in  Section  2.  The  control  is  generated  in  accordance 
with  a  governing  law  that  bears  resemblance  with  the 
law  (8): 


"I =  &T  (Si  +  Pi)  =  giT  Si  +  giT  a;  (t),  (10) 

where  the  second  component  of  the  local  control  is 
destined  to  emulate  the  corresponding  entry  in  the 
output  of  the  inverse  dynamics  model,  and  as  is  the 
vector  of  additional  (auxiliary)  signals  [8],  that  is  to  be 
adaptively  tuned  in  order  to  obtain  stable  closed-loop 
performance.  In  particular,  assume  that  the  local 
feedback  control  is  with  the  structure  given  by  (9), 
where  kpi  =  X,  kdi  ,  and  Xt  >  0,  /—  1  ...  n ,  are  the 
diagonal  entries  of  the  A  matrix  in  (5),  chosen  in  such  a 
way  that  the  inequality  (5)  of  Theorem  2  holds  for  the 
particular  device  and  for  a  set  of  desired  trajectories. 
Then  the  control  may  be  represented  in  the  form: 

u  =  Kpe  +  Kd  e  +  Y  pc-Ypc,  (11) 


where  pj  -  [a/,  a2T,  ...  anT]  is  the  vector  of  “true” 
parameters,  Y  =  diag  [kdh  kd2 >  ...  kdn  ]  is  a  constant 
matrix,  and  pc  :=  pc  -  pc  is  the  parametric  error. 


Note,  that  the  “true”  parameters  may  be  chosen  theore¬ 
tically  in  such  a  way  that  the  Y  pc  component  of  the 
control  (11)  emulates  the  output  of  the  inverse  dyna¬ 
mics  model,  since  the.Y  matrix  is  non-singular.  If  this  is 
the  case,  the  undisturbed  closed-loop  system  must  be 
exponentially  stable,  as  it  follows  from  Theorem  2.  The 
use  of  the  decentralized  inverse  dynamics  model 
concept  [5,  6]  is  quite  satisfactory  to  the  purpose.  For 
lack  of  space  we  do  not  discuss  it  in  details.  The 
concept  requires  that  st{  (/=  1,2,  ...  n)  be  functions  of 
the  local  reference  joint  accelerations  and  velocities. 
However,  in  order  to  simplify  the  controller  structure, 
the  simplest  possible  structure  of  these  adaptive  signals 
may  be  assumed.  This  reduces  significantly  the  number 
of  adaptive  parameters  per  joint,  while  the  performance 
does  not  degrade  dramatically. 

The  adaptive  adjusting  of  the  adaptive  parameters 
ai9  i=l,2,  ...  n ,  is  performed  according  to  stability 
requirements,  and  it  is  treated  in  the  following  section. 
Actually,  these  parameters  may  be  treated  as  additional 
inputs  that  are  fed  into  the  system. 

5.  Stability  issues 

Note  that  with  the  control  (1 1)  the  system  (1)  may  be 
represented  in  the  generalized  closed-loop  form 

x  ~  A(x,  Pc )  x  +  B  (x,  pc)  pc  (12) 

where  xT  =  [eT,  e  T]  is  the  system  state.  By  virtue  of 
Theorem  1  and  Theorem  2  this  system  is  stable, 
provided  the  “true”  parameters  emulate  perfectly  the 
output  of  the  inverse  dynamics  model,  and  the 
parametric  error  is  zero,  i.  e.  if  pc  =  0.  Naturally,  the 

true”  parameters  should  not  be  constant.  However, 
they  are.  small  in  amplitude,  as  demonstrated  by 
previous  researches  [7,  8].  In  order  to  develop  the 
adaptive  tuning  of  the  controller  parameters  we  need  a 
result,  outlined  by  the  following  lemma. 

Lemma.  Consider  the  system  (12)  where  p  is 
assumed  to  be  a  vector  of  constant  but  unknown 
parameters.  Let  F  >  0  be  a  constant  positive  definite 
matrix,  and  let  for  the  undisturbed  system  there  exists 
the  positive  definite  quadratic  function 

v=lxTVx  (13) 
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where  V  =  V(x,  pc )  is  a  positive  definite  matrix ,  such 

that  its  derivative  is  negative  definite  along  the  actual 
trajectory: 

v  =  --xT  W  x  (14) 

2 

where  W  =  W(x,  pc)  is  a  positive  definite  matrix. 
Then  the  application  of  the  updating  law 

pc  =  r  bt  v  x(t)  (is) 

makes  the  state  x  =  0  an  asymptotically  stable 
equilibrium  of  the  closed-loop  system.  □ 

Proof.  The  direct  method  of  Lyapunov  is  used  and  the 
proof  is  given  schematically.  Consider  the  quadratic 
positive  definite  function 

va  —  ixTvx+  ±  pcTr'  pc 

which  is  a  legitimate  Lyapunov  function  candidate.  Its 
derivative  proves  to  be 

va=-xTWx+  pcT(BTVx+r'^c)  (16) 

Taking  into  account  that  pc  =  -  pc ,  since  the  “true” 

parameters  are  constant,  as  well  as  the  updating  law 
(15),  the  derivative  reduces  down  to 

va  =  -xT  W  x  . 

Thus,  the  derivative  is  a  negative  definite  function  of  x. 
Therefore,  x(t)  tends  to  zero  asymptotically.  □ 

It  is  important  to  note  that  if  the  “true”  parameters  are 
not  constant,  then  the  derivative  (16)  is 

va  =— xT  wx  +  pcTr'  pc. 

Therefore,  the  equilibrium  x  =  0  is  no  longer  asympto- 
tically  stable.  However,  it  proves  to  be  practically 
stable,  i.  e.  the  state  tends  to  some  vicinity  of  the  origin 
as  the  time  tends  to  infinity.  The  corresponding  region 
depends  on  the  parametric  error,  as  well  as  on  its 
derivative. 

Taking  into  account  the  result  outlined  by  the 
lemma ,  and  on  the  base  of  the  energy-like  Lyapunov 


function  (6),  applied  to  the  system  (1)  with  the  control 
parameterization  (10)— (1 1),  the  adjusting  of  the 
parameters  is  obtained  in  the  form 

=  T  Yt  ef(t) ,  (17) 

where  T  is  a  properly  chosen  diagonal  positive  definite 
matrix  and  e^t) :  =  e  (t)  +  A  e(t)  is  the  “filtered  error” 
(A  is  the  diagonal  and  constant  positive  definite  matrix 
introduced  earlier). 

Usually,  the  adaptation  gains  are  considered  as 
“magic”  parameters,  and  their  proper  values  are  found 
by  experience.  However,  the  adaptation  gains  may  be 
subjected  to  adaptive  variations,  too.  This  aspect  is  not 
considered  here,  since  the  related  theory  is  too 
complex. 

6.  Self-learning  issues 

The  perfect  match  of  adaptive  and  self-learning  (or 
repetitive)  control  has  been  put  forward  by  some 
authors,  e.  g.  see  [11].  The  type  of  control  presented 
here  has  a  very  easy  and  natural  issue  to  self-learning 
control  [10].  Note  that  the  adaptive  control  parameters 
may  be  treated  as  signals  additional  to  the  reference 
inputs. 

The  main  idea  is  illustrated  by  a  simple  paradigm. 
Using  fixed  gains  and  no  adaptation  in  the  control  there 
should  be  tracking  error,  i.  e.  the  actual  trajectory  will 
differ  from  the  reference  one.  However,  there  exists  a 
modified  reference  trajectory  such  that  it  will  results  in 
exactly  the  goal  (non-modified)  trajectory. 

This  modified  trajectory  can  be  obtained  on  the  base 
of  the  adaptive  control.  The  modified  trajectory  is 
obtained  in  the  following  iterative  way 

qr  (<•  k)=  q,  (l,  k  -  1)+  a  (/,  k) 

where  k  is  the  pass  number,  qr  (/,  0)  =  qr  (/),  and  the 
additive  component  is  obtained  on  the  base  of  the 
adaptive  control.  In  the  case  of  PD  control,  where  the 
gain  matrices  are  related  by  Kp  =  AKd,  the  additive 

component  proves  to  be  a  solution  of  the  following 
differential  equation: 

a  (t,  k)  =  pc {t,  ^-1)-Aa(/,  k) 
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where  pc(Y,  i)  is  the  vector  of  adaptive  parameters 
obtained  during  pass  i.  The  use  of  self-learning  control 
in  a  combination  with  adaptive  control  of  the  type 
presented  here  yields  very  good  results  especially  with 
smooth  reference  trajectories. 

An  important  advantage  comes  from  the  fact  that  the 
additional  inputs  can  be  easily  truncated  in  order  to 
keep  the  transients  within  the  limits.  Unlike  the  typical 
self-learning  techniques,  here  the  feed-forward  compo¬ 
nent  of  the  control  is  expressed  in  terms  of  additive 
reference  trajectory. 

7.  Conclusion 

The  article  treats  a  direct  adaptive  control  technique 
that  is  based  on  decentralized  fixed-gain  control,  alike 
the  one  used  with  all  commercially  available  robot 
controllers.  Such  type  of  control  results  in  an  inherently 
parallel  modular  structure  of  the  controller,  which  is 
most  practical.  The  technique  falls  into  the  group  of 
simplified  adaptive  control  [4],  and  it  can  be  easily 
implemented,  since  there  is  no  need  of  extensive 
computational  capacity  and  the  sampling  rate  can  be 
easily  kept  sufficiently  high. 

The  control  presented  here  is  based  on  adaptive 
variation  of  signals  rather  than  gains  and  it  is  closely 
related  to  standard  fixed-gain  feedback  control,  in 
contrast  to  the  well  known  adaptive  control  approaches 
with  proven  asymptotically  stable  equilibrium  state  (see 
[1]  -  [3]),  where  the  use  of  computer-embedded  model 
is  essential.  The  experimental  investigations  show  that 
the  adaptive  control  based  on  auxiliary  signals  is  a 
much  better  choice,  when  compared  to  decentralized 
control  with  variable  adaptive  gains. 

Better  experimental  results  are  obtained  with  higher 
sampling  rates.  The  computational  scheme  is  rather 
simple,  and  it  allows  higher  sampling  rates  be  realized 
easily. 
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Abstract 

This  paper  presents  a  new  robot  controller  based  on 
artificial  neural  networks  (ANNs)  and  capable  to 
compensate  changes  in  inertial  model  parameters.  The 
approach  involved  uses  a  decomposition  of  the 
computed  torque  method  into  two  parts ,  fixed  and 
variable .  The  both  fixed  and  variable  torque  are 
generated  by  an  ANN.  Another  ANN  is  employed  to 
estimate  the  inertial  parameters.  The  inputs  are  fed  to 
ANN  to  the  variable  torque  as  in  the  case  of  a 
conventional  feedforward  controller.  To  reduce  the 
tracking  error  due  to  unmodeled  dynamics  and  frictions 
we  use  a  decentralized  adaptive  PD  controller.  High 
tracking  performance  from  the  proposed  control 
approach  are  shown  by  the  results  of  simulation  and 
compared  with  the  conventional  computed  torque 
method. 

1.  Introduction 

For  the  past  several  years,  the  neural  computation 
approach  has  emerged  [1,  2,  3,  4,  5]  to  tackle  problems 
for  which  several  conventional  computational 
approaches  have  been  proven  ineffective.  Hence,  there 
have  been  a  lot  of  interests  in  applying  artificial  neural 
networks  (ANNs)  to  solve  the  problems  of  identification 
and  control  of  complex  nonlinear  system  by  exploiting 
the  nonlinear  abilities  of  the  ANN  [3,  5,  6,  7]. 

The  robot  manipulator  control  is  often  confronted  with 
the  highly  complex  nature  of  robot  dynamics  which 
present  uncertainties  like  [8,  9,  10]: 

•  Imprecision  on  the  inertial  and  geometrical 
parameters. 

•  Payload  variations. 

•  Imprecision  on  the  unmodeled  dynamics. 

The  fundamental  approach  for  robot  manipulator 
control  is  the  model-based  computed  torque  control 
scheme  ini  as  shown  in  Figure  1.  This  control 


technique  can  offer  some  required  qualities,  of 
robustness  and  precision  if  the  robot  model  will  be 
accurate.  Variations  of  this  basic  idea  can  be  found  in 
other  designs  [12,  13].  ANN  can  also  be  trained  off-line 
to  learn  the  nonlinear  robot  dynamics,  and  then  is  used 
to  implement  the  computed  torque  controller  with 
further  on-line  modifications  to  account  for 
uncertainties.  However,  the  difficulty  in  implementing 
this  approach  is  met  in  the  important  time  required  to 
update  on-line  the  weights.  Furthermore,  this  approach 
can  not  overcome  the  large  variation  of  parameters  in 
the  dynamic  model. 

The  adaptive  control  could  be  felt  an  attractive 
solution  to  control  robot  manipulator  since  it  does  not 
require  an  accurate  model  [14,  15].  However,  as  well 
known,  it  can  not  perform  good  results  at  high  speed.  In 
addition,  a  general  drawback  of  an  adaptive  scheme  is 
that  the  computational  requirements  for  real-time 
parameters  identification,  and  sensitivities  to  numerical 
precision  and  observation  noise,  increases  undesirably 
with  the  number  of  the  system  state  variables  [7]. 

To  include  parameter  estimation  in  the  derivation  of 
the  robot  control  scheme,  Leahy  et  al  [16]  have  proposed 
an  adaptive  model-based  neural  network  controller 
(AMBNNC)  incorporating  a  neural  network  trained  to 
provide  a  feedforward  model  with  a  payload  mass 
estimator. 

The  feedforward  torque  adaptation  is  performed  using 
the  tracking  error.  But,  this  can  mislead  payload  mass 
estimation  since  this  error  may  be  due  to  other 
perturbations  such  as  frictions. 

In  this  paper  we  investigate  a  new  robot  control 
technique  as  depicted  in  Figure  2.  This  control  approach 
is  different  from  that  of  Figure  1.  It  is  proposed  in  the 
present  work  a  decomposition  of  dynamic  model  into 
parts,  fixed  and  variable.  ANNs,  are  used  to  map  the 
both  parts.  The  fixed  part,  containing  the  constant  term 
provide  the  nominal  torque  based  on  the  dynamic  model. 
The  variable  part,  permits  the  tracking  of  the  inertial 
parameters  variation.  This  decomposition  should 


159 


provide  benefits  in  the  reduction  of  the  ANN  size  and 
avoid  the  need  for  an  on-line  adaptation  of  the  ANN 
weights.  Also,  to  reduce  influence  of  other  perturbations, 
like  wise  unmodeled  dynamics,  a  decentralized  adaptive 
proportional-derivative  (PD)  controller  has  been  used 
and  its  output  is  added  to  the  feedforward  torque. 

The  structure  of  the  paper  is  as  follows.  In  Section  2  the 
robot  dynamic  model  is  mentioned.  Section  3  presents 
the  computed  torque  control.  The  proposed  Neural 
Decomposed  Adaptive  Dynamic  Control  is  described  in 
Section  4.  Simulation  results  of  the  proposed  technique 
and  a  comparison  with  the  computed  torque  method  of  a 
two-link  direct  drive  manipulator  are  given  in  section  5. 
Section  6  draws  some  conclusions. 

2.  Robot  Dynamic  Model 

The  dynamics  of  an  n-joints  robot  manipulator  can  be 
described  by  the  nonlinear  and  coupled  second  order 
vector  differential  equation: 

T  =  M(0  )0  +  C(d,  0)  +  G(0)  +  F(0)  (l) 

where  0  is  the  nxl  vector  of  joint  angular  positions,  T  is 
the  nxl  vector  of  applied  joint  torque,  M(6)  is  the  nxn 
symmetric  positive-definite  inertia  matrix,  C(0,G)  is  the 

nxl  coriolis  and  centrifugal  torque  vector,  G(0)  is  the 
nxl  gravity  loading  vector,  and  F(0)  is  the  n  x  1 
frictional  torque  vector.  For  convenience,  let  us  denote 
H(0, 0)  =  C( 0,0)  +  G(0)  so  that  (1)  can  be 
rewritten  as  : 

T  =  M(0  )0+H(0, 0)+F(0)  (2) 

The  robot  dynamic  equation  (2)  is  highly  nonlinear  and 
coupled  multi-input  multi-output  system.  In  most 
practical  cases,  the  model  is  affected  by  some 
uncertainties  due  to  the  imprecision  on  the  model 
parameters  [17]. 

3.  Neural  Computed  Torque  Control 

The  model-based  computed  torque  method  [12,  13,  17] 
is  the  basic  approach  to  robot  control.  The  control  law 
can  be  written  as  : 

T  =  M(0)u  +H(0,0)  (3) 

where  M(0)  and  H( 0,0 )  are  estimates  of  M(0)  and 
H( 0,0),  and  u  is  given  by 

u=e+Kp(ed-o)+Kv(dd-e)  (4) 


where  Kp  and  Kv  are  nxn  symmetric  positive  definite 
gain  matrices.  Combining  (2),  (3),  and  (4)  yields  the 
closed  loop  dynamic  equation 

e+Kj+Kpe=tirl(M(0)0+/H(0,0)+F(0))  (5) 

where 

AM (0)  =  M(0)  - M(0 )f  AH (0, 0)  =  H(0 , 0)- 

H( 0,0),  and e=(0d-  9).  \f  M  -  M,  H  -H ,  and 

F  =  0  then  equation  (5)  becomes  the  following  ideal 
second  order  equation : 

e  +  Kve  +  Kpe  =  0  (6) 

where  Kp  and  Kv  can  be  chosen  to  achieve  decoupled 
critically-damped  or  over-damped  system  response. 

Since  there  are  always  uncertainties  in  the  robot 
dynamic  model,  the  ideal  error  response  (6)  can  not  be 
achieved  and  the  performance  is  degraded  as  indicated 
in  (5).  Thus  the  computed  torque  based  control  is  not 
robust  in  practice[17]. 

4.  Neural  Decomposed  Adaptive  Computed 
Torque  Control 

A  new  robot  control  scheme  is  proposed  in  this  paper  as 
depicted  in  Figure  2.  The  basic  concept  of  this  scheme  is 
that  the  dynamic  model  of  robot  is  decomposed  into  two 
parts : 

Part-1  is  fixed  and  contains  terms  with  fixed  inertial 
parameters. 

Part-2  is  variable  and  contains  terms  with  variable 
inertial  parameters. 

Thus,  the  feedforward  torque  is  given  by  : 

T  =  T,  +  T2  (7) 

T,  =MI(0d,PfJU+H1(0d,0d,Pf)  (8) 

T2  =M2(0d,PJU+H2(0d,0d,PJ  (9) 

Where  r7  is  the  n  x  1  vector  of  the  fixed  torque  and  T2 
is  the  n  x  1  vector  of  the  variable  torque.  Mj  and  H}  are 
respectively,  the  inertial  matrix  and  the  coriolis  and 
centrifugal  vector  containing  fixed  parameters.  M2  and 
H2  are,  respectively,  the  inertial  matrix  and  the  coriolis 
and  centrifugal  vector  containing  parameters.  Pj  is  the 
vector  of  the  fixed  inertial  parameters,  and  Pv  is  the 
vector  of  variable  inertial  parameters  as  a  function  of 
time.  U  is  given  by 


Tfb,  ^MOJ+kJtAO) 


(12) 


u=e+Tjb  do) 

This  decomposition  has  the  advantage  of  simplifying  the 
problem  by  avoiding  the  need  of  a  precise  model.  In 
that,  the  partial  robot  dynamic  knowledge  can  be 
sufficient  for  the  fixed  part  to  provide  the  nominal 
torque.  The  variable  part,  based  on  parameter 
estimation,  compensates  high  changes  in  the  system 
dynamic  during  the  robot  motion. 

Two  ANNs  are  trained  off-line  to  map  M2  and  H}  of  the 
fixed  torque  Tj.  The  variable  torque  T2  is  determined  as 
a  conventional  feedforward  controller  using  the 
propriety  of  being  linear  in  its  parameters  [9,10]. 
Consequently,  equation  (10)  can  be  rewritten  as 

T2  =(M2inf(0)U  +  H2mf(0,0))Pv  (11) 

where  M 2inf( 0)  +  H2inf( 0,0)  represents  the  n  x  m 

kinematic  information  matrix  that  describes  the 
motion  of  links  and  also  referred  to,  in  the  literature,  as 
the  observation  matrix.  Each  element  Pvi  of  the  vector  Pv 
can  be  an  independent  parameter  or  a  linear 
combination  of  others  parameters. 

In  the  present  case  an  ANN  can  be  used  as  usual  to  map 
the  variable  torque.  However,  it  will  necessary  be  of  an 
important  size  which  may  further  increase  with  the 
complexity  of  the  variable  torque  terms. 

But,  by  employing  the  linear  propriety  in  the  parameters 
of  the  variable  torque  one  can  reduce  the  neural  network 
size.  Hence,  the  ANNs  are  not  used  to  map  the  whole 
variable  torque  as  in  [18]  but  only  M2inf  and  H2inf  which 
includes  the  nonlinear  terms  [19],  So,  as  shown  in 
equation  (1 1),  the  variable  torque  will  be  computed  with 
the  ANN  output  and  the  parameters  estimation. 

The  parameter  estimator  uses  a  multilayer  ANN  trained 
off-line  in  order  to  determine  the  value  of  the  variable 
parameters  during  manipulator  motion.  The  inputs,  fed 
to  the  neural  nets  during  the  training,  are  vectors 
containing  normalized  values  of  the  position,  velocity, 
acceleration  and  applied  torque.  Notice  that  the 
parameters  estimation  acts  here  as  a  dynamical  control 
since  it  allows  the  variable  torque  to  track  changes  in  the 
system  dynamic. 

To  deal  with  unmodeled  uncertainties  and  other 
perturbation  on  the  robot  system  a  decentralized 
adaptive  PD  control  action  is  added  to  the  feedforward 
control  part.  This  adaptive  action  has  been  used  by 
Seraji  [20]  who  considered  each  joint  as  a  subsystem 
with  inertial  coupling,  Coriolis,  centrifugal,  friction,  and 
gravity  terms  as  in  the  equation  (2). 

The  adaptive  feedback  independent  joint  controller 
dedicated  to  the  /th  joint  is  described  by 


where  e/t)  and  e,(t)  are  the  position  and  velocity 
tracking  errors  of  the  /th  joint. 

The  feedback  controller  adaptation  law  which  ensures 
asymptotic  trajectory  tracking  is  based  on  the  weighted 
error  r,(t)  described  as  follows  [20]: 

The  weighted  error  and  the  feedback  gains  are  given  by 
the  expressions  below : 

ri(t)=Gpiei(t)+GJi(t)  (13) 

t 

kijO)  =  kij(0)  +  aij  \ri(t)e<in(t)dt  j=0  ]  (14) 

0 

where  a#  are  any  positive  scalar  integral  adaptation 
gains,  and  {Gp„  Gvi  }  are  positive  scalar  weighting 
factors  which  reflect  the  relative  significance  of  the 
position  and  velocity  errors.  The  superscript  “(/')”  denote 
the/th  derivative  with  respect  to  the  time  parameter,  and 
is  =  dejdt . 

In  our  case  e'0)  and  e,1'  are  the  position  error  e,  and  the 
velocity  error  e„  respectively. 

In  digital  control  implementation,  the  auxiliary  signal 
and  the  feedback  gains  controller,  at  each  sampling 
instant  N,  are  obtained  from  equations  (9)  and  (10)  by 
the  following  simple  recursive  adaptation  procedure 
[20]: 

ri(N)  =  Gpiei(N)  +  GJi(N)  (15) 

k/NJ =k/N-l)+apfri(N)ei(N)+r/N~l)e/N-l) ]  (lg) 
K(N)  =KfN~V+^i[r/N)e/N)+t-(N-l)efN~l) ] 

This  decentralized  feedback  control  can  highly  fast  be 
computed  since  the  controller  gains  are  obtained  from 
simple  adaptation  laws.  Also,  here  the  feedback  control 
action  can  be  evaluated  very  rapidly  than  the 
conventional  adaptive  algorithms.  Finally,  this  feedback 
controller  has  a  veiy  simple  structure  with  an  auxiliary 
signal  and  an  adaptive  position-velocity  feedback  gain. 

5.  Simulation  Results 

Simulations  were  done  to  verify  the  proposed  approach 
and  to  compare  it  with  the  neural  computed  torque 
method.  A  two  link  robot  manipulator  whose 
parameters  are  taken  from  the  first  two  links  of  PUMA 
560  robot  are  used  for  simulation  studies  [20].  For  our 
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application  we  consider  only  the  payload  mass  variation 
as  parameter  to  estimate  and  the  friction  parameters  as 
perturbation  on  the  system.  To  implement  the  computed 
torque  control  two  different  ANNs  NN1,  NN2  are  used 
to  identify  the  inertial  matrix,  the  coriolis  and 
centrifugal  term,  respectively.  Both  the  two  networks  are 
multilayered  where  the  structures  are  (2/8/4)  and 
(4/20/10/2)  for  NN1  and  NN2,  respectively.  The  PD 
gains  are  selected  as  *,,=100  and  *,,=60  for  each  joint. 
To  implement  the  proposed  approach  we  use  multilayer 
neural  networks  to  map  Mt  and  M2tnf,  have  the  same  size 
with  2  input  nodes,  8  nodes  in  the  hidden  layer  and  4 
nodes  in  the  output  layer.  Also  H,,  and  H2mf  have  the 
same  size  with  4  input  nodes,  20  hidden  nodes  and  2 
output  nodes.  The  neural  network  estimator  must 
provide  the  payload  mass  value.  It  consist  of  6  input 
nodes,  20  nodes  per  each  of  two  hidden  layers,  and  1 
output  node.  The  all  ANNs  are  trained  on  a  random  data 
uniformly  distributed.  After  more  trial,  the 
adaptation  gains  for  the  decentralized  adaptive  PD 
action  are  chosen  as  show  in  table  I. 

Table  I. 

Adaptation  gains  parameters. 


Joint 

_&J 

Gy 

kpo 

kyO 

r0 

OCo 

a} 

1 

40 

30 

40 

30 

0 

100 

800 

2 

40 

30 

40 

30 

0 

100 

800 

Figures  (3-10)  show  the  simulation  results  of 
trajectories  control  by  the  computed  torque  method  and 
the  proposed  control  approach.  For  comparison.  Figure  3 
and  4  show  the  obtained  position  trajectory  for  joint  1 
and  2.  We  establish  that  the  both  methods  provide  a 
good  trajectory  tracking.  But  the  neural  adaptive 
dynamic  control  performed  better  than  the  computed 
torque  method  as  prove  in  Figures  5  and  6,  where  the 
errors  position  obtained  by  the  computed  torque  method 
is  more  important  than  those  obtained  by  the  proposed 
approach.  These  performances  were  again  confirmed  by 
the  velocity  trajectories  as  show  in  Figures  (7-10).  We 
have  remarked  that  maximum  errors  correspond  with 
the  abrupt  change  in  the  payload  mass. 

6.  Conclusion 

A  new  neural  network  control  for  robot  manipulator 
is  presented  in  this  paper.  Unlike  the  classic  computed 
torque  method  the  decomposition  of  the  robot  dynamic 
model,  in  the  proposed  approach,  into  fixed  and  variable 
parts  has  provide  a  better  robustness  via  the  change  in 
the  variable  dynamic  parameters.  The  decentralized 
adaptive  PD  compensator  was  able  to  overcome  the 


errors  due  to  the  unmodeled  dynamic  (frictions  in  our 
case)  and  the  estimation  errors. 


Figure  1.  Neural  Computed  Torque  Control. 


Figure  2.  Neural  Decomposed  Adaptive  Computed 
Torque  Control. 
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Figure  10  velocity  error  of  joint  2 
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Abstract 

This  paper  provides  a  summary  of  research  done  on  the 
adaptive  control  of  flexible  manipulators .  Such  manip¬ 
ulators,  due  to  their  generally  lighter  masses,  should  be 
of  particular  interest  in  space  applications . 

1.  Introduction 

Rigid  robot  control  is  based  on  the  assumption  that 
the  transmissions  are  stiff  and  that  the  links  are  rigid. 
Such  manipulators  are  heavy  (load- carrying  capacity  is 
typically  5%  to  10%  of  their  own  weight  [1]),  consume 
considerable  power,  and  are  generally  impractical  for 
high  speed  maneuvers.  On  the  other  hand,  light  ma¬ 
nipulators  have  links  that  deform  significantly  in  high 
speed  operations  and  therefore  their  flexibility  cannot 
be  ignored.  Flexibility  can  also  be  present  in  the  joints; 
that,  is,  gear  elasticity,  flexibility  in  the  transmissions 
between  actuators  and  links,  shaft  wind  up,  etc.  Me¬ 
chanical  flexibility  often  generates  severe  problems  for 
such  controllers  due  to  structural  vibrations.  Two  ap¬ 
proaches  have  been  used  for  vibration  reduction:  in¬ 
put  command  pre-shaping  [2]  and  closed-loop  feedback 
[3].  If  the  controller  was  designed  with  the  assump¬ 
tion  of  perfect  rigidity,  flexibility  in  the  links  or  joints 
can  decrease  tracking  capabilities  and  cause  instabil¬ 
ity.  Therefore  flexibilities  should  be  taken  into  account 
in  the  controller  design.  In  addition,  for  flexible  sys¬ 
tems  the  number  of  degrees  of  freedom,  which  theoret¬ 
ically  for  a  continuous  system  becomes  infinite,  exceeds 
significantly  the  number  of  actuators  and  the  number 
of  output  variables.  This  introduces  some  extra  con¬ 
straints  affecting  the  control  strategy  (under-actuated 
control  system). 

2.  Control  of  flexible 
manipulators 

Due  to  the  complexity  of  the  dynamics  of  manipulators 
with  both  joints  and  links  flexibility,  many  researchers 
have  considered  only  link  flexibility  [2,  4,  5,  6,  7],  or 


only  joint  flexibility  [8,  9,  10,  11,  12],  or  some  special 
case  with  only  one  flexible  link  [13,  14,  15,  16].  How¬ 
ever,  some  have  considered  both  joint  and  link  flexibil- 
ity  [17,  18]. 

Several  general  approaches  to  the  control  of  flexi¬ 
ble  robots  have  been  reported  such  as  feedback  lin¬ 
earization  [19]  which  is  computationally  expensive,  and 
singular  perturbation  techniques  [3]  which  are  appli¬ 
cable  to  manipulators  with  small  flexibility  effects. 
Some  more  specific  methods  have  been  presented  in 
[20],  where  the  transfer  function  approach  was  used 
for  single-link  flexible  arm  control  and  in  [21],  where  a 
pseudo-link  approach  was  employed.  A  method  called 
adaptive  computed  reference  computed  torque  control 
was  proposed  in  [17].  This  approach  will  be  discussed 
here  in  detail. 

A  digital  adaptive  controller  for  a  two-link  flexible 
manipulator  with  one  rigid  link  and  one  flexible  link 
was  given  in  [16].  The  manipulator  operates  in  a  grav¬ 
itational  field  and  a  finite  element  model  with  three 
elements  represented  the  flexible  link.  The  adaptive 
control  algorithm  was  indirect;  that  is,  the  control  law 
at  each  sampling  time  was  based  on  a  prediction  model 
of  the  plant  whose  time-varying  parameters  were  es¬ 
timated  adaptively.  A  composite  adaptive  control  (a 
mixture  of  joint  subsystem-based  adaptive  control  and 
simple  flexible  fixed-gain  feedback)  for  flexible-link  ma¬ 
nipulators  was  presented  in  [5].  Linear  parameteriza¬ 
tion  was  used  to  design  an  adaptive  law  for  identifying 
the  unknown  parameters  (payload  mass)  of  a  flexible 
manipulator.  In  [4]  an  adaptive  controller,  based  on  an 
approximate  set  of  dynamic  equations,  was  developed 
for  a  flexible  manipulator  with  revolute  joints  carrying 
a  rigid  large  uncertain  payload.  Given  these  equations, 
a  globally  stable  tracking  of  the  end-effector  was  estab¬ 
lished.  The  concepts  of  filtered  error  and  a  passivity 
theorem  were  used  in  developing  the  adaptive  law.  Ex¬ 
perimental  results  for  a  two-link  flexible  manipulator 
with  adaptive  nonlinear  control  and  pre-shaping  were 
reported  in  [2].  The  input  pre-shaping  scheme,  which  is 
an  open-loop  controller,  adjusted  the  input  command 
to  the  manipulator  so  that  vibrations  were  eliminated. 
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Hybrid  adaptive  control  algorithms  for  two-link  flexi¬ 
ble  manipulators  were  presented  in  [14].  The  authors 
claimed  that  the  hybrid  adaptive  control  is  superior  to 
pure  continuous-time  or  discrete-time  adaptive  control, 
because  the  infrequent  adjustment  of  the  control  pa¬ 
rameters  makes  it  more  robust  than  these  adaptive  con¬ 
trol  techniques.  The  hybrid  adaptive  control  systems 
operate  in  continuous  time,  while  the  control  parame¬ 
ters  are  updated  only  at  discrete  instants.  In  [6]  a  task- 
level  learning  control  problem  was  considered.  This 
control  problem  can  be  formulated  as  an  online  nonlin¬ 
ear  least  square  parametric  optimization  problem.  The 
learning  was  performed  in  the  course  of  the  normal  op¬ 
eration  of  the  manipulator.  An  adaptive  control  scheme 
for  flexible  joint  robot  manipulators  was  presented  in 
[9],  Asymptotic  stability  was  ensured  regardless  of  the 
joint  flexibility  using  the  passivity  property.  In  [22]  an 
adaptive  variable  structure  scheme  was  proposed  for 
the  control  of  a  flexible  robot  manipulator.  All  of  the 
nonlinear  dynamics  of  the  system  were  all  taken  into 
account  in  the  design  of  the  control  system.  To  allevi¬ 
ate  chattering,  a  phenomenon  that  is  commonly  experi¬ 
enced  in  variable  structure  type  of  control,  a  saturation 
type  adaptive  scheme  was  proposed.  In  [23]  the  adap¬ 
tive  hybrid  control  of  constrained  nonlinear  robots  with 
flexible  links  was  studied.  For  the  analysis  and  the  de¬ 
sign  of  a  controller  for  the  flexible  manipulator,  a  sin¬ 
gular  perturbation  approach  was  used.  According  to 
the  physical  properties  of  a  flexible  manipulator,  a  two 
time-scale  approach,  namely  the  singular  perturbation 
approach,  was  further  utilized  for  analysis  and  general 
controller  design.  For  demonstration  of  the  controller 
performance,  experiments  of  a  two-link  flexible  manip¬ 
ulator  were  performed  for  the  proposed  controller  and 
satisfactory  results  observed.  In  [15]  the  control  of  the 
tip  position  of  a  flexible  manipulator  carrying  variable 
payload  was  considered.  An  approach  to  adaptive  de¬ 
centralized  control  for  flexible  joint  robots  based  on 
adaptive  observer  technique  was  presented  in  [10].  A 
two-level  adaptive  control  scheme  was  used  to  guaran¬ 
tee  the  trajectory  tracking  and  the  stability  of  the  con¬ 
trol  system.  The  problem  of  endpoint  position  control 
for  a  planar  manipulator  which  has  two  very  flexible 
links  was  considered  in  [7].  The  experimental  model 
was  used  in  a  fixed  controller  as  well  as  in  a  self-tuning 
one  with  unknown  mass  at  the  tip  of  manipulator.  The 
design  of  a  hybrid  passive/adaptive  controller  for  a  sin¬ 
gle  flexible  link  manipulator  with  a  payload  mass  was 
discussed  in  [13].  A  robust  adaptive  design  procedure 
for  flexible  joint  manipulators  was  developed  in  [8].  It 
was  claimed  that  the  scheme  guarantees  a  robust  per¬ 
formance  even  if  the  Coriolis,  centrifugal,  and  friction 
terms  in  the  manipulator  dynamics  were  unknown. 


2.1.  Manipulators  with  flexible  joints 
and  links 

A  control  technique  for  the  motion  of  flexible  manip¬ 
ulators  presented  in  [17]  took  into  account  both  joint 
and  link  flexibility.  This  technique  can  be  applied  in 
adaptive  form  if  the  slow  changing  robot  parameters 
were  unknown.  The  method  was  called  adaptive  com¬ 
puted  reference  computed  torque  control.  The  proposed 
method  resulted  in  trajectory  tracking  while  all  state 
variables  remained  bounded.  A  block  diagram  of  adap¬ 
tive  computed  reference  computed  torque  control  for 
trajectory  following  is  shown  in  Figure  1.  The  idea  of 
computed  torque  control  used  for  flexible  manipulators 
in  [17]  was  originally  proposed  in  [24]  for  rigid  manip¬ 
ulators. 

The  following  is  a  brief  description  of  the  technique 
taken  from  [17].  Due  to  link/joint  flexibility  the  num¬ 
ber  of  degrees  of  freedom,  n,  is  larger  than  the  number 
of  actuators,  m.  A  general  model  for  the  dynamics  of 
m-link  flexible  manipulators  with  revolute  joints  can 
be  written  in  the  form 

Hr  =  M ip  +  C(p  +  G  (1) 

where  <p  is  the  generalized  coordinates,  r  E  Rm  is 
vector  of  the  actuator  torques,  H  =  H(<p)  is  the 
distribution  matrix  with  the  full  rank  m,  the  vector 
G  =  G(<p ,  <p,  P)  includes  the  gravity,  damping,  link  and 
joint  flexibility,  M  =  M((p,  P)  is  symmetric  positive 
definite  inertia  matrix,  C  =  C((p ,  (p,  P)  accounts  for  the 
Coriolis  and  centrifugal  forces  and  P  is  the  r-vector  of 
unknown  parameters.  Like  for  the  rigid  manipulator, 
the  matrix  Y  =  M  —  2 C  is  skew-symmetric;  that  is, 
Yij  =  - Yji ,  and  consequently  (pTY  <p  —  0. 

To  eliminate  problems  related  to  the  under- actuated 
systems,  the  generalized  coordinates  can  be  chosen  such 
that  p>k  E  Rm  is  referred  to  as  the  vector  of  the  rigid- 
body  coordinates  and  (pu  E  Rn~m  is  the  vector  of  flex¬ 
ible  coordinates  such  that 

ip  -  Lk<pk  +  Lu(pu  (2) 

where  [Lk  Lu]  is  a  permutation  matrix.  Then,  the  goal 
is  that  ipk  tracks  the  desired  trajectory  <pkd,  while  <pu 
and  (pu  remain  bounded.  If  ek  =  (pkd  —  ipk  is  the  track¬ 
ing  error,  then  the  reference  trajectory,  ipkr,  and  refer¬ 
ence  error,  e*r,  are  defined  as 

•  ipkr  —  < Pkd  +  fcejfc  ekr  =  ipkr  ~  <Pk  (3) 

where  is  a  positive  definite  matrix,  and  y?*.r(0)  = 
<^k(0).  Similarly,  assuming  that  a  bounded  desired  tra¬ 
jectory  (pud  can  be  determined,  and  that  eu  =  <pud  —  <pu 
is  the  associated  tracking  error,  the  reference  trajec¬ 
tory,  (pur ,  and  reference  error,  eur,  can  be  defined  then 
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Adaptive  controller 


Figure  1:  An  adaptive  controller  for  flexible  manipulators. 


as 

Pur  —  find.  4*  & ur  —  'pur  Pu  (4) 

where  \PU  is  a  positive  definite  matrix,  and  <pur(0)  = 
<pu(0).  The  total  reference  trajectory,  y?r,  and  total 
reference  error,  er,  were  given  as 

( Pr  ~  Pkpkr  4"  Lu(pur  (5) 

Sr  —  Pr  P  —  4“  L^Cmr  (6) 

The  control  objective  is  to  find  a  bounded  input,  r, 
such  that  er  and  er  are  bounded  and  er  — *  0  when 
t  —  oo. 

Using  the  linear-in-parameters  property  for  rigid 
manipulators  which  partially  holds  for  flexible  manipu¬ 
lators,  the  right-hand  side  of  Equation  (1)  can  be  writ¬ 
ten  as 

M{<p,  P)(p  +  C(<p ,  <p,  P)ip  +  G(<p,  <p,  P)  = 

W(‘p,ip,'p)P  +  (7) 

where  W  E  Rnxr  and  W\  E  Rn .  The  control  law  was 
given  as 

t(1)  =  ( HtH)~ 1  HT[Mpr  +  C<fir  +  G*  +  I<rer]  (8) 

where  M  —  M(<p,P ),  C  =  C(p>,p,P),  and  Kr  is  a 
positive  definite  gain  matrix.  The  unknown  reference 
trajectory  p>ur  was  determined  from 

Nt  [M pr  +  Cp>r  +  G*  4-  Krer\  =  0  (9) 


where  N  E  Rnx(n  m )  is  a  matrix  of  full  rank  that 
satisfy 

NtH  =  0 

The  new  function  G*  =  G*(<p,  <p,  pr ,  <pr,  P)  was  intro¬ 
duced,  and  it  was  assumed  that  it  was  possible  to  deter¬ 
mine  such  functions  such  that  the  set  (8),  (9)  produced 
a  bounded  solution  for  p>ur  and  r.  As  an  example  G* 
can  be 

G*(p,  p,<pr,<pr,  P)  =  Kpr  +  Bpr  4~  Gn{<p,  P)  (10) 

where  K  and  B  are  respectively  the  symmetric  positive 
definite  stiffness  and  damping  matrices.  If  the  function 
G *  is  linear  in  P,  then  the  error  equation  for  the  closed- 
loop  system  would  be 

Mer  +  Cer  +  G*  -  G  4~  Krer  =  Wr$  (11) 

Wr(‘P,r,‘P>-,V>r,'Pr)^  =  (12) 

(M  -  M)<pr  +(C-  C)<pr  +  (G*  -  G*) 

where  =  P  —  P  is  the  parameters  estimation  error. 

The  proof  of  stability  can  be  obtained  by  choosing 
the  following  Lyapunov  function  candidate 

eg(t)  =  ie^’Me.+Q+h^r-1*  (13) 

where  Q  =  Q(y>,  er ,  P)  is  a  semi-positive  definite  matrix 
and  T  is  a  known  positive  constant  diagonal  matrix. 
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Differentiating  (13)  with  respect  to  time  and  using  the 
skew-symmetric  property,  eJ(M  —  2C)er  —  0,  and  (11) 
results  in 

eg(t)  =  -ejKrer  -R+$T{Wjer  +  r_1$)  (14) 

where  R  —  R(<p,<p,er,er,  P)  >  0  and  if  (15)  is  satis¬ 
fied.  If  G  =  Kip  +  B(p  +  Gn(<p ,  (p,  P),  using  (10),  and 
assuming  ^  =  ^ejKer  and  R  =  ej Ber ,  the  following 
equation  can  be  satisfied. 

eJ(G*  -G)  =  Q  +  R  (15) 

Choosing  the  following  parameter  adaptation  law 

-*(<)  =  >(t)  =  TWjev  (16) 

makes  eg  semi-negative.  This  imply  that  er  — ►  0  when 
t  — +  oo.  However,  it  is  not  guaranteed  that  the  esti¬ 
mated  parameters  P  converge  to  their  true  values  P 
[17].  The  estimated  parameters  converge  to  their  true 
values  if  the  desired  trajectory  is  persistently  exciting; 
that  is,  if  the  matrix  Wd  =  Wr(<pd,  <pdi  Pd,  Pd,  Pd)  is 
persistently  exciting  and  uniformly  continuous  where 
Wr  is  given  in  (12)  with  <p}  <pr  replaced  by  <pd.  This 
means  there  should  exist  positive  constants  a1}  c*2,  f3 
such  that  for  all  >  0  (see  [25]) 

*il<  WjWddt<a2I  (17) 

3.  Concluding  remarks 

Adaptive  control  for  rigid  manipulators  is  well  repre¬ 
sented  in  the  literature  [26]. 

Despite  many  published  works  on  the  adaptive  con¬ 
trol  of  flexible  manipulators  during  the  past  few  years, 
part  of  which  was  discussed  in  this  paper,  its  develop¬ 
ment  seems  to  be  still  in  its  initial  stages.  Significant 
progress  should  be  expected  in  the  next  few  decades 
in  understanding  the  fundamentals  and  in  developing 
practical  strategies  for  such  manipulators  to  be  used  in 
industry  and  elsewhere. 
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Abstract 

To  improve  the  productivity  in  the  harbor,  successful 
development  of  an  ACT(Autonomous  Container  Transp¬ 
orter)  is  needed.  Well-designed  steering  and  velocity 
control  systems  are  the  key  factor  for  the  development  of 
the  ACT.  In  this  paper,  a  research  concerning  the 
achievement  of  the  steering  control  is  introduced.  To  get 
an  information  on  the  guide  line  that  the  ACT  should 
track,  the  vision  system  is  applied.  By  using  neural 
network,  proper  steering  angle  is  gotten  fast  with  less 
influence  of  the  image  disturbance.  A  simulation  based  on 
the  ACT  kinematics  is  performed  with  the  gotten  steering 
angle,  and  it  shows  satisfactory 7  results. 


1.  Introduction 

An  ACT  may  be  defined  as  an  unmanned  vehicle  which 
transport  a  container  from  the  crane  to  the  specific 
position  in  a  harbor.  Recently,  many  concerning  fields  of 
the  harbor  management  automation  are  researched  to 
improve  the  productivity.  As  a  aspect  of  the  automation, 
the  research  on  the  development  of  the  ACT  is  introduced 
in  this  paper.  The  purpose  of  the  research  is  that  by 
successful  development  of  the  ACT,  we  can  transport 
more  cargoes  in  the  given  time,  and  it  is  a  straightway  to 
the  improvement  of  the  productivity  in  the  harbor.  The 
well-designed  ACT  should  not  only  be  stable  but  also 
perform  fast  steering  and  velocity  control.  To  design  such 
controller,  we  have  to  know  the  dynamics  of  the  plant, 
select  proper  control  algorithm,  and  determine  sensors  and 
design  procedures  using  chosen  sensors. 

In  this  paper,  the  selection  of  the  sensors,  which  provide 
basic  operating  signal  of  the  controller  system  and  the 
simulation  based  on  the  both  ACT  kinematics  and  sensor 
signal  is  presented.  The  vision  system  is  used  as  a  state 
sensor  of  the  ACT.  It  costs  less  installation  price,  but  the 
robustness  on  the  image  disturbance  is  not  strong.  To 
overcome  the  disadvantage,  the  neural  network  is  used  to 


evaluate  proper  steering  angle  based  on  the  vision  image 
signals.  It  is  commonly  known  that  well  learned  neural 
network  can  produce  proper  output  under  the  disturbance- 
existing  environment.  With  the  learning  information,  the 
neural  network  makes  good  steering  angle  in  spite  of  the 
damaged  or  noise-effected  guide  line  image. 

This  paper  consists  of  five  chapters:  In  the  chapter  1,  the 
motivation  and  the  purpose  of  this  study  is  shown.  The 
procedures  for  the  evaluating  steering  angle  using  vision 
image  information  and  the  neural  network,  and  the  control 
algorithms  of  the  ACT  are  introduced  in  the  chapter  2. 
Chapter  3  presents  the  simulation  results,  and  finally  the 
conclusion  and  the  further  study  is  discussed  in  the 
chapter  4. 

2.  Steering  Control 

The  vision  system  configuration  used  in  this  paper  is 
shown  in  Figure  1. 


!=□ 


Figure  1.  Block  diagram  of  vision  system 

The  detail  specification  of  each  system  components  is  as 
follow: 
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The  detail  specification  of  each  system  components  is 
as  follow: 


Table  1.  specification  of  each  system  components 


CCD  Camera 

Monochrome  type  by  COSMICAR,  Japan 

Lens 

Autoiris  by  COSMICAR,  Japan 

Vision  Board 

MVB03,  Korea 

Resolution 

256  gray  level,  640x480  pixel 

Host  PC 

Pentium  MMX-233MHz:  32MB  RAM 

Host  PC  can  perceive  the  deviation  of  the  ACT  from 
the  normal  driving  trajectory  by  analyzing  the  vision 
information  which  contains  the  image  of  the  white  guide 
line.  This  is  shown  in  Figure  2.  With  the  Figure  2?  we 
can  know  that  two  parameters  are  needed  to  express  the 
driving  state  of  the  ACT  :  r  and  0 

r :  lateral  direction  deviate  distance  of  ACT 
0  :  moving  direction  of  ACT 


Figure  2.  Case  of  current  guide  line  deviating 
the  center  point  of  vision  system 


The  image  processing  algorithms  are  designed  to  make 
the  ACT  go  along  with  guide  line  by  below  procedures. 

1st  procedure  :  Thresholding  of  gray  level 

To  extract  the  information  of  the  guide  line  from  the 
captured  image,  the  binarization  and  the  segmentation  on 
the  image  is  a  must.  But  the  determination  of  the  proper 
threshold  value  is  hard  because  it  heavily  depends  on  the 
weather  and  the  environmental  condition. 

To  solve  this  problem,  the  threshold  value  depends  on 
the  ratio  of  (the  area  of  the  guide  line  in  the  whole 
captured  image)  :  (the  whole  area  of  the  captured  image.) 
For  example,  if  guide  line  occupies  15%  area  of  the 
image,  the  threshold  value  is  set  to  85%  of  the  intensity7 
value. 


(a)  Origin  image 


(b)  Histogram  of  origin  image 
Figure  3.  Input  image(gray  level)  and  histogram 

2nd  procedure  :  The  segmentation  of  the  sub-block 

The  block  segmentation  is  applied  to  the  binarized 
image.  The  reason  of  the  segmentation  is  that  the 
computing  time  of  the  whole-image  processing  is  much 
longer  than  that  of  the  segmented  image. 

The  segmentation  is  not  applied  to  the  whole  image, 
but  to  the  highest/lowest  and  the  middle  parts  of  the 
image.  By  doing  so,  enough  information  to  determine  the 
steering  angle  is  taken.  We  set  up  the  block  number  of 
the  highest/lowest  parts  of  the  image  to  have  double  size 
amount  of  information  compare  to  the  middle  block 
number.  Because  the  key  factors  which  determine  the 
incline  of  the  guide  line  is  start/end  point  of  the  guide 
line,  which  are  allocated  at  the  highest/lowest  parts  of  the 
image. 

Full  size  of  the  captured  image  has  a  resolution  of 
640x480  pixels.  We  set  the  highest/lowest  parts  of  the 
block  to  be  a  resolution  of  20x20  pixel,  and  the  middle 
parts  of  the  block  to  be  a  40x40  pixel.  And  the  width 
of  the  guide  line  is  approximately  80  pixel. 
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Figure  4.  The  segmentation  of  the  sub-block 

3rd  procedure  :  The  binalizution  of  the  segmented 
blocks 

In  Figure  5,  total  80  blocks  are  binarized.  For  each 
block,  if  the  white  pixels  occupy  more  than  50%  of  a 
block  area,  then  the  block  is  assigned  to  be  one(that  means 
the  block  is  totally  considered  white),  else  to  be  zero(that 
means  the  block  is  totally  considered  black).  As  a 
consequence,  80  length  of  input  bit  sequence  is  obtained. 
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Figure  5.  The  binalization  of  the  segmented  blocks 

4th  procedure :  Input  to  the  neural  network 

Now,  80  input  bit  stream  is  acted  as  an  input  of  the 
neural  network.  To  get  a  steering  angle  from  the  captures 
image,  the  neural  network  is  used  as  a  'lookup-table',  and 
the  structure  of  the  neural  network  is  shown  in  the  Figure 
6. 

The  obtained  image  can  be  classified  as  two  classes: 
one  is  what  passes  the  center  of  the  camera,  and  the  other 
is  what  passed  away  from  the  center  of  the  camera. 


Input  Layer 
Layer  (10  nodes) 
(80  nodes) 


Figure  6.  The  architecture  of  NN 


Table  2.  Structure  of  NN 


Input 

3  layer  NN 

Input  Node 

80  nodes 

Hidden  Node 

10  nodes 

Output  Node 

1  nodes 

We  can  consider  that  non-center  passing  image  as  a 
center-passing  image  with  a  distance  r  from  the  center  of 
camera,  so  we  can  express  the  input  to  the  steering  part  as 
below: 

Input  =  6  ,  r 

where  the  6  is  a  steering  angle  evaluated  by  the  neural 
network,  and  r  is  a  distance  from  the  center  of  the  camera 
to  the  image  of  guide  line. 

With  above  configuration,  all  possible  cases  of  the 
guide  line  images  are  29.  That  means,  from  the  1st 
case(the  image  is  allocated  from  the  top-left  to  the 
bottom-right  of  the  camera)  to  the  29th  case(the  image  is 
allocated  from  the  top-right  to  the  bottom-left  of  the 
camera)  can  be  exist.  We  only  take  into  account  the  12th, 
13th,  14th,  15th,  16th,  17th,  and  18th  image  under 
consideration  of  the  actual  operating  angle  range  of  the 
front-wheel,  and  corresponding  angle  is  14.036(deg), 
9.462(deg),  4.763(deg),  0(deg),  -4.763(deg),  -9.462(deg)’ 
and  -14.036(deg)  respectively. 

The  angle  is  evaluated  by  the  next  formula: 

9  =  atan(x/y) 

x=640-(i-l+Width)  X  640/Column-(i-l)  X  640/Column 
Width=4,  Column=16,  M2...18 
y-480 

5th  procedure :  Steering  control  of  ACT 

consi-dering  kinematics  analysis 

The  ACT  knows  the  angle  of  inclination  between  road 
trajectory  and  moving  direction  of  ACT  and  the  lateral 
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direction  from  image  processing  and  NN.  Using  these 
informations,  we  need  the  steering  control  algorithm  of 
the  ACT  to  track  working  trajectory.  So,  in  the  procedure 
the  movement  of  ACT  using  in  the  harbor  may  abtain  the 
motion  equation  of  ACT  from  the  kinematics  analysis. 
And  the  ACT  is  four-wheel  driving  and  front-wheel 
steering  system(Figure  7). 


Table  3.  Coordinates  of  ACT 


X,  Y 

Vehicle  position 

a 

Vehicle  orientation 

8  1 

Left  front  wheel(LFT)  direction 

82 

Right  front  wheel(RFT)  direction 

61 

Angle  deflection  each  wheel(/- 1 ,  •  •  •  ,4) 

Figure  7.  Schematic  of  ACT  Model 


We  can  abtain  9  coordinates(Table  3)  for  kinematics 
analysis  of  the  movement  of  ACT  from  Figure  7,  then 
each  coordinate  has  dependent  relationship.  And  these 
coordinates  satisfy  the  follow  assumptions  to  abtain  the 
constraint  equations. 

1 .  We  ignore  dynamic  effect  of  ACT  such  as  the  slip  of 
lateral  direction  and  the  distortion  of  tire. 

2.  Actually  the  movement  of  ACT  constructs  rolling, 
pitching,  and  yawing,  But  yawing  is  influenced  more 
then  rolling  and  pitching.  So,  we  ignore  rolling  and 
pitching. 


L  tan  (j>  j  tan  <j)  ,  v  ' 

Constraint  Equation  2  :  Center  of  each  wheel  rotate  the 
same  angular  velocity  at  the  point  O. 

<9,  sin  ^  =  <9,  sin  <j)2  =  03  tan  ^  =  #4  tan  <f>2  (2) 

And  we  can  abtain  the  follow  three  constraint 
equations  by  the  condition  that  don't  occurred  the  slip  of 
lateral  direction  in  the  tire 


x  =  v/}  cos  (a  +  (/)) 

(3) 

y  =  vp  sin(a  +  ^) 

(4) 

ii 

(5) 

Using  the  these  constraint  equations(Eq.(l)~Eq.(5)), 
we  can  abtain  the  motion  equations. 

x  =  iv,.  cos(  J lov,  sin  M  +<l>)dt  (6) 

y  =  sin(  j  f'v;,  sin  <j>dt  +  </>)dt  (7) 


3.  Simulation 

We  simulate  guide  line,  lateral  direction  is  trapezoid 
type,  tracking  of  ACT.  Information  of  guide  line  measures 
the  lateral  direction  distance  and  angle  of  inclination  by 
the  process  as  the  Figure  8.  Then  these  values  are  the 
tracking  errors  of  the  ACT.  Using  these  errors,  Figure  9 
and  10  are  results  of  block  diagram(Figure  11),  where 
Figure  9  displays  the  desired  path  and  the  actual  path.  And, 
Figure  10  is  the  angle  variation  of  ACT  handle.  Then, 
tracking  trajectory  compensates  for  using  the  PID 
controller. 

4.  Result 

In  this  paper,  a  research  concerning  the  achievement  of 
the  steering  control  was  performed.  To  get  an  information 
on  the  guide  line  that  the  ACT  should  track,  the  vision 
system  was  applied.  By  using  neural  network,  proper 
steering  angle  was  gotten  fast  with  less  influence  of  the 
image  disturbance.  A  simulation  based  on  the  ACT 
kinematics  was  performed  with  the  gotten  steering  angle, 
and  it  showed  satisfactory  results. 


From  these  assumption,  we  induce  the  follow 
constraint  equations  and  simplify  2  D.O.F  system  about 


the  ACT. 


Constraint  Equation  1  :  Point  O  is  0  velocity  point. 
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Path  (in) 


Figure  8.  Flowchart 
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Figure  9.  Path  Tracking  Trajectory  Figure  10.  Angle  Variation  of  ACT  Handle 


Figure  11.  Block  Diagram  of  ACT  system 
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Abstract 

An  intelligent  robotized  wine  glass  palletizing  workcell 
has  been  developed  for  $i§ecam  Inc .  by  Altinay 
Robotics  and  Automation  Inc.  The  hardware  and 
software  design  of  this  mechatronic  system  includes 
selecting  right  mechanical  and  electronic  components 
which  are  combinations  of  different  products  and 
creates  a  solution  suitable  to  every  situation.  The 
robotic  cell  consists  of  two  gantries ,  two  4  axes  SCAR  A 
type  robots ,  five  servo  controlled  and  two  inverter 
controlled  conveyors ,  pallet  and  carton  separator 
storage  and  retrieval  units.  Also,  it  includes  a  quality 
control  station  for  eliminating  non-standard  products. 

1.  Introduction 

Robotics  is  defined  as  the  theory  and  practice  of 
automation  of  tasks  which  were  thought  to  be  reserved 
for  a  man.  Thus,  replacing  human  tasks  which  are  dull, 
repetitive,  hazardous,  or  beyond  the  capability  of  human 
physical  tolerance  with  robots  is  the  ultimate  goal  of 
robotics  research  [1].  During  the  last  30  years  robots 
became  a  widely  used  and  well  accepted  technology. 
After  80fs,  they  spread  to  nearly  all  industrial 
production  areas.  Today,  there  are  711,500  robots[2] 
worldwide.  They  are  used  mostly  for  arc  welding,  spot 
welding,  coating  and  material  handling. 

Material  handling  is  one  of  the  most  common 
applications  for  robots  today.  Robots  are  well  suited  for 
material  handling  operations  in  industrial  environments. 
Loading  parts  and  placing  them  onto  the  pallets  is  called 
palletization  and  unloading  products  from  a  pallet, 
called  depalletizing.  Palletizing/Depalletizing  are 
essential  operations  in  the  goods  receiving/shipping 
process  of  a  variety  of  industries[3].  Glass  industry  is  a 
good  candidate  for  robotic  palletizing  because  of  the 
repetitive  nature  of  the  job  and  high  cycle  time 
requirements.  Therefore,  the  objective  of  this 
automation  project  has  been  the  design  of  a  multiple 
robot  system  in  order  to  replace  the  manual  palletizing 


process  while  reducing  labor  requirements  and 
improving  labor  conditions. 

The  constraints  of  the  project  are:  1-)  The  system 
should  be  capable  of  handling  100  glasses  per  minute 
and  2-)adaptation  process  for  a  new  product  type  should 
be  very  short  and  simple. 

The  workcell  consists  of  four  robots.  This  makes  the 
system  cooperative.  Multiple  robots  performing  tasks 
together  in  a  cooperative  manner  have  advantages  over 
a  single  robot  just  as  a  human  using  two  arms  has  an 
advantage  over  one  that  uses  one  arm  and  as  multiple 
humans  have  an  advantages  over  a  single  human. 
Cooperative  robots  have  many  potential  applications  in 
assembly  and  manufacturing.  The  applications  can  be 
grouped  as: 

•  Every  robot  shares  a  part  of  the  task  in  the  process: 
If  one  robot  fails,  the  others  can  compensate  for  it. 
Therefore,  the  reability  of  the  manufacturing 
process  increases  and  process  time  decreases. 

•  Robots  work  on  the  same  object:  Carrying  a  heavy, 
flexible  or  long  object  requires  more  than  one 
robot. 

•  Robots  disassemble  or  assemble  mating  parts:  Each 
robot  arm  holds  a  different  object.  Complex 
assembly  is  essentially  a  two  hand  job,  and  can  be 
performed  better,  by  a  pair  of  robots  working 
together. 

The  designed  workcell  falls  into  the  first  category 
such  that  every  robot  shares  a  part  of  the  task.  If  there 
is  a  problem,  the  system  tries  to  organize  itself 
according  to  the  problem,  such  that  increasing  the  speed 
for  compensating  the  failing  part,  or  carrying  glasses  to 
the  bufferzone,  or  buffering  the  glasses  into  two  rows. 

The  system  is  also  intelligent,  it  keeps  the  necessary 
data  in  its  database  to  adapt  itself  automatically  in  case 
of  a  product  change.  The  only  thing  that  remains  to  the 
operator  is  to  select  the  product  type  from  the  operator 
terminal  menus,  and  to  place  a  few  sensors  using 
specially  designed  interfaces  for  easy  configuring. 
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Figure  1.  Overview  of  the  designed  workceil 


2.  System  architecture 

The  complete  mechanical  system  (Fig.l)  is  designed 
using  I-DEAS  and  DENEB  software.  The  mechanical 
design  of  the  robotic  cell  consists  of: 

2.1.  Determination  of  the  components  of  the 
workcell. 

AKR-XYZ:  Three  axes  gantry  robot  (Fig.  2)  picks 
glasses  from  the  cooling  oven  conveyor,  and  places 
them  onto  the  L-conveyor  while  tracking  the  speed  of 
the  conveyor.  If  a  failure  occurs  on  the  system,  AKR- 
XYZ  places  the  glasses  into  a  bufferzone,  after 
recovering  the  problem,  the  robot  loads  glasses  back 
from  the  bufferzone  to  the  L-conveyor. 

COBRA-l&2:Two  four  axes  SCARA  type  ADEPT 
robots  (Fig.  3-4)  pick  the  wine  glasses  from  L- 
conveyor  while  tracking  the  conveyor  and  place  them 
onto  the  position  controlled  conveyors.  One  COBRA  is 
capable  of  handling  all  the  production.  However,  using 
two  COBRAs  makes  the  system  more  flexible,  reliable 
and  the  mechanical  life  of  the  robots  will  be  longer. 

POSCON-la&lb&2a&2b:  There  are  four  position 
controlled  buffering  conveyors  (Fig.4).  COBRA- 1 
places  the  glasses  onto  the  POSCON-la  and  lb. 
COBRA-2  works  with  the  POSCON-2a  and  POSCON- 
2b.  The  robots  place  the  glasses  always  to  the  same 
point,  and  send  a  signal  that  the  glass  is  on  the 
conveyor. 


Figure  2.  AKR-XYZ:  Three  axes  gantry  robot 


Figure.3.  Four  axes  SCARA  type  robot 
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POSCON  proceeds  a  step  whose  length  is  a  function 
of  the  product  type. 

After,  the  accumulated  glasses  reach  a  certain 
number,  POSCON  proceeds  a  large  step  to  place  the 
group  of  glasses  to  the  end  of  the  conveyor,  that  AKR- 
XZ  can  pick  them.  If  there  is  a  jam,  it  is  possible  to 
place  two  rows  of  glasses  on  a  single  POSCON. 

AKR-XZ:  Two  axes  gantry  robot  (Fig.  5)  which 
picks  the  equally  placed  wine  glasses  from  the  position 
controlled  conveyors  and  places  them  onto  the  pallets,  it 
should  also  put  carton  separators  between  layers. 

PIM:  A  pallet  storage  and  retrieval  mechanism  for 
taking  a  pallet  from  the  pallet  storage  unit  and  placing  it 
to  the  filling  station.  When  the  robot  fills  the  pallet,  it 
will  carry  the  filled  pallet  to  the  stretching  station.  It  has 
to  communicate  with  AKR-XZ. 

2.2.  Designing  robotic  hands  for  AKR-XYZ, 
AKR-XZ,  COBRA1&2 

AKR-XYZ:  4  degrees  of  freedom  pneumatic  gripper 
is  designed  for  this  robot  (Fig.  6a).  There  are  10  sensors 
for  check  hand  status.  It  could  handle  disordered  glass 
groups. 

AKR-XZ:  very  flexible  and  adjustable  mechanic 
hand  (Fig.  6b)  is  designed  for  this  robot.  It  is  possible 
to  pick  up  to  24  wine  glasses  with  this  hand.  Also, 
carton  separators  can  be  picked.  It  is  equipped  with  3 
sensors. 

COBRA-1  &2:  have  vacuum  grippers  (Fig.  6c)  and 
sensors  to  check  the  status 


Figure  5.  AKR-XZ:  Two  axes  gantry  robot 


2.3.  Selecting  conveyors 


2.4.  Interfaces  and  storage  units 


Belt  conveyors  for  POSCON,  roller  conveyor  for 
pallet  retrieval  unit,  plastic  panel  belt  conveyors  for  L- 
conveyor  and  rib-flex  conveyor  for  bufferzone  are 
chosen. 


-robots'  base,  carton  storage  units 
-sensor  connection  interfaces  -adjustable  measuring 
systems  for  quality  control  station  are  designed. 


Figure  6.  Designed  robotic  hands  for  AKR-XYZ(a),  AKR-XZ(b),  COBRA 
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Figure  7.  Hardware  overview 


3,  Software  architecture  and  control 

The  software  is  the  key  element  to  make  the 
designed  workcell  flexible,  reliable  and  robust  under 
new  manufacturing  conditions.  Decentralized  hardware . 
(Fig.  7)  and  software  configurations  (Fig.  8)  are 
designed  for  optimum  performance.  A  user-friendly 
man-machine  interface  is  developed  on  an  industrial 
PC.  Each  robotic  unit  has  its  own  controller,  but  a 
multiprocessor  controller  is  the  brain  of  the  system. 


There  are  total  17  parallel  processes  running  on  the 
system.  Communications  between  different  processes 
are  possible  via  signals  and  global  variables.  The 
software  configuration  is  shown  in  Fig.8. 

The  palletizing  process  starts  after  determining 
number  of  rows,  columns  and  layers  in  a  pallet.  The 
operator  may  select  the  values  suggested  by  the 
computer  which  is  based  on  maximum  possible  glasses 
can  be  placed  in  a  pallet  or  he  can  enter  values 
depending  on  the  customer  or  the  transportation  needs. 


Product  Palet  Robot 

Parameters  Parameters  Parameters 


Figure  8.  Overview  of  the  software  scheme 
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According  to  the  glass  model  and  the  pallet  data, 
man-machine  interface  computer  calculates  all  the  robot 
pick  and  place  coordinates  and  downloads  those  data 
via  serial  communication  ports  or  via  Ethernet  to  the 
robot  controllers  and  one  axis  controllers  of  the 
conveyors. 

3.1.  Man-machine  interface 

A  menu  driven  graphical  user  interface  (Fig.  9)  is 
developed  on  the  industrial  PC. 


Figure  9.  Menu  structure  of  man-machine  interface 
program 

Operator  commands  are  interpreted  by  the  task 
interpreter  and  converted  to  the  robot  controller 
commands.  The  operator  can  monitor  several  robot 
parameters  and  also  physical  I/O  values  and  alters  them, 
he  can  also  change  the  robot  parameters  such  as  pick 
and  place  points,  he  can  select  a  new  product(glass), 
according  to  this  new  information  all  the  robot  points 
will  be  adapted  automatically.  He  can  also  create  or 
update  pallet  parameters  and  after  his  confirmation  the 
system  again  organize  itself.  All  the  confirmed  data  are 
stored  in  a  database  such  that  in  the  future,  the  operator 
only  selects  that  record  for  the  same  product  and  the 
pallet. 

3.2.  Robot  controllers 

Every  robot  has  its  own  industrial  controller.  The 
low  level  part  of  the  software  is  written  on  the  industrial 
controllers  using  their  own  languages. 


level  controls  for  gantries  and  communication  between 
operator  panel  by  ADEPT  MV  controller. 

3.2.2.  HNC-P384  controller.  Gantry  robots  have  Hirata 
HNC  controllers.  These  controllers  are  simple  point-to- 
point  controllers,  they  need  higher  level  controllers 
above  them. 

3.3.  One  axis  controllers 

POSCON  la&lb&2a&2b  have  one  axis  servo 
controller  UNIDRIVE  with  UD70  position  control  unit 
from  Control  Techniques  Inc..  L-conveyor  has  also 
UNIDRIVE  servo  controller  without  UD70,  position 
controller.  The  conveyors  of  the  bufferzone  and  pallet 
storage  and  retrieval  mechanism  are  inverter  controlled. 

3.4.  Conveyor  tracking 

The  three  axes  gantry  robot  (AKR-XYZ)  and 
SCARA  robots  (COBRAs)  have  conveyor  tracking 
capability.  To  determine  the  instantaneous  position  and 
speed  of  a  belt,  the  conveyor  is  equipped  with  an 
encoder  to  measure  its  position  and  speed.  Conveyor 
tracking  help  robot  to  pick  and  place  glasses  from  the 
belt  with  the  speed  of  the  belt  without  disturbing  the 
balance  of  the  glasses. 

3.5.  Coordinator 

This  high  level  controller  commands  to  the  system 
components  how  to  behave  if  there  is  a  problem  or  a 
jam  in  the  system. 

Problem-1:  One  of  the  SCARA  robots  has  a  failure 

The  other  SCARA  robot  compensate  the  failure  and 
loads  all  the  glasses  from  L-conveyor  to  its  own 
position  controlled  conveyors.  To  prevent  jam,  AKR- 
XYZ  (three  axes  gantry  robot)  picks  the  glasses  from 
the  cooling  oven  and  replaces  some  of  them  onto  the 
bufferzone  and  some  of  them  onto  the  L-conveyor 
according  to  the  ratio  selected  by  the  operator. 


3.2.1.  ADEPT  MV  controller.  The  Adept  MV  series 
controllers  are  based  on  the  VME  bus  specification.  The 
controller  used  for  this  project  has  dual  68040 
microprocessor.  This  controller  is  the  heart  of  the 
system.  The  MV  Controller  is  suitable  to  implement 
flexible  robot  cells  because  of  its  open  architecture.  It  is 
possible  to  control  two  SCARA  type  4  axes  robots,  a 
pallet  retrieval-storage  unit,  conveyors  and  to  do  higher 


Problem-2:  Two  of  the  SCARA  robots  have  failure 

L-conveyor  will  be  stopped.  AKR-XYZ  picks  all  the 
glasses  from  the  cooling  oven  and  replaces  them  onto 
the  buffer-zone.  When  the  problem  is  eliminated  it  will 
load  glasses  from  the  cooling  conveyor  and  buffer-zone 
according  to  the  ratio  determined  from  the  operator  to 
the  L-conveyor.  The  speed  of  the  L-conveyor  will  be 
increased  in  this  case. 
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Problem-3:  AKR-XZ  has  a  problem 


4.  Conclusion 


AKR-XYZ  picks  all  the  glasses  from  the  cooling 
oven  and  replaces  them  onto  the  bufferzone.  SCARA 
robots  continue  to  work  until  their  conveyors  are  filled. 
When  the  problem  is  eliminated  the  glasses  in  the 
bufferzone  will  be  unloaded  like  in  the  previous  case. 

Problem-4:  AKR-XYZ  has  a  problem 

The  end  of  the  system  continues  to  work  until  all  the 
glasses  are  palletized  and  waits  till  AKR-XYZ  is 
available  again. 

Problem-5:  One  of  the  position  controlled  conveyors 
has  a  problem 

SCARA  robots  will  not  use  that  conveyor  and 
continue  to  work  with  the  others  until  it  is  available 
again. 

3.7.  Quality  control  station 

The  variance  in  height  of  the  wine  glasses  will  be  a 
problem  for  palletizing.  If  they  are  higher  than  the 
standard  they  may  be  broken  when  a  higher  layer 
comes.  If  they  are  shorter,  it  is  possible  that  robots 
could  not  pick  it.  To  prevent  unacceptable  variance  in 
the  height  of  the  wine  glasses  a  quality  control  station 
was  developed.  Two  optical  sensors  determine  if  the 
glass  is  standard  and  if  it  is  a  nonstandard  product,  it 
will  not  be  processed  and  system  let  it  go  to  the  recycle 
pool. 

3.8.  Safety 

The  fence  system  for  the  safety  is  standard  in  all  the 
industrial  robotic  workcells.  For  this  palletizing 
workcell  also  a  fence  system  is  designed  considering 
the  maximum  safety  of  the  people  and  the  ergonomy 
for  the  operator. 

For  the  safe  operation,  the  parts  of  the  system  such 
as  robotic  hands  which  have  interaction  with  the 
environment  have  equipped  with  touch  sensors  in  order 
to  prevent  them  to  insert  excessive  forces. 


A  fully  automatic  system  for  palletizing  of  wine 
glasses  has  been  designed  and  developed  by  Altinay 
Robotics&Automation  Inc.  This  research  project  has 
had  a  total  duration  of  one  year.  It  proves  that  new 
robotic  technology  can  be  introduced  to  the  industry 
with  good  results.  The  main  contribution  of  this 
developed  system  is  its'  flexibility  to  product  type 
changes,  and  fault  tolerance  capability  against 
manufacturing  problems. 
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Abstract 

For  a  manipulator  operating  in  a  hazardous  or  remote  envi¬ 
ronment,  an  important  concern  is  its  capability  after  a  component 
failure,  since  retrieval  or  repair  is  not  always  possible.  Methods 
have  been  presented  in  the  literature  for  optimizing  capabilities 
after  specific  types  of  failures.  However,  techniques  for  achieving 
failure  tolerance  when  conversion  between  failure  types  is  possible 
has  not  been  fully  explored.  This  article  presents  an  approach  to 
improving  postfailure  performance  by  converting  between  locked- 
joint  failures  and  free-swinging  failures  through  active  braking. 
When  a  manipulator  is  moved  slowly,  gravitational  forces  can  be 
used  to  control  the  failed  joint  in  free-swinging  mode,  allowing 
the  problem  to  be  cast  as  a  kinematic  one. 

I.  Introduction 

Failure  tolerance  for  robotic  manipulators  has  been 
widely  addressed,  often  with  a  focus  on  using  kinematic 
redundancy.  A  type  of  failure  often  addressed  is  one 
resulting  in  a  joint  locking.  Methods  for  designing  a 
manipulator  with  a  desired  postfailure  workspace  or  a 
desired  level  of  dexterity  after  a  locking  failure  have 
been  presented  [1,  2,  3],  as  have  general  methods  for 
preparing  kinematically  redundant  manipulators  for  the 
possibility  of  a  failure  [4,  5,  6].  Similar  issues  have  been 
addressed  for  free-swinging  failures  (those  for  which  ac¬ 
tuator  torque  is  lost).  Methods  of  preparing  for  a  free- 
swinging  failure  were  presented  in  [7,  8],  and  a  study 
of  postfailure  capabilities  related  to  design  was  pre¬ 
sented  in  [9].  These  techniques  will  be  combined  here 
to  maximize  the  postfailure  capability  of  a  manipulator 
by  choosing  between  locked  and  free-swinging  modes  of 
operation  and  using  gravity  to  control  the  free-swinging 
joint  when  the  brakes  are  released.  The  two  failure 
modes  are  illustrated  in  Fig.  1. 

Considerable  work  has  been  done  addressing  the 
related  issue  of  controlling  a  manipulator  with  free- 
swinging  joints  using  dynamic  coupling,  both  with 
brakes  and  without.  In  [10],  conditions  of  integrabil- 

This  work  was  supported  by  a  NASA  graduate  student  re¬ 
search  fellowship  (grant  number  NGT9-2)  and  by  Sandia  National 
Laboratories  under  contract  number  AL-3011. 


Free 

Swinging  Locked 


Fig.  1 .  A  three-link  planar  positioning  manipulator  is  shown 
tracing  a  line  with  joint  one  free-swinging  (left)  and  locked  (right) . 
In  the  free-swinging  case,  the  arm  traces  the  line  while  maintain¬ 
ing  the  composite  center  of  mass  of  the  arm  directly  below  joint 
one,  while  in  the  locked  case,  the  arm  traces  the  line  without  mov¬ 
ing  joint  one.  This  article  will  present  ways  of  using  and  switching 
between  these  two  failure  modes,  by  activating  and  releasing  the 
brakes  on  the  joint  whose  actuator  has  failed,  to  optimize  post¬ 
failure  performance. 

ity  (i.e.,  reduction  to  a  holonomic  constraint)  for  free- 
swinging  joints  were  investigated  and  methods  of  stabi¬ 
lizing  manipulators  without  brakes  on  equilibrium  man¬ 
ifolds  were  presented.  In  [11,  12]  active-braking  meth¬ 
ods  were  presented  based  on  dynamic  coupling.  Oper¬ 
ational  space  methods  were  developed  in  [12].  Optimal 
choices  of  braking  sequences  for  multiple  free-swinging 
joints  were  given  in  [13],  and  robust  methods  were  pre¬ 
sented  in  [14].  These  articles  focus  on  dynamically  con¬ 
trolling  passive  joints  about  equilibrium  points,  not  on 
finding  the  equilibrium  points,  and  not  on  exploiting 
gravitational  forces. 

This  article  addresses  a  complementary  issue:  that 
of  controlling  passive  joints  using  gravity  alone.  Meth¬ 
ods  are  presented  for  converting,  through  repeatedly 
activating  and  releasing  individual  joint  brakes,  be¬ 
tween  locked-joint  failures  and  free-swinging  failures 
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for  manipulators  under  gravitational  forces.  Manipula¬ 
tors  used  in  hazardous  and  remote  environments — those 
where  failure  tolerance  is  likely  to  be  an  issue — are  usu¬ 
ally  slow  moving,  and  this  slow-moving  assumption  al¬ 
lows  the  gravitational-force- based  control  problem  to  be 
cast  as  a  kinematic  one. 

The  organization  of  this  article  is  as  follows:  Sec¬ 
tion  II  will  present  the  swing  angle,  a  key  kinematic 
parameter.  <  Then  Section  III  will  give  velocity- control 
methods  for  both  free-swinging  and  locked  joint  fail¬ 
ure  modes;  Section  IV  will  look  at  ways  to  best  switch 
between  failure  modes  by  activating  and  releasing  the 
joint  brakes;  and  Section  V  will  illustrate  the  methods 
through  an  example. 

II.  The  Swing  Angle 

Let  an  n-degree-of-freedom  manipulator’s  joint 
variables  be  given  by 

q  =  [?i  ?2  •  •  •  qn]T-  (l) 

A  fundamental  kinematic  parameter  for  the  develop¬ 
ments  in  this  article  is  the  swing  angle.  The  swing  angle 
0{  is  defined  as  the  angle  through  which  joint  i  would 
move  to  settle  into  a  stationary  configuration  if  its  ac¬ 
tuator  torque  were  removed  [8].  For  a  viscous  friction 
model,  the  swing  angle  equals  the  smallest-magnitude 
angular  difference  between  the  actual  value  of  joint  i 
and,  with  the  other  joints  fixed,  a  value  that  gives  the 
lowest  potential  energy  (with  the  value  set  to  7 r  when 
the  smallest  magnitude  angular  difference  is  ±7r).  The 
viscous-friction-model  swing  angle  will  be  used  here. 

A.  The  Value  and  Gradient  Calculations 

To  establish  the  swing  angle,  let  M{  be  the  compos¬ 
ite  mass  of  links  i  through  n,  calculated  independently 
of  joint  type  as  follows: 


Let 

«»,  1  =  2.-1  •(<?  X  Ci)>  (4) 

and 

2  (^i— 1  %i— l)  *  (£»  — 1  X  g ),  (5) 

where  g  is  the  upward  pointing  gravity  vector.  Then 
provided  neither  nor  g  is  parallel  to  2,-_1;  these 
allow  the  swing  angle  to  be  calculated  as 

Oi  =  Atan2[uiii,uii2],  (6) 

where  the  range  of  Atan2  is  ( — 7r,  x],  If  either  or  g  is 
parallel  to  £,•_ i,  all  values  of  ?,•  give  the  same  potential 
energy,  and  thus 


Oi  =  0.  (7) 

If  s*_l  or  g  is  parallel  to  i,_i  or  0,-  =  x,  the  gradient 
of  §i  is  either  0  or  undefined.  Otherwise,  from  (6),  the 
entries  of  V #,■  are  calculated  as  follows: 


dOi  _  1  fduiA  dui  2  \ 

%  +  «i,2  V  dq,  Ui'2  dqj  Ui’1)  ■ 


(8) 


Methods  for  calculating  and  are  given  in  [8]. 


B.  Dynamics  at  Low  Velocities 

Inherent  in  the  definition  of  the  swing  angle  0*  is 
the  requirement  that  the  joints  excluding  joint  i  be  sta¬ 
tionary.  In  much  of  the  remainder  of  this  article,  the 
swing  angle  will  be  applied  to  a  moving  manipulator  as 
an  approximation.  A  discussion  of  the  quality  of  this 
approximation  will  be  given  here. 

For  a  viscous  friction  model,  the  differential  equa¬ 
tion  governing  the  motion  of  the  arm  is  expressed  as 


Mi  =  mi  -f  Mi+ 1;  Mn  =  mn.  (2) 

Here  m,-  is  the  mass  of  link  i.  Using  these  values, 
the  composite  first-moment-of-inertia  vector  s*  as  ex¬ 
pressed  in  the  ith  D-H  frame  can  be  calculated  using 

s{*  —  *R*+i(st-+1  +  Si+i  +  Mi+ipi^i+x);  =  0,  (3) 

where  lHi+i  is  the  3x3  rotation  matrix  representing 
D-H  frame  i  +  1  in  frame  i\  st  is  the  first-moment-of- 
inertia  vector  for  link  i  referred  to  and  expressed  in  its 
own  D-H  frame;  and  is  the  vector  from  the  origin 
of  D-H  frame  i  to  the  origin  of  frame  expressed  in 
frame  t. 


r  =  M(q)q  +  C(q,q)q  +  V(q)q  +  g(q).  (9) 

Here,  r  is  the  vector  of  applied  joint  torques;  M(q)  is 
the  manipulator  inertia  matrix;  C(q,  q)  is  the  matrix 
specifying  centrifugal  and  Coriolis  effects,  each  row  i 
of  which  has  the  form  qTC*-(q);  V(q)  is  the  viscous- 
friction  matrix;  and  g(q)  is  the  vector  of  joint  torques 
due  to  gravity.  It  is  assumed  that  the  manipulator’s 
control  law  (feedback  through  r)  allows  accurate  track¬ 
ing  of  a  prescribed  trajectory  for  the  healthy  joints. 

Let  joint  i  be  free  swinging  and  let  e*  be  defined 
such  that 

Qi  —  Qi\qi=0  “h  €i.  (10) 
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Then,  assuming  the  healthy  joints  are  stationary,  (9) 
gives,  through  entry  i  of  the  vector  equation, 

0  =  Mali  +  Vn(ei)ei  +  x  Vi||pi-i  x  g\\  sin(e,), 

.  .  (U) 

where  the  following  were  used:  r*  =  0  (because  joint  i  is 
free  swinging),  Mu  is  not  a  function  of  ,  and  entry  i,  i 
of  C i  is  zero.  If  joint  i  is  free  swinging  with  the  motor 
engaged,  Mu  and  Vu  should  reflect  the  motor  inertia 
and  friction.  Equation  (11)  is  an  approximation  for  a 
slow-moving  arm — provided  e*  and  its  first  and  second 
derivatives  are  bounded,  the  disturbance  to  (11)  can  be 
made  arbitrarily  small  by  time  scaling  the  trajectory 
of  the  healthy  joints.  If  |e*|  is  assumed  small  and  Vu 
changes  little  with  e*,  then  (11)  can  be  approximated 
by  the  following  normalized  linear  differential  equation: 

0  W  +  +  aM**-1  X  x  fi\€i  (12) 

where  V*  is  the  nominal  value.  Note  Mu  ^  0  because 
M  is  positive  definite.  This  system  acts  as  one  under 
a  proportional  plus  derivative  control  law  driving  e*  to 
zero  (and  therefore  qi  to  ^|?i=o)-  It  has  a  damping  ratio 
undamped  natural  frequency  and  time  constant 
rc  given  by  the  following: 


V* 

C  =  7  (13) 

2\/Mii||si*_i  x  x  g\\ 


Provided  the  manipulator’s  motion  is  slow  enough 
that  the  disturbance  to  (12)  is  minimal,  qi  can  be  viewed 
as  tracking  0* \qi=o  in  much  the  same  way  as  the  healthy 
joints  can  be  viewed  as  tracking  their  desired  values.  If 
fast  motion  of  the  arm  (relative  to  rc)  is  desired,  for 
large  £,  joint  i  will  tend  to  lose  tracking  and  lag  behind, 
and  for  small  £,  the  joint  will  tend  to  oscillate.  However, 
even  if  this  occurs,  the  arm  can  be  stopped  at  key  points 
and  allowed  to  settle.  Very  fast  motion  may  invalidate 

(11)  and  marginalize  the  usefulness  of  it  (and  therefore 

(12) )  in  predicting  behavior. 

III.  Single-Mode  Kinematics 

Prior  to  focusing  on  the  process  of  switching  modes, 
in  this  section,  kinematic  methods  will  be  presented  for 
postfailure  control  in  the  presence  of  a  single  mode  of 
failure. 


For  a  healthy  arm,  the  positional-kinematic  equa¬ 
tion  that  a  manipulator  configuration  reach  a  hand  pose 
Thand  is  given  by 

/(q)  =  Thand)  (16) 

where  /(•)  is  the  mapping  from  configuration  to  hand 
pose.  Because  (16)  is  typically  difficult  to  solve,  a  lin¬ 
earization  is  often  made  based  on  the  manipulator  Ja¬ 
cobian  equation.  If  x  is  a  representation  of  the  hand’s 
linear  and/or  rotational  velocity  and  the  corresponding 
manipulator  Jacobian  is  given  by 

J  =  IJi  j2  ..  j„],  (17) 

then  the  manipulator  Jacobian  equation  is 

x  =  Jq.  (18) 

For  solving  (18)  given  a  desired  x,  one  can  apply  any 
number  of  techniques  using  a  suitable  joint-rate  weight¬ 
ing  matrix.  This  weighting  matrix  will  be  labeled 
W(q) — it  is  symmetric  and  if  positive  definite  defines  a 
metric  on  the  q  space  through 

*?(  4)  =  \/qTWq.  (19) 

Different  modes  of  failure  in  this  system  will  be 

addressed  below,  and  their  impact  on  these  equations 
will  be  discussed. 

A.  Locked  Joint 

Assume  a  failure  at  joint  i.  When  the  failed  joint  is 
locked,  a  manipulator  configuration  that  reaches  hand 
pose  Thand  must  satisfy  (16)  and 

Qi  =  A?  (20) 

where  t{  is  the  locked  value.  The  velocity  equation  be¬ 
comes 

x  =  ‘J^q,  (21) 

where  *J  is  formed  by  removing  column  i  from  J;  i.e., 

*J  =  lh  •  •  •  j*-i  J*+i  *  •  ■  J n]>  (22) 

and  *q  is  formed  by  removing  element  i  from  q.  For 
trajectory  generation,  (21)  can  be  solved  for  *q  which 
can  be  integrated  to  find  *q  while  g*  remains  constant. 

The  induced  weighing  matrix  for  *q  is  *  W,  formed 
by  removing  row  i  and  column  i  from  W.  Clearly, 
iq  =  qTWq  when  qi  is  constant.  If  W  is  positive 

definite,  then  lW  is  also. 
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B.  Free-Swinging  Joint 

When  joint  i  is  in  free-swinging  mode,  the  manip¬ 
ulator  configuration  to  reach  a  hand  pose  Thand  while 
stationary  must  satisfy  (16)  and 

0i(  q)  =  0,  (23) 

where  0,-  is  the  swing  angle. 

For  a  slow-moving  manipulator  0;(q)  «  0.  Pro¬ 
vided  neither  s*_x  nor  g  is  parallel  to  iz_x,  this  gives, 
since  element  i  of  V0t-  is  —1, 

~  %TiVeu  (24) 

where  *V0j  equals  V0t*  with  entry  i  removed. 

The  velocity  equation  for  a  slow-moving  manipula¬ 
tor  then  becomes 

x  «  *J‘q,  (25) 

where 

■j  =  +jiiVeiT.  (26) 

Thus,  when  the  manipulator  is  slow  moving,  the  veloc¬ 
ity  equation  has  the  same  structure  for  a  free-swinging 
failure  in  (25)  as  for  a  locking  failure  in  (21).  The  dif¬ 
ference  between  the  locked-joint  and  the  free-swinging 

•  ~  T 

Jacobians  is  the  rank-one  matrix  j V0*  .  For  trajec¬ 
tory  generation,  (25)  can  be  solved  for  *q,  which  can  be 
integrated  to  find  *q.  An  updated  value  of  *q  can  then 
be  used  with  the  previous  value  of  qi  to  solve  for  the  as¬ 
sociated  swing  angle  to  be  added  to  the  previous  value 
of  qi  to  get  the  new  value. 

The  weighing  matrix  induced  by  a  free-swinging 
failure  is 


fw  =  ‘w  +  wjvei  ‘vV  +  ‘w  jvej  +  ‘veswj, 

(27) 

where  Wu  is  diagonal  entry  i  of  W  and  *Wj  is  column 
i  of  W  with  entry  i  removed.  With  this  definition, 
*4  *W*q  «  qTWq  when  qi  is  free-swinging.  As  was 
the  case  for  * W,  if  W  is  positive  definite,  then  *W  is 
also. 

IV.  Switching  Between  Modes 

A  locked-joint  failure  cannot  be  arbitrarily  con¬ 
verted  to  a  free-swinging  failure  without  undesirable  dy¬ 
namic  effects.  Equation  (25)  does  not  apply  when  the 
swing  angle  is  large  because  acceleration-  and  Coriolis- 
based  forces  are  manifest  and  the  joint  swings,  possibly 
violently.  Thus,  the  primary  issue  when  making  this 


type  of  conversion  is  ensuring  that  the  swing  angle  is  at 
(or  at  least  near)  zero. 

Conversely,  when  the  manipulator  is  operating  in 
free-swinging  mode  and  is  slow  moving,  a  brake  can 
be  applied  at  any  time  with  minimal  dynamic  effects. 
Thus,  (25)  can  be  converted  to  (21)  whenever  ‘J  loses 
rank,  for  example.  The  important  parameter  when  this 
type  of  conversion  is  made  is  the  actual  value  of  q{~ 
it  will  be  frozen  until  another  conversion.  Thus,  the 
primary  issue  for  a  free-swinging-to-locked  conversion 
is  setting  the  value  of  the  failed  joint. 

These  two  issues  of  setting  the  swing  angle  to  zero 
in  a  locked-joint  manipulator  and  setting  the  joint  angle 
to  a  desired  value  in  a  free-swinging  manipulator  are  in 
fact  the  same  positional  kinematic  problem — finding  a 
configuration  giving  both  0*  =  0  and  $  =  t{  for  some 
A-  It  will  be  shown  here  that  for  a  slow-moving  manip¬ 
ulator  they  are  amenable  to  the  same  form  of  velocity- 
kinematic  equation. 

A.  Locked  to  Free  Swinging 

Since  this  type  of  conversion  should  be  made  when 
the  value  of  the  swing  angle  is  zero,  the  focus  here  is  on 
finding  a  configuration  of  the  failed,  locked-joint  manip¬ 
ulator  that  gives  a  vanishing  swing  angle.  If  the  arm’s 
dynamic  and  kinematic  parameters  are  well  known,  this 
is  essentially  a  root-finding  problem  for  which  any  num¬ 
ber  of  techniques  can  be  applied. 

Here  the  problem  will  be  cast  as  a  familiar  velocity- 
kinematic  equation.  This  has  the  advantage  of  general 
applicability  and  quality,  if  not  completeness,  of  its  solu¬ 
tion.  It  can  be  used  with  feedback  from  sensors  to  refine 
solutions  when  the  arm’s  parameters  are  in  doubt. 

When  joint  i  is  locked,  the  governing  velocity- 
kinematic  equation  is  given  by 

—  V0i  zq.  (28) 

Values  of  *q  for  control  can  be  found  in  a  number  of 
ways,  with  possible  inclusion  of  additional  constraints. 

For  example,  one  approach  is  to  use  a  weighted 
pseudoinverse  based  on  *  W.  This  allows  0*  to  be  differ¬ 
entially  changed  with  the  minimum  joint-rate  measure 
as  given  by. (19).  If  *W  is  positive  definite  and  provided 

*  V0;  ^  0,  this  solution  for  a  desired  value  of  0*  can  be 
found  as  follows: 

*q=  -  ,T  9i  - — (29) 

Equation  (29)  can  be  used  with  torque  feedback  to 
set  the  swing  angle  accurately  to  zero.  The  swing  angle 
can  be  found  for  small  values  (~f  <  0*  <  j)  using 
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6i  =  sin  1  ( ^ - 7 — 77777 - =77)  ,  (30) 

\lta-i  x  xg\\J 

where  <7*  is  the  measured  torque  at  joint  i. 


B.  Free  Swinging  to  Locked 

The  primary  goal  for  this  type  of  conversion  is  to 
lock  the  joint  at  the  appropriate  value.  So  the  focus  is 
on  seeking  a  configuration  of  the  failed,  free-swinging 
manipulator  that  gives  the  desired  value  of  the  failed 
joint.  The  velocity-kinematic  equation  for  this  focus  is 

Qi  »  *  VffiT< q,  (31) 

and  this  can  be  directly  used  with  measured  joint  posi¬ 
tion  feedback.  It  has  the  same  form  of  solution  as  did 
the  locked-to-free-swinging  conversion.  So,  again  as  an 
example,  if  *W  is  positive  definite  and  provided 
is  defined  and  not  equal  to  zero,  a  healthy-joint-rate 
solution  giving  a  desired  value  of  can  be  found  using 

%  = - T^l - *w (32) 

*V0t-  *w  *'V0,- 

V.  Example 

The  simple  planar  redundant  arm  shown  in  Fig.  1 
will  be  used  as  an  illustrative  example.  Its  link  lengths 
are  1  m,  the  link  masses  are  10  kg,  the  center  of  mass 
of  each  link  is  at  the  center  of  the  link,  and  the  link 
inertias  are  modeled  as  thin  rods. 


A .  Switching  Strategy 

To  illustrate  a  strategy  for  postfailure  operation 
through  switching,  assume  that  a  failure  of  joint  one  has 
occurred.  The  beginning  configuration  is  the  top  one 
from  the  right-hand  side  of  Fig.  1.  The  goal  is  to  reach 
the  point  {—2.5,0}.  To  achieve  this,  it  is  necessary  to 
use  a  mode-switching  scheme.  To  reach  {-2.5,0},  q\ 
must  be  greater  than  cos”^— 1§)  and  less  than  n  - 
cos”1  (—“|).  The  intersection  of  this  range  of  values 
with  those  possible  for  joint  one  after  a  free-swinging 
failure  gives  tan^f )  <  <  tt  -  cos”^-^). 

Let  the  joint-rate  weighting  matrix  be  given  by 
W  =  diag(9,4, 1)  which  is  an  appropriate  choice  for 
reducing  the  relative  burden  on  the  inboard  joints.  A 
switching  sequence  based  on  using  this  value  of  W  in 
(29)  and  (32)  for  moving  q\  and  relocking  it  in  this  range 
to  achieve  the  task  is  shown  in  Fig.  2. 

B.  Joint  Trajectory  Generation 

In  this  section,  postfailure  operation  through  joint 
trajectory  generation  will  be  explored.  The  failure  for 


this  example  lies  in  joint  one.  The  joint-one  swing  angle 
is  given  by  -r-.  V  \„ 

01  =  Atan2[  -  (5cos(^)  +3cos(?i  +  q2)  +  cos +  q2  +  $s)), 
—  (5sin(?/)  +  3sin(?j  +  52)  +  sin  (qj  +  q2  +  ?$))] 

(33) 

and,  provided  0\  ^  7r,  the  gradient  is  given  by 


V0i 


-1 

-10-15  cos (frg)-6  cos(g$)-5  cos(g3  +  gj) 
35+30  cos (?s)+6  cos(?$)+10  cos(q2  +  qs) 

_ —1—3  cosfgg)— 5  cosjqz+qs) _ 

35+30  cos(^)+6  cos(gs)+10  cos($2  +  0S) 


(34) 


The  motor  will  be  assumed  disconnected  from  the 
failed  joint  for  this  simulation  experiment,  so  Mu  is 
the  diagonal  entry  of  the  mass  matrix  induced  by  the 
inertia  of  the  manipulator’s  links,  and  it  is  assumed  that 
Vu  =  15.0. 

A  trajectory  was  generated  using  the  method  given 
in  Section  III-B  for  an  end-effector  path  given  by  the  line 
segment  from  (-.75,  -2.0, 0.0)  to  (-.75,0.0,0.0).  This 
is  the  path  shown  in  Fig.  1,  and  it  lies  well  within  the 
joint-one  free-swinging  postfailure  workspace.  To  form 
an  end-point  trajectory  from  this  path,  a  trapezoidal 
velocity  profile  was  used,  with  a  20%  speed-up  and  20% 
slow-down  time.  Three  configurations  along  the  ideal¬ 
ized  joint  path  for  this  trajectory  are  shown  on  the  left 
side  of  Fig.  1.  For  this  joint  path,  £  E  [.046,  .145]  and 
lou  E  [2.40,3.08];  both  start  at  the  low  end  and  finish 
at  the  high  end  of  their  ranges.  Thus  the  system  would 
be  expected  to  be  underdamped  and  oscillate  with  2.0 
to  2.6  seconds  per  cycle  (with  the  lower  value  at  the 
end  of  the  trajectory).  And  this  is  in  fact  what  is  seen 
in  dynamic  simulations.  The  error  in  joint  one  for  four 
trajectory  times  is  illustrated  in  Fig.  3. 


VI.  Summary 

This  article  presented  ways  to  exploit  a  manipula¬ 
tor’s  postfailure  capabilities  when  conversion  between 
free-swinging  and  locked-joint  failure  modes  is  possible. 
Velocity-control  methods  were  discussed  for  both  types 
of  failures,  and  ways  of  effectively  converting  between 
the  two  types  were  presented.  An  example  illustrat¬ 
ing  switching  strategies  and  postfailure  operation  was 
given. 
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Step  One 


Fig.  2.  Three  configurations  for  each  of  three  steps  of  a  switching 
strategy  are  shown.  The  goal  is  to  reach  {-2.5, 0},  shown  by  cross 
hairs.  In  step  one,  the  manipulator  is  moved  using  (29)  from  its 
initial  configuration  to  a  configuration  giving  a  zero  swing  angle, 
where  the  brake  is  released.  In  step  two,  the  arm  is  moved  using 
(32)  to  place  joint  one  at  the  desired  value,  where  the  brake  is 
reapplied.  In  the  final  step,  the  manipulator  is  moved  to  the 
desired  hand  position  by  solving  (18)  for  joint  rates  that  give 
straight-line  end-point  motion. 


Fig.  3.  The  joint-one  error  as  the  arm  traced  the  two-meter 
vertical  trajectory  shown  in  Fig.  1  with  joint  one  free  swinging. 
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Abstract 

This  paper  describes  a  study  of  high-precision  servomech¬ 
anisms  using  traction  drive  and  proposes  a  method  to  de¬ 
tect  slip  velocity  with  high  accuracy  and  high  resolution. 
A  new  two-phase  type  PLL  using  non-sinusoidal  signals 
was  used  for  detecting  this  slip  velocity.  We  applied  the 
proposed  method  to  measure  frequency  characteristics  of 
torque  transmission  in  traction  drive,  and  obtained  fre¬ 
quency  characteristics  by  using  cross  correlation  method. 
Experimental  results  of  servo-control  based  on  the  measured 
characteristics  to  improve  stability  and  load  characteristics 
of  servomechanisms  by  slip-velocity  feedback  are  also  pre¬ 
sented. 

1  Introduction 

High-precision  servomechanisms  require  high-accuracy 
reduction  mechanisms  for  torque  matching.  The  usual  re¬ 
duction  mechanisms  are  gear  reducers.  However,  the  gear 
reducers  have  large  transmission  error  due  to  the  backlash, 
profile  error,  pitch  error  of  gears.  If  the  frequency  compo¬ 
nents  of  transmission  error  are  distributed  only  in  lower  fre¬ 
quency  range,  it  is  easy  to  compensate  by  varying  motor  ve¬ 
locity.  On  the  other  hand,  if  the  frequency  is  beyond  the 
response  frequency  of  servomotors,  it  becomes  difficult  to 
compensate.  Unfortunately,  transmission  errors  of  reduc¬ 
tion  gears  are  distributed  over  wide  frequency  range  so  that 
it  becomes  difficult  for  us  to  achieve  high-accuracy  servo- 
control  by  using  reduction  gears. 

Traction  drive  has  excellent  transmission  characteristics 
compared  with  reduction  gears.  Backlash  of  traction  drive 
is  negligible  and  the  frequency  components  of  transmission 
error  are  small  in  high  frequency  range.  Actually,  trac¬ 
tion  drive  contains  only  transmission  error  associated  with 
shaft  eccentricity  that  has  low  frequency.  This  is  signifi¬ 
cant  to  high-precision  servomechanisms  because  the  trans¬ 
mission  error  can  be  compensated  more  easily  than  gear 
reducers.  Therefore  EMURA  applied  traction  drive  to  a 
constant-velocity  servomechanism.  The  authors  developed 
a  high-precision  servomechanism  and  demonstrated  the  per¬ 
formance  of  traction  drive  for  high-precision  servomecha¬ 
nisms  by  experimental  results[l]. 


Traction  drive  transmits  torque  by  viscous  force  of  fluid 
film  formed  between  two  rolling  rollers [2].  Torque  trans¬ 
mission  of  traction  drive  is  accompanied  with  slip  between 
the  two  rollers.  The  torque  transmission  characteristics  ver¬ 
sus  slip  are  very  important  to  the  servomechanisms  that  use 
traction  drive.  Applying  traction  drive  to  positioning  ser¬ 
vomechanisms,  we  have  to  carry  out  feedback  control  of  ro¬ 
tary  angle  of  output  axis.  This  means  that  the  slip  takes  place 
inside  feedback  loop  and  affects  the  performance  of  the  ser¬ 
vomechanisms.  In  the  experiments  of  servo-control,  we  ob¬ 
served  resonant  oscillations  caused  by  the  slip.  Therefore  it 
is  important  to  understand  torque  transmission  characteris¬ 
tics  versus  slip.  The  transmission  characteristics  of  traction 
drive  have  been  investigated  by  many  researchers  [3]  and  in 
the  case  of  steady  transmission  torque  it  has  become  well 
known  that  the  torque  is  approximately  proportional  to  the 
slip  ratio.  However,  most  of  reported  results  are  only  about 
static  characteristics  and  the  dynamic  characteristics  have 
not  yet  clarified,  because  it  is  very  difficult  to  measure  the 
dynamic  characteristics.  On  the  other  hand,  the  dynamic 
characteristics  of  torque  transmission  are  very  important  to 
the  dynamic  performance  of  servomechanisms. 

Therefore,  in  this  study,  the  authors  investigated  the  trans¬ 
mission  characteristics  versus  slip  by  experiments.  Slip  be¬ 
tween  the  two  rollers  is  very  small  so  that  it  is  difficult  to 
detect  with  high  accuracy  by  using  conventional  methods, 
in  particular,  at  a  high-velocity  rotation.  Emura  proposed  to 
use  non-sinusoidal  2-phase  type  PLL  for  high-resolution  de¬ 
tection  of  slip  velocity [4].  This  method  used  two  encoders 
mounted  on  input  axis  and  output  axis  respectively.  By  in¬ 
terpolating  the  phase  of  the  two  encoders,  we  are  able  to  de¬ 
tect  the  difference  of  rotary  velocity  between  the  two  rollers 
with  high  accuracy  and  high  resolution.  However,  the  ec¬ 
centricity  and  run-out  of  rollers  induce  large  ripples  to  the 
velocity  difference,  and  the  output  of  detection  circuits  con¬ 
tains  large  error.  In  this  study,  cross  correlation  method  was 
used  to  remove  such  error  for  precise  investigation  of  the 
frequency  characteristics  of  torque  transmission. 

The  experimental  results  showed  that  the  torque  trans¬ 
mission  characteristics  have  large  nonlinearity  and  the  lin¬ 
ear  range  is  very  small.  The  nonlinearity  of  torque  trans¬ 
mission  induces  unstability  like  oscillations  to  servomecha- 
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Measured  wave  (slip  +  run-out) 


nisms  and  the  robustness  for  load  fluctuation  becomes  poor. 
To  improve  the  stability  and  load  characteristics,  we  pro¬ 
posed  a  slip-velocity  feedback  control  to  the  servomecha¬ 
nism  basing  on  the  measured  torque  transmission  charac¬ 
teristics.  The  experimental  results  showed  that  the  stability 
and  load  characteristics  of  the  servomechanism  were  signif¬ 
icantly  improved. 

This  paper  describes  a  new  method  measuring  the  dy¬ 
namic  characteristics  of  transmission  torque  and  gives  ex¬ 
perimental  results  of  servo-control  based  on  the  measured 
dynamic  characteristics. 

2  Measurement  of  Slip  Velocity 
2.1  Slip  Velocity  of  Traction  Drive 

The  traction  drive  type  reducer  used  in  the  servomecha¬ 
nism  is  shown  in  Fig.  1 .  Two  rollers  are  pressed  against  each 
other  by  pressing  force  Fr  and  are  coupled  by  traction  force 
Ft.  The  rollers  are  lubricated  by  traction  oil.  The  other  no¬ 
tations  of  Fig.  1  are  as  follows. 

$i :  rotary  angle  of  driving  shaft 
$2*  rotary  angle  of  driven  shaft 

R\  :•  radius  of  driving  roller 

Ri\  radius  of  driven  roller 

Tm :  torque  of  driving  motor 

Ti\  load  torque. 

Slip  between  the  two  rollers  occurs  when  torque  is  transmit- 


£(/) 

Correlation  | 

wvwvwvwvm 

C(t) 


Wave  of  slip  ratio 


Fig.  2.  Detection  of  slip  velocity  by  cross  correlation. 


ted.  The  slip  ratio  S  is  defined  as  follows. 

5-  ‘KRiOx-RiBi) 

{R\9\  +  RiOi) 

For  a  constant-velocity  servomechanism,  the  average  veloc¬ 
ity  (R\6\  +  RiQi)/!  is  kept  constant,  thus  it  is  enough  for 
us  to  detect  the  difference  of  rotary  velocity  R\6\  —  R262. 
Two  encoders  were  mounted  to  each  axis.  The  output  pulse 
trains  of  encoders  have  the  phase  of  the  pulses  proportional 
to  rotation  angle.  To  know  the  velocity  difference  of  the 
two  axes,  we  only  need  to  detect  phase  difference  of  the  two 
pulse  trains. 

In  order  to  obtain  high-accuracy  detection,  high- 
resolution  encoders  are  required.  However,  because  high- 
resolution  encoders  can  not  rotate  at  high  rotary  velocity,  we 
have  to  use  low-resolution  encoders.  Therefore  the  authors 
used  non-sinusoidal  2-phase  type  PLL  method  to  interpolate 
the  low-resolution  output  of  encoders  for  high-resolution 
detection.  This  method  compares  the  phase  difference  be¬ 
tween  two  pulse  trains  and  output  a  pair  of  triangular  waves 
with  their  phase  proportional  to  slip  velocity.  The  triangular 
waves  were  interpolated  with  high  accuracy  and  high  reso¬ 
lution  by  2-phase  type  PLL.  The  slip-velocity  detector  that 
uses  non-sinusoidal  2-phase  type  PLL  was  described  in  de¬ 
tail  in  [4]. 


2.2  Detection  of  Slip  Velocity  Using  Cross  Correla¬ 
tion 

By  using  above-mentioned  method,  we  can  investigate 
the  frequency  response  of  torque  transmission.  In  the  exper¬ 
iments,  we  controlled  the  driving  roller  to  rotate  at  a  constant 
velocity,  and  gave  load  torque  of  sinusoidal  wave  to  output 
axis.  By  detecting  the  slip  velocity  with  high  resolution,  we 
can  know  the  relation  of  transmitted  torque  and  slip  veloc- 
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ity.  However,  from  (1),  it  is  easy  to  find  that  the  run-out  of 
the  rollers  induces  error  to  the  detection.  This  situation  is 
shown  in  Fig.2,  where  the  reference  wave  is  used  for  pro¬ 
ducing  load  torque.  When  slip  is  small,  slip  velocity  has  the 
same  frequency  as  that  of  load  torque.  However  the  mea¬ 
sured  data  contain  components  associated  with  eccentricity 
of  rollers.  To  remove  the  error  we  used  cross  correlation 
method  expressed  as  follows. 

N—\ 

c(t)=-Y,9d  +  tMi),  (2) 

i=0 

where,  h(i)  is  sampled  data  from  slip-velocity  detector, 
g(i )  =  sin(ki)  is  the  reference  signal  that  has  the  same  fre¬ 
quency  as  given  load  torque,  N  is  the  total  number  of  ref¬ 
erence  data,  i  is  an  integer,  and  t  indicates  time  lag.  For 
example,  the  output  of  slip-velocity  detector  contains  slip- 
velocity  signal  A  sin(ki)  and  error  B  sin(rru)  caused  by  run¬ 
out  of  rollers,  then  we  have 


C(t)  =  —  sin{fc(£  +  t)}{Asin(fci)  +  ff  sin(7m)} 

i= 0 

A  1  N~l 

=  —  cos  (kt)  +  a7  [—A  cos{2  k(i  +  £)} 

■i=0 

+£  sin {k(i  + 1)}  sin(rm)],  (3) 

where,  k  4  m.  If  iV  is  large,  we  have  lim  C(t)  = 

N— ►  oo 

Acos(fc£)/2.  This  means  that  we  can  carry  out  the  mea¬ 
surement  very  precisely  by  using  sufficient  number  of  data. 
Fig. 3  shows  the  experimental  results.  The  output  of  slip- 
velocity  detector  contains  components  associated  with  load 
torque  and  other  components  with  different  frequency.  The 
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Fig.  3.  Measured  results  with  cross  correlation. 


Fig.  4.  Measured  torque  transmission  characteristics 


components  with  different  frequency  from  load  torque  were 
effectively  removed  by  the  cross  correlation.  From  the  re¬ 
sults,  we  made  certain  that  the  error  caused  by  run-out  of 
rollers  is  removed  and  slip-velocity  can  be  obtained  by  cross 
correlation  method. 

2.3  Frequency  Characteristics  of  Torque  Trans¬ 
mission 

In  this  study,  torque  transmission  characteristics  of  a  trac¬ 
tion  drive  used  for  a  servomechanism  were  investigated  by 
experiments.  Radii  of  driving  roller  and  driven  roller  of  this 
traction  drive  are  0.0 1 8m  and  0. 1 1 664m.  The  reduction  ratio 
of  this  traction  drive  is  6.48.  The  encoder  of  driving  shaft 
generates  5  000  of  two-phase  square  pulses  per  revolution 
(ppr)  (quadrupled  resolution  is  20  000  ppr)  and  the  encoder 
of  driven  shaft  generates  two-phase  square  pulses  of  32  000 
per  revolution  (quadrupled  resolution  is  129  600  ppr).  Be¬ 
cause  the  designed  reduction  ratio  is  6.48,  we  get  the  equal 
resolutions  for  both  shafts.  In  the  experiments,  input  axis 
was  controlled  to  rotate  at  a  constant  velocity  as  described 
above. 

For  comparison  with  frequency  characteristics  of  torque 
transmission,  static  characteristics  were  measured.  In  the 
experiments,  a  constant  load  torque  was  imposed  on  the  out¬ 
put  axis  and  the  slip  ratio  was  obtained  by  averaging  the 
output  of  slip- velocity  detector,  Fig.4  is  an  example  of  mea¬ 
sured  static  characteristics  of  torque  transmission,  where  ro¬ 
tary  velocity  of  output  axis  is  72rpm.  The  results  showed 
that  the  torque  transmission  characteristics  have  strong  non¬ 
linearity.  When  slip  ratio  is  small  (|S|  <  0.02%)  the 
torque  is  proportional  to  slip  ratio  and  the  torque  gets  sat¬ 
urated  when  slip  becomes  large  ( \S\  >  0.02%).  The  max¬ 
imum  transmission  torque  of  linear  range  is  about  0.5N*m. 
This  means  that  the  servomechanism  can  not  transmit  large 
torque  so  as  to  keep  slip  ratio  small  in  linear  area.  This  is 
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because  larger  torque  is  not  available  when  slip  increases. 

In  the  investigation  of  frequency  characteristics  of  torque 
transmission,  we  changed  the  load  torque  with  sinusoidal 
shape  and  with  the  frequency  range  from  0.1  Hz  to  180 
Hz.  Slip  ratio  was  detected  using  cross  correlation  method. 
The  cases  of  load  torque  of  small  amplitude  0.13N-m  and. 
large  amplitude  0.32N-m  were  investigated.  When  ampli¬ 
tude  is  smaller  than  0.25N-m,  the  torque  transmission  is 
in  linear  range  according  to  static  characteristics  shown  in 
Fig .4.  Fig. 5  shows  the  measured  frequency  characteristics 
of  torque  transmission,  where  the  slip  ratio  was  normalized 
by  the  pick  to  pick  value  of  load  torque.  For  the  case  of  linear 
range,  when  frequency  is  lower  than  0.2Hz,  torque  transmis¬ 
sion  characteristics  are  the  same  as  that  of  static  characteris¬ 
tics.  Slip  ratio  increases  with  the  increase  of  the  frequency 
and  normalized  slip  ratio  increases  rapidly  in  the  high  fre¬ 
quency  range.  However,  the  slip  ratio  remains  in  the  linear 
area  if  the  frequency  of  load  torque  is  lower  than  10Hz.  Ac¬ 
cording  to  static  characteristics,  when  the  normalized  slip 
ratio  is  lager  than  0.04%/(N-m),  the  torque  transmission  is 
beyond  linear  range  to  nonlinear  area.  The  results  mean  that 
slip  ratio  becomes  large  more  easily  under  changing  trans¬ 
mission  torque. 

For  the  case  of  large  load  torque,  torque  transmission  is 
in  nonlinear  area  according  to  static  characteristics.  This 
can  be  observed  from  Fig. 5  where  the  normalize  slip  ratio 
is  larger  than  0.04%/(N-m)  even  in  low  frequency  range. 
The  slip  does  not  increase  when  the  frequency  is  lower  than 
10Hz,  While  the  frequency  becomes  higher  than  20Hz,  nor¬ 
malized  slip  ratio  increases  rapidly.  From  Fig. 5,  we  can  also 
observe  that  the  frequency  characteristics  of  torque  trans¬ 
mission  behave  the  same  for  large  load  torque  and  small  load 
torque.  This  means  that  torque  transmission  was  in  nonlin¬ 
earity  in  both  cases  of  small  and  large  load  torque.  From  the 
experimental  results,  we  have  the  conclusions  that  torque 
characteristics  become  nonlinearity  more  easily  when  the 


transmission  torque  changes  rapidly  so  that  over-large  slip 
occurs. 

3  Servo-Control  of  Traction  Drive  Based  on 
Measured  Torque  Transmission  Character¬ 
istics 

Based  on  the  measured  torque  transmission  character¬ 
istics,  we  know  that  slip  ratio  becomes  large  when  trac¬ 
tion  drive  transmits  large  torque.  The  large  slip  results  in 
resonant  oscillations  to  the  positioning  servomechanisms. 
Moreover,  over-large  slip  will  cause  damages  to  roller  sur¬ 
faces.  This  is  fatal  to  a  high-precision  servomechanism.  In 
high-accuracy  positioning  control,  high  feedback  gain  is  al¬ 
ways  used.  When  there  is  positioning  error  that  is  usually 
caused  by  load  torque  change  or  some  other  disturbances, 
the  servomechanism  needs  traction  drive  which  can  transmit 
large  torque  to  suppress  the  tracking  error.  Consequently, 
large  slip  between  rollers  takes  place.  In  such  time,  over¬ 
large  slip  occurs  and  results  in  resonant  oscillations. 

3.1  Feedback  of  slip  velocity 

To  avoid  these  phenomena,  it  is  required  to  suppress  the 
slip  ratio.  Therefore,  we  introduced  a  slip- velocity  feedback 
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control  to  the  servomechanism.  This  method  uses  a  control 
strategy  that  when  the  slip  ratio  becomes  large  the  controller 
carries  out  feedback  of  slip  velocity.  The  reason  that  slip- 
velocity  feedback  is  not  carried  out  for  small  slip  ratio  is 
that  when  slip  velocity  is  small  the  detection  error  becomes 
comparatively  large  and  induces  error  to  the  servo  control. 
The  control  strategy  is  expressed  as  follows. 

f  u  =  uq  |5|  <  So 

\  u  =  uo  -  ksvs  |S|  >  So 

where,  vs  indicates  slip  velocity,  ks  is  feedback  gain,  u  is 
controller  output,  u0  is  the  controller  output  without  slip- 
velocity  feedback,  and  So  is  a  constant  less  than  the  slip  ratio 
when  torque  reaches  saturation. 

3.2  Control  system 

Implementation  for  the  experiments  is  shown  in  Fig. 6. 
The  driving  roller  is  driven  with  a  DC  servomotor.  Two  en¬ 
coders  are  mounted  on  the  driving  shaft  and  driven  shaft. 
Slip-velocity  detector  uses  non-sinusoidal  2-phase  type  PLL 
to  interpolate  the  phase  difference  of  the  two  encoders.  The 
positioning  servo  controller  uses  a  tracking  error  detector 
that  can  interpolate  the  rotary  angle  of  output  axis  to  high 
resolutionf  1].  In  the  experiments  we  achieved  a  resolution 
of  0.625sec. 

Block  diagram  of  the  servomechanism  is  shown  in  Fig.7, 
where 

0r :  reference  input 
Ji :  moment  of  inertia  of  driving  shaft 
J2\  moment  of  inertia  of  driven  shaft 
c\ :  viscous  damping  coefficient  of  driving  shaft 
c2:  viscous  damping  coefficient  of  driven  shaft 
Kfl:  viscous  friction  coefficient  of  traction  drives 


(a)  without  slip-velocity  feedback 


0  0.1  0.2 

Time  [s] 

(b)  with  slip -velocity  feedback 


Fig.  8.  Experimental  results  of  slip-velocity  feedback 


control. 

Ga(s ): 

transfer  function  of  servomotor  and  servo 

-amplifier 

G($y. 

transfer  function  of  position  controller 

vs- 

slip  speed  R\0\  -  R202 . 

This  system  is  basically  for  a  positioning  servo  control. 


191 


Slip  ratio  (1%/div)  Slip  ratio  (1%/div) 


We  added  slip-velocity  feedback  to  the  basic  system.  Switch 
function  is  according  to  Equation  (1)  and  (4),  where  the  av¬ 
erage  velocity  of  the  two  roller  (R\6i  +  was  ap¬ 

proximated  by  Rrfr.  Positioning  control  and  slip-velocity 
feedback  control  were  achieved  by  using  a  microcomputer. 
Because  the  switch  function  of  slip-velocity  feedback  con¬ 
trol  requires  high-speed  operations,  we  set  the  sampling  pe¬ 
riod  of  control  to  10"V. 

3.3  Step  response  of  servomechanism 

To  investigate  the  stability  of  the  servomechanism,  ex¬ 
periments  of  step  response  were  carried  out.  Fig. 8  shows 
the  experimental  results  of  step  response  of  the  servomecha¬ 
nism.  The  amplitude  of  angular  step  input  is  lOOOsec.  From 
the  results  shown  in  Fig.8,  we  can  see  that,  without  slip- 
velocity  feedback,  the  slip  ratio  became  larger  than  1.5% 
instantaneously  and  large  overshoot  appeared  to  make  the 
settle  time  of  5%  longer  than  0.08s.  While,  by  the  control 
method  with  slip-velocity  feedback,  the  slip  ratio  was  con¬ 
trolled  to  less  than  0.3%  and  the  settle  time  became  0.04s. 
The  results  indicate  that  the  dynamic  characteristics  are  im¬ 
proved  significantly  by  slip- velocity  feedback. 

3.4  Load  characteristics  of  servomechanism 

Load  characteristics  are  very  important  for  servomecha¬ 
nisms.  Because  the  nonlinearity  of  torque  transmission  of 
traction  drive,  load  characteristics  of  servomechanisms  that 
use  traction  drive  are  very  poor  when  load  torque  is  large. 
Fig. 9  is  the  experimental  results  of  load  characteristics.  In 
the  experiments  a  load  torque  of  step  change  was  added  to 
the  output  axis  through  load  motor.  From  Fig.9(a),  we  can 
find  that  large  tracking  error  appears  instantaneously.  This 
is  because  that  slip  becomes  large  and  torque  transmission 
reached  the  nonlinear  area.  The  slip  ratio  is  large  than  0.02% 
during  instantaneous  period  of  load  torque  and  existence  of 
other  disturbances.  Over-large  slip  can  be  suppressed  by  in¬ 
troducing  slip-velocity  feedback.  Fig. 9(b)  shows  the  exper¬ 
imental  results  with  slip-velocity  feedback  control.  Because 
the  over-large  slip  did  not  take  place,  no  large  tracking  er¬ 
ror  appears  instantaneously  as  that  in  Fig.9(a).  Therefore, 
we  can  confirm  that  slip-velocity  feedback  control  is  useful 
to  improve  load  characteristics  of  servomechanisms  that  use 
traction  drive. 

4  Conclusions 

In  this  study,  the  authors  investigated  the  torque  trans¬ 
mission  characteristics  of  the  traction  drive.  The  cross 
correlation  method  was  used  for  detecting  frequency  re- 

1  sec  is  used  to  stand  for  second  of  rotary  angle  and  s  for  second  of  time  in  this 
paper. 
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(b)  with  slip-velocity  feedback 

Fig.  9.  Improvement  of  load  characteristics  by  slip- 
velocity  feedback  control. 

sponse  of  slip  ratio.  Considering  the  measured  torque  trans¬ 
mission  characteristics,  we  introduced  slip-velocity  feed¬ 
back  method  to  improve  the  dynamic  characteristics  of  ser¬ 
vomechanisms.  The  experiments  confirmed  the  effective¬ 
ness  of  proposed  method. 
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Abstract 

This  paper  both  presents  the  current  approach  of  Intelli¬ 
gent  Robotics  Laboratory  (IRL)  to  intelligent  robotics 
systems  at  Vanderbilt  University  and  outlines  the  upcom¬ 
ing  research  issues  which  tend  to  realize  intelligently  be¬ 
having  humanoid  robots  in  a  distributed  computing  envi¬ 
ronment  by  making  use  of  the  most  recent  technologies. 

1.  Introduction 

The  rapid  growth  of  the  power  of  computers,  and  con¬ 
current  decline  in  their  cost,  has  increased  dramatically 
the  potential  of  robots  to  interact  naturally  with  the  world. 
Until  recently  robots  were  useful  only  in  highly  structured 
environments  such  as  an  assembly  line,  where  they  could 
perform  repetitive  tasks.  They  have  been  quite  unlike 
animals,  which  must  react  and  adapt  to  survive  while  pur¬ 
suing  their  goals  in  a  dynamic  environment.  An  animal 
can  react  and  adapt  because  it  can  sense  the  world,  it  has 
some  concept  of  how  its  actions  affect  the  world,  and 
when  its  actions  do  not  have  the  desired  effect,  it  can 
change  them  (or  it  dies).  Fast,  cheap  computers  probably 
can  now  (or  will  soon  be  able  to)  processes  sensory  data 
in  sufficient  quantity  quickly  enough  to  permit  a  robot  to 
adapt  to  a  natural,  unstructured  environment. 

Such  adaptability  requires  more  than  raw  computational 
power.  It  requires  the  robot's  sensory  system  to  be  prop¬ 
erly  organized,  it  requires  sensing  to  be  appropriately 
coupled  to  the  robot's  actions,  and  it  requires  that  the  ac¬ 
tions  of  the  robot  in  response  to  specific  sensory  input  be 
changeable  by  the  robot  itself.  Proper  organization  is 
necessary  to  permit  simultaneous  sensing  in  a  number  of 
different  modalities  and  to  enable  shifts  of  attention  from 
one  task  to  another  in  response  to  external  events  and  in¬ 
ternal  status.  The  coupling  of  sensing  to  action  embeds 
the  robot  in  its  environment;  it  links  the  sensual  to  the 
corporeal  so  that  the  geometry  and  dynamics  of  the  exter¬ 
nal  world  can  be  understood  in  terms  of  those  of  the  robot. 
A  mapping  from  sensory  input  to  action  that  is  malleable 
by  the  robot  is  necessary  for  learning,  and  learning  is  the 
essence  of  adaptability. 


This  paper  concerns  the  logical  organization  of  a  sen¬ 
sory  system  for  a  humanoid  robot.  The  robot  is  ISAC, 
developed  at  the  Center  for  Intelligent  Systems  at  Vander¬ 
bilt  University.  ISAC  has  two  7-DOF  arms,  an  active, 
stereo,  color,  vision  system,  6-axis  force-torque  sensors  at 
the  wrists,  simple  haptic  sensors  on  the  fingers  of  its  an¬ 
thropomorphic  hand,  haptic  sensors  and  proximity  sensors 
on  the  palm  of  the  hand,  and  ISAC  has  an  array  of  infra¬ 
red  motion  detectors  on  its  torso.  The  robot  has  sound 
input  which  is  currently  be  modified  to  permit  sound 
source  localization.  Data  from  these  sensors  will  be  proc¬ 
essed  as  if  it  is  projected  on  a  sphere  that  surrounds  the 
robot.  Albus  named  the  structure  an  EgoSphere  [1].  We 
refer  to  it  as  a  Sensory  EgoSphere  or  SES. 

In  theory,  the  sensors  and  low-level  sensory  processing 
algorithms  commonly  in  current  use  can  provide  an  exten¬ 
sive  set  of  spatial  features  at  any  point  on  the  SES.  For 
example  each  point  could  have  light  intensity,  hue,  color 
saturation,  edge  intensity,  edge  orientation,  depth  to  first 
surface,  motion  direction,  speed,  etc.  But  in  reality,  such 
extensive  sets  of  information  are  too  large  to  permit  real¬ 
time  processing.  The  complexity  of  sensory  information 
processing  can  be  reduced  through  the  use  of  attentional 
networks  [2].  These  are  sensory  processing  algorithms 
that  detect  data  of  potential  interest  as  a  function  of  the 
task  at  hand.  They  are  de  facto  information  filters.  By 
directing  the  computational  resources  to  important  but 
limited  and  localized  sets  of  sensory  data  at  any  one  time, 
an  AN  causes  the  rest  of  the  data  to  be  ignored.  Shifts  in 
attention  permit  exploration  of  the  environment.  By  con¬ 
tinuously  monitoring  several  different  sensory  modalities 
with  ANs  that  are  capable  of  interrupting  each  other,  at¬ 
tention  may  be  redirected  from  one  location  to  another. 
For  example,  a  sudden  loud  sound  detected  at  one  location 
by  an  audio  AN  could  interrupt  image  analysis  at  another 
location  and  cause  the  visual  AN  to  shift  attention  to  the 
direction  of  the  sound. 

2.  The  Learning  System  Architecture 

This  section  will  introduce  the  techniques  that  are  cur¬ 
rently  in  use  in  our  robotics  system. 
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2.1.  Intelligent  Machine  Architecture  (IMA) 

IMA,  being  the  backbone  of  all  the  software  written  in 
IRL,  is  a  software  system  design  paradigm  that  permits  the 
concurrent  execution  of  software  agents  on  separate  ma¬ 
chines  while  facilitating  extensive  inter-agent  communi¬ 
cation.  In  other  words,  IMA  is  a  distributed  software  ar¬ 
chitecture  specifically  designed  for  generating  intelligent 
machine  applications  [3], 

IMA  runs  under  MS  Windows  NT  4.0  and  makes  use  of 
DCOM  to  manage  the  inter-agent  communications. 
DCOM  (Distributed  Component  Object  Model)  is  a  serv¬ 
ice  of  MS  Windows  NT  which  allows  remote  objects  to 
be  treated  as  local,  namely,  it  is  the  framework  of  distrib¬ 
uted  computing. 

Agents  are  loosely  coupled,  which  facilitates  parallel 
processing.  For  larger  systems,  IMA  takes  advantage  of 
both  distributed  and  symmetric  multiprocessing  computer 
systems  more  effectively  than  do  monolithic  architectures. 
Each  agent  acts  locally  based  on  its  internal  state  and  pro¬ 
vides  a  set  of  services  to  other  agents  through  various  re¬ 
lationships.  The  loosely  couple,  asynchronous  operation 
of  decision-making  agents  simplifies  the  system  model  at 
high  level.  Whereas  over-simplification  of  the  higher 
levels  of  a  system  can  lead  to  non-robust  operation,  a  col¬ 
lection  of  asynchronously  executing  agents  is  more  stable. 
It  provides  an  abstraction  that  eliminates  many  concerns 
about  synchronization,  because  the  architecture  provides 
each  agent  with  a  knowledge  of  time  that  allows  each  lo¬ 
cal  decision  process  to  evaluate  information  based  on  its 
age  relative  to  that  agent.  The  agents  in  IMA  work  like  a 
group  of  control  loops,  continually  sampling  inputs,  up¬ 
dating  states  and  computing  new  outputs  [3], 

IMA  was  designed 

•  to  facilitate  distributed  implementation, 

•  to  be  interface  driven,  and 

•  to  allow  software  module  reusability  [4]. 

IMA  is  different  from  traditional  software  systems  in 
that: 

•  IMA  uses  a  system  level  model  that  is  agent  based 

•  IMA  uses  an  agent  level  model  that  is  component- 
object  based 

As  a  result  of  this,  each  resource,  task  or  domain  ele¬ 
ment  can  be  modeled  as  a  software  agent.  This  two  level 
architecture  allows  the  generation  of  reusable,  extensible 
programs  from  the  software-engineering  point  of  view  and 
also  introduces  parallelism,  scalability,  reactivity  and  ro¬ 
bustness  from  the  systems  engineering  point  of  view  [5]. 

With  this  agent-based  architecture,  along  with  the  prop¬ 
erties  of  reusability  and  scalability,  many  basic  routines 
have  been  created  for  the  IRL  database.  New  applications 
may  only  need  to  introduce  some  additional  agents  to 


handle  the  new  situation  while  benefiting  from  many  of 
the  previously  written  agents. 

2.2.  Spreading  Activation  (SA) 

Classical  AI  approaches,  being  deliberate  and  central¬ 
ized,  have  some  drawbacks.  As  a  result  of  being  deliber¬ 
ate,  any  failure  throughout  the  application  of  a  plan  results 
in  re-planning  from  scratch.  Also,  centralized  systems  are 
heavily  dependent  on  the  performance  of  the  central  unit. 
If  it  fails,  then  the  whole  system  fails.  On  the  opposite 
side  of  the  planning  spectrum  are  reactive  planners.  A 
fully  reactive  system  does  not  have  to  go  back  and  start 
from  scratch  when  the  current  action  fails.  It  simply 
chooses  the  second  best  action  towards  achieving  its  goal. 
This  type  of  behavior  has  some  advantages  over  deliberate 
planners,  but  suffers  from  getting  stuck  at  local  minimas 
or  vicious  circles. 

The  first  attempt  to  combine  deliberative  and  reactive 
behaviors  in  a  single  adjustable  distributed  system  came 
from  MIT.  P.  Maes  introduced  the  idea  of  Spreading 
Activation  Networks  by  which  she  intended  to  have  a  sys¬ 
tem,  the  behavior  of  which  could  be  adjusted  to  move 
towards  deliberate  or  reactive  behavior  by  changing  a 
global  parameter  [6].  This  algorithm  has  an  agent-based 
structure,  where  the  agents  are  composed  of  some  com¬ 
petence  modules.  These  modules  compete  to  accomplish 
the  agent  goals,  and  the  agents  try  to  fulfill  the  global 
goal.  By  the  use  of  some  global  parameters  assigned  to 
the  system,  a  smooth  shift  between  the  following  proper¬ 
ties  can  be  established: 

•  adaptive  <r  biased  towards  ongoing  goals 

•  goal  oriented  <r  situation  relevant 

•  fast  <-  thoughtful 

•  conservative  <r  ->  careless  about  the  goal  conflicts 

The  advantages  of  this  algorithm  are: 

•  the  system  has  a  distributed  search  method 

•  the  system  produces  simultaneous  solutions  so 
there  are  always  fail-back  plans  available 

•  its  performance  can  be  tuned  via  some  global  pa¬ 
rameters 

•  it  is  fault  tolerant. 

Besides  the  advantages,  there  are  some  problems  the 
solution  of  which  are  not  addressed  in  this  approach.  The 
first  being  the  tuning  mechanism  for  the  global  parameters 
(the  settings  of  which  will  effect  the  overall  performance 
of  the  system)  and  the  second  one  is  keeping  a  record  of 
past  actions. 

The  second  attempt  towards  achieving  a  system  that  can 
smoothly  be  shifted  between  deliberate  and  reactive  be¬ 
havior,  which  is  highly  inspired  by  the  work  of  Maes,  is 
done  by  S.  Bagchi.  The  shortcomings  of  the  initial 
Spreading  Activation  Network  is  investigated  in  detail  and 
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a  new  version  of  this  algorithm  (a  probabilistic  approach) 
is  proposed  [7]. 

Spreading  Activation  Networks  rely  on  the  basic  princi¬ 
ple  of  passing  awards  from  the  goals  towards  the  agents, 
which  is  referred  as  “ utility  passes  backward ’  and  the 
agents  apply  as  candidates  for  accomplishing  a  task  de¬ 
pending  on  their  abilities  and  past  successes,  which  is 
referred  as  “ probability  passes  forward \  Figure  1  shows 
the  links  that  connect  the  components  of  a  Spreading  Ac¬ 
tivation  Network  agent.  An  agent  is  composed  of  a  set  of 
preconditions  and  actions.  When  the  preconditions  of  an 
agent  are  all  true,  it  can  initiate  an  action  and  change  the 
current  state  of  the  world.  The  resulting  states  (the  post¬ 
conditions  of  the  agent)  will  affect  the  preconditions  of 
some  other  agents  and  the  system  will  evolve  itself  in  this 
manner. 


ajx  =  aj  was  executed 


Figure  1 .  Representation  of  an  precondition,  action,  post¬ 
condition  relation  (  adapted from  [8] ). 

The  major  difference  of  this  study  with  respect  to  the 
initial  Spreading  Activation  Network  structure  is  that  the 
events  are  handled  in  a  probabilistic  manner  and  some  sort 
of  action  reports  are  stored  to  keep  track  of  the  dynamic 
environment  effects  and  inner  dynamics.  This  not  only 
allows  the  system  to  accumulate  experience  over  time  and 
prevents  the  system  from  repeating  the  same  mistakes 
over  and  over  again  but  also  lets  the  system  understand  its 
own  dynamics  (in  other  words  reliability  and/or  capability 
of  individual  agents  composing  the  SA  network).  These 
properties  eliminate  the  shortcomings  mentioned  for  the 
initial  design  of  the  Spreading  Activation  Networks  and 
makes  it  a  very  good  real  time  algorithm. 

An  important  point  to  note  about  the  SA  mechanism  in¬ 
troduced  so  far  is  the  static  structure  of  the  network.  It  is 
a  pre-wired  network.  It  actually,  is  not  static  in  the  sense 
that,  in  time,  some  links  may  have  0  weights  and  practi¬ 
cally  drop  out  from  the  network,  however,  new  links  are 


not  formed  on  the  fly  depending  on  the  current  state  of  the 
world.  If  a  new  problem  is  to  be  handled,  a  new  network 
design  has  to  be  implemented  by  the  programmer.  The 
obvious  consequence  of  this  is  that,  even  though  the  sys¬ 
tem  can  learn  to  pick  the  best  solution  that  is  present  in  the 
network,  it  can  never  evolve  itself  towards  novel  solu¬ 
tions. 

A  recent  research  on  building  Spreading  Activation 
Agents  by  using  the  IMA  mainframe  has  been  completed 
in  the  IRL  [4].  Currently  this  agent  structure  is  being  used 
in  our  applications. 

2.3.  Sensory  EgoSphere  (SES) 

Albus  defines  EgoSphere  as 
a  two  dimensional  spherical  surface  that  is  a  map  of 
the  world  as  seen  by  an  observer  at  the  center  of  the 
sphere.  Visible  point  on  regions  or  objects  in  the 
world  are  projected  on  the  egosphere  wherever  the 
lines  of  sight  from  a  sensor  at  the  center  of  the 
egosphere  to  points  in  the  world  intersect  the  surface 
of  the  sphere  [1]. 

Albus  suggests  that  a  2D  egosphere  is  sufficiently  in¬ 
formative  for  an  intelligent  agent  to  model  the  sensory 
world.  To  emphasize  its  sensory  function,  we  call  the 
structure  the  Sensory  EgoSphere  (SES)  in  this  text 

Any  sensory  data  that  can  be  localized,  (such  as  the 
video  input  stream  from  an  active  camera  head,  a  direc¬ 
tional  sound  source,  the  distance  to  the  closest  surface, 
etc.)  can  be  stored  on  a  SES.  It  is,  therefore,  a  multireso¬ 
lution,  2DxlD  map  of  the  robot’s  environment  indexed  by 
elevation,  azimuth,  and  time  associated  with  data  specific 
descriptors  (i.e.,  FFT  parameters  or  verbal  descriptors  of 
objects  found  in  an  image  etc.).  From  the  view  point  of 
having  different  data  types  with  different  resolutions, 
egosphere  can  be  viewed  as  layers  of  data  wrapped  around 
the  egosphere  as  illustrated  in  figure  2.  A  sensory  data  set 
of  a  specific  type  at  a  specific  SES  location  can  be  stored 
as  an  object  with  a  timer  that  indicates  its  age.  Objects  at 
a  specific  SES  location  can  be  deleted  from  the  sphere 
after  a  period  of  time  depending  on  the  type  of  data  or  the 
arrival  of  new  up-to-date  sensory  information  can  over¬ 
write  the  older  information  at  the  same  location.  Some 
quick  methods  of  checking  the  validity  of  the  currently 
posted  data  on  the  egosphere  and  the  current  state  of  the 
world  is  essential.  To  fulfill  this,  data  specific  descriptors 
are  needed.  The  descriptors  are  not  only  needed  for 
checking  the  correlation  with  the  current  state  of  the  world 
but  they  are  also  essential  in  establishing  the  links  on 
DBAM  (which  is  introduced  in  the  next  section). 

Structurally,  SES  is  a  multiresolutional  (multi-layered) 
database.  Functionally,  SES  is  a  multi-user  accessible 
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database.  It  can  be  considered  to  be  an  associative  mem¬ 
ory  where  the  association  is  through  proximity.  Visible 
objects  that  have  been  recognized  can  be  labeled  on  the 
SES,  and  the  labels  used  as  search  keys.  Any  of  the 
agents  in  the  robot  can  query  the  database.  For  example, 
if  the  robot  is  trying  to  serve  a  beverage  to  a  person,  a 
possible  search  in  its  workspace  could  be  for  a  coke  can. 
In  the  classical  sense,  this  might  be  performed  through 
visually  searching  the  environment  for  the  distinctive  col¬ 
ors  on  a  coke  can  or  for  cylindrical  objects.  In  our  case, 
by  passing  a  query  for  “coke  can”  to  the  SES  database  the 
robot  can  get  the  most  recent  location  where  a  coke  can 
was  spotted  on  the  egosphere  and  narrow  its  visual  search 
space  significantly.  Later  in  the  paper  we  describe  how 
SES  access  might  be  aided,  through  learning,  to  be  more 
efficient. 


Data  level  1 


Data  level  2 


Data  level  n 


Figure  2.  The  possible  data  levels  on  the  egosphere.  As 
an  example,  Data  level  1  may  contain  sound  data,  data 
level  2  may  contain  sonar  data  and  the  last  level  may 
contain  images  captured  from  the  robots  environment 

The  SES  provides  a  natural  sense  of  spatial  coherence 
and  continuity  to  the  robot  so  that  imprecise  (but  entirely 
natural)  interactions  with  the  user  such  as: 

IS  AC :  I  could  not  find  the  spoon. 

USER:  It  is  right  in  front  of  you  on  the  table 

can  be  made  tractable. 

This  particular  interaction  uses  symbolic  information  — 
object  labels  and  location  cues  —  which  the  robot  can  use 
to  directly  query  the  database  or  to  restrict  its  search  to 
specific  regions  of  the  EgoSphere 

2.4.  Database  Associative  Memory  (DBAM): 

One  important  component  of  our  system  is  an  associa¬ 
tive  memory  that  is  built  on  a  standard  database  architec¬ 
ture.  This  will  permit,  among  other  things,  a  symbol  to 
prime  or  to  restrict  a  search  of  the  SES.  The  DBAM  logi¬ 
cally  subsumes  the  SES  to  provide  spatial  associations 
between  the  objects  that  surround  the  robot.  The  DBAM, 
further,  will  associate  specific  actions  with  certain  objects. 


This  will  enable  the  robot  to  decompose  a  goal  into  the 
fundamental  actions  that  it  can  perform.  Our  current  re¬ 
search  focuses  on  the  spatial  association. 

Whereas  the  SES  provides  spatial  association  —  im¬ 
plicit  association  through  proximity  and  direction  — 
among  the  data  in  it,  other  components  of  the  DBAM  pro¬ 
vide  symbolic  associations.  Our  initial  design  includes  an 
association  lookup  table-  (ALT)  to  keep  track  of  associa¬ 
tions  such  as  the  name  of  an  object  with  its  SES  locations. 
Later,  it  will  probably  be  necessary  to  trace  associations 
through  more  complex  structures. 

Figure  3  illustrates  these  concepts.  The  following  two 
scenarios  may  clarify  how  SES  and  DBAM  will  work  to 
gather. 


Voice 

Command 

processor 

"Coke  Can” 
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Figure  3.  An  instance  form  DBAM  illustrating  symbolic 
and  spatial  association  among  SES  data  and  ALT. 

One  of  the  very  basic  ways  to  start  interaction  with  the 
robot,  is  to  go  near  it  and  talk  to  it  (i.e.,  “Hello  ISAC,  how 
are  you  today?”).This  can  start  a  chain  of  events:  sound 
localization  routines  will  determine  where  the  sound  came 
form,  the  sonar  sensors  will  try  to  locate  where  the  motion 
is  and  the  camera  head  of  the  robot  will  try  to  find  the  face 
of  the  user  and  establish  an  eye-to-eye  contact  with  the 
user.  Also,  the  user  identification  routines  will  try  to 
identify  the  person  and  at  the  end,  a  similar  association  to 
the  one  shown  in  figure  3  can  be  established. 

A  second  situation  that  may  arise  throughout  an  inter¬ 
action  is  to  find  something  in  the  environment  during  the 
interaction  with  a  user.  As  previously  stated,  suppose 
that,  the  robot  is  to  serve  coke  to  the  user.  On  searching 
for  the  coke  can,  the  DBAM  will  provide  the  locations 
where  the  coke  can  was  found  previously  on  the  SES.  As 
previously  mentioned,  each  data  on  SES  will  be  time- 
stamped  and  therefore,  the  DBAM  will  provide  locations 
of  the  coke  can  starting  from  the  most  recently  located 
one. 
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Each  record  in  the  DBAM  has  a  numerical  activation 
level.  This  number  can  be  modified  by  other  agents  such 
as  sensing,  learning,  or  task  control  agents  Each  record  is 
linked  to  others  that  are  in  some  way  related.  The  link  is 
weighted  by  the  activation  level  of  the  record  linked  to. 
The  links  between  some  DBAM  records  can  be  modified 
by  spreading  activation  (SA)  during  task  execution  or 
through  reinforcement  learning  (RL)  conditioned  on  the 
success  or  failure  of  a  task. 

3.  Learning  in  the  SES  and  DBAM 

An  intelligent  system  should  be  able  to  learn  from  its 
own  experience.  It  should  remember  what  it  saw  before 
and  be  able  to  refer  to  previous  events.  It  should  not  re¬ 
peat  mistakes.  It  should  be  able  to  learn  new  skills 
through  observation  and  repetition.  These  ideals  are  far 
from  being  met.  However,  machine  learning  is  currently 
possible;  one  can  design  a  system  that  can  learn  some 
things. 

Non  declarative  memories  can  be  formed  through  trial 
and  error  with  SA  and  RL  to  train  sensory-motor  coordi¬ 
nation  maps  or  task  sequence  specifications.  Such  memo¬ 
ries  enable  the  robot  to  work  smoothly  with  its  environ¬ 
ment  while  take  its  own  physical  structure  to  best  advan¬ 
tage. 

Declarative  memories  are  those  in  which  various  ob¬ 
jects  are  related  to  one  another,  or  in  which  episodes  are 
correlated  and  associated  with  objects.  This  type  of 
memory  also  permits  the  association  of  a  goal  with  the 
purposive  qualities  of  known  objects. 

The  formation  of  a  declarative  memory  requires  two 
phases,  data  acquisition  and  memory  consolidation.  Vari¬ 
ous  sensory  data  is  acquired  as  the  robot  interacts  with  its 
environment.  Some  of  this  information  is  obtained  in  the 
course  of  task  execution  and  is  directly  coupled  to  actions 
performed.  Extraneous  information,  may  also  be  re¬ 
corded.  This  is  information  that  may  not  be  directly  re¬ 
lated  to  the  task  at  hand,  but  might  be  useful  in  other  con¬ 
texts. 

Memory  consolidation  (MC)  is  the  process  whereby  the 
robot  forms  associations  between  recently  acquired  data 
and  previously  stored  memories.  The  process  will  run 
when  the  robot  is  otherwise  unoccupied.  Recent  data  will 
be  compared  to  existing  DBAM  records.  For  example, 
objects  data  from  imagery  could  be  compared  to  existing 
object  descriptions  and  if  recognized,  the  new  objects 
would  be  relabeled.  If  not,  they  could  be  stored  as  new 
records.  Data  acquired  during  a  task  execution  could  be 
replayed  in  sequence  to  form  associations  based  on  tem¬ 
poral  proximity.  The  set  of  such  links  between  data  rec¬ 
ords  would  constitute  a  pattern  which  could  be  compared 
to  Other  learned  data  sequences.  Measures  of  their  simi¬ 


larities  could  be  used  to  link  different  tasks.  The  similari¬ 
ties  between  the  MC  process  just  described  and  dreaming 
in  humans  is  worth  further  study. 

Figure  4  shows  the  bi-directional  data  flow  between  the 
memory  and  the  learning  units.  The  formation  of  associa¬ 
tions  within  the  DBAM  and  SES  is  the  most  critical  part 
of  our  current  research.  SA  provides  learning  over  the 
established  database.  Memory  consolidation 
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Figure  4.  There  is  bi-directional  data  flow  between  SES 
and  SA  network. 

As  a  result,  the  DBAM/SES,  the  SA/RL  network,  and 
the  MC  procedure  provide  a  mapping  from  sensory  inputs 
to  actions,  from  objects  to  objects,  and  from  task  to  task. 

4.  Goal:  The  Shifting  of  Intelligence  from  the 
Programmer  to  the  Robot 

A  limitation  of  current  SA  networks  is  that  they  are  pre¬ 
wired.  New  states  are  not  added;  only  the  probabilities  of 
changing  current  states  are  modified.  One  can  build  a  SA 
network  by  linking  software  modules.  To  enable  the  robot 
to  handle  a  new  task,  a  programmer  can  build  a  network 
with  available  modules.  It  would  be  much  more  useful  if, 
by  examining  associations  in  the  DBAM,  the  SA  network 
could  link  new  states  into  itself. 

An  easy  but  inefficient  solution  to  this  problem  is  to 
form  a  huge  network  wherein  all  available  modules  are 
linked  to  each  other.  Incompatible  module  dynamics 
would  eliminate  many  of  these  links  by  default.  The  sys¬ 
tem  would  have  to  be  run  to  evolve  itself  in  time.  But 
such  a  procedure  would  probably  require  human  feedback 
and  is  likely  to  require  large  time  intervals.  It  also  re¬ 
quires  a  good,  if  not  complete,  knowledge  of  all  the  pres¬ 
ent  modules.  Hence  the  obvious  solution  is  inefficient, 
lacks  scalability,  and  precludes  the  introduction  of  new 
computational  modules. 

To  create  a  dynamically  linked  SA  network  would  re¬ 
quire 

•  an  inter-modular  communication  schema 

•  a  command  to  action  mapping  mechanism 

The  inter-modular  communication  enables  the  system  to 
know  itself.  There  are  languages  present  that  are  designed 
to  manage  inter-agent  communication  such  as  KQML  [9]. 
Such  a  language  enables  software  modules  to  communi- 
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cate  their  objects,  and  pre-  and  post-  execution  conditions 
to  each  other.  It  also  should  possess  a  descriptor  grammar 
that  enables  one  module  to  understand  the  actions  of  an¬ 
other. 

Our  approach  to  system  development  at  the  IRL  does 
rely  on  the  systems  ability  to  learn  everything  from  expe¬ 
rience.  Nor  are  we  attempting  to  create  a  high-level  sym¬ 
bolic  AI.  Instead,  we  approach  the  development  toward 
both  ends  from  the  center.  We  are  designing  cognitive 
structures  that  will  enable  the  robot  to  make  immediate 
use  of  domain  knowledge  and  specific  action  plans,  yet 
enable  it  to  react  to  unforeseen  occurrences  and  to  evalu¬ 
ate  outcomes  to  modify  both  knowledge  and  plans. 

5.  Conclusion 

Current  research  on  robotic  control  systems  performed 
at  the  Intelligent  Robotics  Laboratory  of  the  Center  for 
Intelligent  Systems  at  Vanderbilt  University  was  de¬ 
scribed.  A  system  including  a  Database  Associative 
Memory,  a  Sensory  EgoEphere,  a  Spreading  Activation 
network  that  uses  Reinforcement  Learning,  and  a  Memory 
Consolidation  procedure  are  being  designed.  It  is  thought 
that  this  system  can  provide  a  robot  with  a  sensory-input 
to  action-output  mapping.  The  structure  of  system  is  de¬ 
scribed  and  future  research  proposed  to  enable  the  system 
to  learn  autonomously. 
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Abstract. 

This  paper  describes  the  design  of  an  accelerometer 
capable  of  measuring  the  linear  and  angular 
accelerations  in  a  cartesian  system .  To  arrive  to  the  final 
dimensions  and  geometry ,  which  will  satisfy  the 
requirement  under  fabrication  constraints,  an  iterative 
finite  elements  structural  analysis  was  performed.  Also , 
modal  analysis  of  the  proposed  structure  was  performed 
in  order  to  determine  the  best  locations  for  capacitance 
sensing  pads.  The  stiffness  matrix  of  the  sensor's 
structure  was  found  and  used  to  determine  its  theoretical 
capabilities. 

It  was  found  that  the  sensor  has  a  high 
accuracy  if  used  to  measure  the  three  components  of 
linear  accelerations  (average  detection  error  0. 02%),  or 
only  the  three  components  of  angular  ones  (average 
detection  error  0.44%).  However,  when  all  six 
acceleration's  components  have  to  be  measured,  the 
average  error  increases  to  about  3%. 

1.  Introduction 

Some  recent  accelerometers  are  designed  and 
fabricated  using  Micro-Electro-Mechanical-Systems 
(MEMS)  technology.  Practically  all  MEMS 
accelerometers  consist  on  a  mass  attached  to  the  substrate 
through  a  flexible  structure.  The  displacement 
measurement  of  the  mass  and  the  stiffness  of  the 
structure  are  used  to  determine  the  magnitude  of  the 
acceleration. 

Generally  the  structure  is  composed  of  folded  beam 
in  order  to  reduce  the  device  size  and  due  to  fabrication 
constraints.  A  capacitance  transducer  where  one  plate  is 
fixed  and  the  other  is  attached  to  the  mass  measures  the 
mass  displacement. 

The  goal  of  designing  six  DOF  accelerometers  has 
been  driven  by  the  fact  that  existing  accelerometers  are 


uniaxial  -  detect  acceleration  along  one  axis.  Moreover, 
no  existing  accelerometer  architecture  is  also  capable  of 
detecting  both  linear  and  angular  loads  at  the  same  time. 

Analog  Devices  fabricates  several  models  of  single¬ 
axis  accelerometers  [4],  One  of  its  products  features  twin 
sensors,  perpendicular  to  each  other,  fabricated  on  the 
same  die,  which  make  it  possible  to  detect  accelerations 
in  both  the  x  and  y  directions[5]. 

Neuw  Ghent  Technology  Inc.  and  Crossbow  Inc. 
associate  up  to  three  single-axis  sensors  to  have  a  multi¬ 
axis  detection  device  [7,11]. 

2.  Sensor  Geometry 

MEMS  fabrication  technology  is  limited  to  two 
dimensional  structures.  Thus,  in  order  to  detect 
accelerations  in  the  three  perpendicular  directions,  the 
structure  behavior  along  the  third  axis  (perpendicular  to 
the  substrate  plane)  should  be  analyzed. 


Fig.l.  Analog  Devices  ADXL50’s  accelerometer. 
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The  geometry  of  the  proposed  sensor  consists  of 
central  movable  mass  linked  to  the  substrate  by  eight 
tethers,  four  for  each  detection  axis,  as  shown  in  Figure 

2.  The  dimensions  of  the  sensors  were  constrained  by  the 
MEMS  Design  Rules,  which  are  dictated  by  the 
micromachining  fabrication  process.  These  design  rules 
are  defined  by  the  resolution  and  alignment  capabilities 
of  the  lithography  system.  Generally,  they  define  the 
minimum  features  sizes  and  spaces,  and  the  minimum 
overlap  and  spacing  between  relevant  levels.  The  rules 
affecting  the  sensor  geometry  are  detailed  in  the  Table  1. 

3.  Structure  Analysis 

The  ADXL181  accelerometer  is  rated  +500g,  but  in 
order  to  insure  its  reliability  it  was  tested  under 
mechanical  shocks  and  drops  with  peak  acceleration  of 
1500g  and  4000g  respectively. 


Table  1:  Design  rules  effecting  the  sensor’s  geometry 


Parameter 

Minimum  or  Mandatory 
Value  [pm] 

Tethers  width 

2.0 

Anchors  height 

2.0 

Structure  thickness 

2.0  or  3.5 

Polyl  enclose  Anchor  1 

4.0 

PolyO  enclose  Polyl 

4.6 

Similarly,  the  sensor  was  designed  to  withstand 
acceleration,  in  all  three  axes,  up  o  3000g.  At  this  level 
of  loading  the  stress  should  be  below  the  fracture  stress  of 
polysilicon  (3GPa  [3]),  and  the  deflection  in  Z  direction 
should  be  below  2  (the  layer  thickness).  After  few  design 
iterations  the  sensor’s  dimensions  were  finalized  as 
shown  in  Figure  2.  Note  the  thickness  of  the  sensor  is 
3.5pm  that  correspond  to  double  layer  of  polysilicon. 

A  finite  elements  program,  ANSYS,  was  used  to 
analyze  the  structure.  The  model  consisted  of  1267  4 
nodes  solid  pyramid  elements  with  6  DOF.  It  was  found 
that  at  3000g  loading  the  maximum  deflection  in  the 
sensor’s  plane  (X-Y)  was  0.005pm  and  0.015pm  in  the 
perpendicular  direction  to  its  plane  (Z).  These  results  are 
in  the  same  order  magnitude  of  the  commercial  sensor 
described  above.  Also,  maximum  principal  stress  was 
under  0.5%  the  fracture  strength  of  polysilicon. 

4.  Location  of  Capacitance  Pads 

The  best  locations  for  the  capacitance  pads  were 
determined  by  modal  analysis  of  the  structure.  The  first 


six  mode  shapes  and  the  corresponding  frequencies  are 
given  in  Table  2  and  are  shown  in  Figure  3. 


To  detect  the  six  components  of  an  acceleration 
vector,  minimum  of  six  measurements  are  required, 
which  in  imply  six  capacitance  pads.  However,  in  order 
to  detect  angular  acceleration  about  the  X  or  Y  axes  it  is 
necessary  to  use  at  least  two  sensing  pads  as  illustrated  in 
Figure  4. 

The  sensor  has  seven  sensing  pads  as  illustrated  in 
Figure  5. 


Table  2.  First  six  mode  shapes  of  the  sensor. 


Mode 

Frequency 

fkHz] 

Description 

1 

37.547 

Deflection  in  Z  direction 

2 

55.693 

Rotation  about  Y  axis 

3 

61.096 

Deflection  in  Y  direction 

4 

63.032 

Deflection  in  X  direction 

5 

74.681 

Rotation  about  Z  axis 

6 

102.132 

Rotation  about  X  axis 
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5.  Sensor’s  Stiffness  Matrix 


Fig.  3.  Structure  mode  shapes. 


Fig.  4.  Use  of  two  pads  for  angular  acceleration 
measurements. 


First  the  compliance  the  matrix,  C,  of  the  sensor  was 
determined  by  calculating  a  combination  of  deflections  at 
the  detection  pad  locations  while  the  sensor  is  exposed  to 
a  unit  acceleration  in  a  single  direction.  This  will 
produce  one  column  of  the  matrix  as  shown  in  Eq.  1. 


X' 

~cn 

0 

0 

0 

0 

o' 

- 1 

£ 

X 

_ 1 

C21 

0 

0 

0 

0 

0 

0 

8. 

C31 

0 

0 

0 

0 

0 

0 

K 

Q. 

0 

0 

0 

0 

0 

0 

K 

c„ 

0 

0 

0 

0 

0 

0 

K 

Qi 

0 

0 

0 

0 

0 

0 

The  entries  5  and  X9  which  corresponds  to  linear  and 
angular  acceleration  at  the  prospective  directions,  are 
given  below: 


(2) 

S,=d, 

(3) 

=  ~(^2  +  ^4  +^s) 

(4) 

K =|(<W.) 

(5) 

(d,-dt) 

(6) 

K=\(dt-d,) 

(7) 

The  terms  d*  i=1..7  are  the  displacement  at  the  pads’ 
locations,  calculated  as  the  average  of  the  displacements 
of  all  pad’s  nodes. 

The  stiffness  matrix  was  obtained  by  taking  the 
inverse  of  the  compliance  matrix  C.  As  indication  of  the 
separation  between  the  acceleration  components,  the 
condition  number  of  the  stiffness  matrix  was  determined. 
The  condition  number  found  to  be  equal  to  145.60. 
Lower  values  of  the  condition  number  indicate  better 
sensor.  To  lower  the  condition  number  of  the  stiffness 
matrix  additional  pads  should  be  considered  [1]. 
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6.  Estimation  of  Detection  Errors 

To  characterize  the  performance  of  the  sensor,  once 
the  stiffness  matrix  was  determined,  the  sensor  was 
exposed  to  known  accelerations,  the  displacements  at  the 
pads  were  calculated  and  used  to  determine  the 
acceleration  using  the  stiffness  matrix. 

Three  different  cases  were  considered  for  this 
comparison : 

L  Detection  of  both  linear  and  angular  acceleration 
components  (6  outputs). 

2.  Detection  only  linear  acceleration  components  (3 
outputs). 

3.  Detection  of  only  angular  acceleration  components 
(3  outputs). 

For  the  first  case,  linear  accelerations  in  the  range  of 
0-500g  and  angular  acceleration  in  the  range  of  0-1E6 
rad/s2  were  applied.  Nine  different  loading  were  used, 
which  resulted  in  an  average  detection  error  of  1.375% 
with  a  maximum  value  of  17.3%  for  the  detection  of  the 
angular  acceleration  component  about  the  X  axis. 

For  the  second  case,  the  same  range  of  linear 
accelerations  were  applied  in  nine  different  loading 
which  resulted  in  an  average  detection  error  of  0.02%. 

Similarly,  nine  three-component  angular 
accelerations  were  applied  on  the  model,  which  resulted 
in  an  average  detection  error  of  0.44%. 

7.  Conclusions 

The  structure  of  micromachined  polysilicon  sensor 
capable  of  detecting  both  linear  and  angular 
accelerations,  was  proposed. 

Finite  element  solution  of  the  sensor’s  model 
indicate  that  it  is  possible  to  detect  a  full  acceleration 
vector  with  accuracy  of  few  percents.  This  performance 
can  be  improved  by  adding  more  sensing  pads. 

Eventually,  the  structure  has  to  be  fabricated  with  all 
supporting  circuit  and  then  tested  in  order  to  evaluate  its 
performance.  But  the  simulation  results  presented  here  is 
a  good  starting  point  for  the  design-fabrication  process. 


8,  References 

[1]  C.  M.  Subramanian,  “  Development  of  a  graphical  user 
interface  for  a  Stewart  platform.”  Master  of  Science  and 
Engineering  Degree  Thesis,  Dept.  Mech.  Eng.,  Florida  Atlantic 
University,  pp  50-52,  July  1993. 

[2]  G.  de  Jong,  “Smart  Capacitive  Sensors,  Physical, 
Geometrical  and  Electronic  Aspects”,  Delft  University  Press - 
2628  CN  Delft,  The  Netherlands,  1994. 

[3]  S.  Bart,  J.  Chang,  T.  Core,  L.  Foster,  A.  Olney,  S.  Sherman, 
W.  Tsang,  “Design  Rules  for  a  Reliable  Surface 
Micromachined  IC  Sensor”,  1995  International  Reliability 
Physics  Proceedings,  IEEE  Catalog  No.  95CH3471-0. 

[4]  “Accelerometer  Data  Sheets,  Application  Notes  and 
Worldwide  Sales  Directory”,  ©Analog  Devices  Inc.,  1995. 

[5]  H.  Samuels,  “Single  and  Dual-Axis  Micromachined 
Accelerometers”,  Analog  Dialogue,  Volume  30,  Number  4 
pp3-7,  1996. 

[6]  D.  A.  Koester,  R.  Mahadevan,  K.  W.  Markus,  “Multi-User 
MEMS  Processes,  Introduction  and  Design  Rules”,  MCNC, 
MEMS  Technology  Applications  Center,  Research  Triangle 
Park,  North  Carolina  27709,  October  1994,  Rev.3. 

[7]  “Low  g  Accelerometer  Solutions”,  Neuw  Ghent 
Technology,  LaGrangeville,  NY  12540-5705. 

[8]  R.  Allan,  “Silicon  MEMS  Technology  is  coming  of  age 
commercially”.  Electronic  Design,  pp75-88,  January  1997. 

[9]  T.  W.  Kenny,  H.  K.  Rockstad,  J.  K.  Reynolds,  W.  J.  Kaiser, 
“Technical  Support  Package  on  Dual-Element  Tunneling 
Accelerometer  with  Dual  Feedback”,  NASA  Tech  Brief,  Vol.21, 
No. 2,  Item# 98,  February  97. 

[10]  P.  M.  Echtemach,  U.  E.  Israelsson,  “Technical  Support 
Package  on  Micromachined  Cryogenic  Capacitive  Pressure 
Transducers”,  NASA  Tech  Brief,  Vol.21,  No. 2,  Item#68, 
February  97. 

[11]  “CXL04M3  Triaxial  accelerometer”.  Crossbow 
Technology  Inc,  Berkeley,  CA  94708. 

[12]  T.  A.  Core,  W.  K.  Tsang,  S.  J.  Sherman,  “Fabrication 
Technology  for  an  Integrated  Surface-Micromachined  Sensor”, 
Solid  State  Technology,  October  1 993. 

[13]  J.  Bryzek,  K.  Petersen,  W.  Me  Culley,  “Micromachines  on 
the  march”,  IEEE  Spectrum,  pp20-31,  May  1994. 

[14]  B.  Riedel,  “A  Surface-Micromachined,  Monolithic 
Accelerometer”,  Analog  Dialogue,  Volume  27,  Number  2  pp3- 
7,  1993. 

[15]  E.  Hoffman,  B.  Wameke,  E.  Kruglick,  J.  Weigold,  K.  S.  J. 
Pister,  “3D  Structures  with  Piezoresistive  Sensors  in  Standard 
CMOS”,  Presented  at  MEMS  ' 95 ,  Amsterdam,  The 
Netherlands,  UCLA  MEMS  TRN  95-1,  December  94. 

[16]  Jarlath  McEnthee,  “Properties  of  Polysilicon”, 
http://bell.  mma.  ed u/ -fmcen  t/siliconprop.  htm. 


202 


The  2nd  International  Conference  on  Recent  Advances 
in  Mechatronics,  Istanbul,  Turkey,  May  24-26,  1999 


©  UNESCO  Chair  on  Mechatronics 
Bogaz^i  University,  Istanbul,  Turkey 


Three-Dimensional  Target  Tracking  Using  An  Adjustable  Infrared  Sensor 

Configuration 


Ali  Safak  Sekmen,  Mark  Cambron,  and  Mitch  Wilkes 


Intelligent  Robotics  Laboratory 
Department  of  Electrical  and  Computer  Engineering 
Vanderbilt  University ,  Nashville ,  TN  37235 
Sekmen ,  mac ,  wilkes  @  vuse.  vanderbilt .  edu 


Abstract 

A  low  cost  adjustable  infrared  sensor  configuration 
consisting  of  five  identical  passive  infrared  motion 
detectors  has  been  added  to  a  service  robot ,  ISAC.  The 
intersections  and  unions  of  the  motion  detectors  are  used 
to  track  a  moving  object  in  three  dimensions.  Each 
motion  detector  is  placed  on  a  5-inch  long  square  piece  of 
wood .  The  infrared  motion  detectors  produce  a  5V  output 
when  they  detect  a  moving  object  in  three  dimensions. 
Otherwise ,  an  output  of  OV  is  produced.  ISAC  has  a 
camera  head  containing  two  cameras.  Four  bits  of 
information  are  extracted  from  the  measurements.  These 
are  the  tilt  and  pan  angles  of  the  camera  head  of  ISAC 
and  pan  angles  of  the  left  and  right  cameras  of  the 
camera  head. 

1.  Introduction 

For  a  service  robot,  it  is  very  important  to  be  able  to 
track  the  person  it  serves.  Moreover,  it  may  be  serving 
more  than  one  person.  In  this  paper,  an  infrared  sensor 
configuration  for  target  tracking  is  introduced.  This 
configuration  has  been  added  to  a  service  robot  called 
ISAC  [1,2]. 

In  robotics,  acoustic  sensors  are  generally  used  for 
applications  such  as  robot  navigation  [3,4],  target  tracking 
[5,6],  obstacle  avoidance  [7],  and  mapping  [8,9].  They 
provide  a  simple  means  of  determining  locations  of 
objects.  Kuc  did  three-dimensional  tracking  using 
qualitative  bionic  sonar  [5].  However,  sonar  sensors  have 
some  limitations.  For  example,  their  high  beamwidth 
makes  it  difficult  to  localize  objects  correctly.  In 
addition,  multiple  reflections  are  sometimes  difficult  to 
interpret.  When  infrared  sensors  are  used  alone,  it  is  not 
possible  to  obtain  sharp  position  information.  Since  both 
infrared  and  sonar  sensors  have  their  own  disadvantages, 
Flynn  combined  sonar  and  infrared  sensors  for  mobile 
robot  navigation  [10]. 


The  importance  of  target  tracking  is  also  evident  in 
videoconferencing  systems.  Cameras  in  current 
videoconferencing  systems  are  generally  controlled 
manually  by  the  user.  Voice  source  localization  is  often 
used  to  solve  this  problem.  However,  if  the  speaker  moves 
but  does  not  talk,  then  there  may  be  a  delay  for  the 
camera  to  track  the  speaker.  Sound  localization  has  other 
problems  such  as  noise  and  reverberation,  and  the 
influence  of  specular  reflection  from  the  flat  surfaces  of 
furniture  such  as  a  table  [11]. 

ISAC  (shown  in  Figure  1)  is  a  service  robot.  It  has  a 
camera  head  consisting  of  two  cameras  (left  and  right). 
The  sensor  configuration  is  placed  on  the  body  of  ISAC. 
An  advantage  of  this  sensor  configuration  is  that  it  does 
not  only  use  the  intersections  of  the  beam  patterns  it 
also  uses  the  union  of  them.  In  this  study,  we  show  how 
an  adjustable  infrared  sensor  configuration  can  be  used  to 
obtain  some  position  information. 

In  the  next  section,  the  system  configuration  is 
described.  Section  3  describes  the  method  for  the  pan 
and  tilt  calculations  of  the  camera  head  and  left  and 
right  cameras  of  the  head.  Section  4  explains  the  vision 
system  of  ISAC.  In  Section  5,  the  tracking  results  for  one 
and  two  moving  objects  are  presented.  In  Section  6, 
some  conclusions  are  given. 

2.  System  Configuration 

The  transducer  in  our  infrared  system  is  a  78O0-KT 
PIR  detector  which  is  a  sensor  module  developed  for 
human  body  detection.  It  can  detect  the  movement  of 
body  heat  in  the  horizontal  direction,  up  to  190  cm  aw2y 
with  nearly  1 80°  field  of  view.  The  sensor  configuration 
is  shown  in  Figures  2  and  3.  Note  that  the  angles  between 
the  wood  pieces  can  be  adjusted. 
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Figure  1.  Dual  arm  humanoid  robot  (ISAC). 


Figure  2.  Top  view  of  sensor  configuration. 


Figure  3.  Adjustable  sensor  configuration. 


Figure  4  illustrates  the  range  of  one  of  the  detectors. 
Its  detection  region  can  be  thought  as  a  half  elliptic 
surface  with  a  horizontal  length  of  190cm  and  a  vertical 
length  of  75cm.  The  vertical  range  is  not  shown  in  Figure 
4.  Each  detector  produces  5V  when  a  body  moves  in 
its  range  otherwise  the  output  is  0V. 
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Figure  4.  Range  of  one  of  detectors  (top  view). 

3.  Method  For  Tracking 

Figure  5  illustrates  the  intersections  and  unions  of  the 
ranges  of  the  detectors.  The  main  idea  is  to  decide  which 
sensors  can  detect  a  human  body  in  which  regions.  Table 
1  shows  the  regions  and  the  transducers  that  are  active  in 
these  regions. 


Figure  5.  Union  of  sensitivity  regions. 

One  interesting  point  derived  from  Table  1  is  that 
there  are  some  situations  in  which  there  cannot  be  only 
one  moving  object.  For  example,  if  there  is  only  one 
person,  there  is  no  way  for  sensors  I-V,  I-IV,  I-III,  II-V, 
IMV,  III-V,  I-II-V,  I-II-IV,  I-III-IV,  I-III-V,  I-IV-V,  II- 

III- V,  II-IV-V,  MI-IV-V,  I-II-III-IV-V  to  be  activated 
simultaneously.  We  call  these  regions  impossibility 
regions.  In  these  cases,  it  means  there  are  two  or  more 
people  moving  around.  Since  ISAC  has  two  cameras  (left 
and  right)  we  can  easily  calculate  the  angles,  that  the 
cameras  must  turn  in  those  regions.  For  example,  if  I-II- 

IV-  V  are  active  simultaneously,  we  assume  there  are  two 
people  and  one  of  them  is  in  Region  2  (I  and  II  active), 
and  the  other  is  in  Region  14  (IV  and  V  are  active). 
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Table  1.  Active  detectors  in  specified  regions. 


REGION 

DETECTORS 

1 

1 

2 

1-11 

3 

II 

4 

I-II-III 

5 

I-II-III-IV 

6 

II-III 

7  1 

II-III-IV 

8 

II-III 

9 

III 

10 

III-IV 

11 

II-III-IV-V 

12 

III-IV- V 

13 

III-IV 

14 

1V-V  1 

15 

IV 

16 

V 

Note  that  the  sensor  configuration  and  the  camera  head 
are  not  on  the  same  level.  According  to  Table  1,  the  pan 
angles  of  the  cameras  and  the  pan  and  tilt  angles  of  the 
camera  head  are  calculated.  The  camera  head  is  pointed  to 
the  center  of  the  region  in  which  the  person  is  detected 
For  example,  when  the  person  moves  from  Region  4  to 
Region  3,  the  pan  angle  of  the  camera  head  does  not 
change  whereas  the  tilt  angle  decreases.  In  some  regions 
the  pan  angles  of  the  right  and  left  cameras  are  the  same 
(e.g.  Region  4)  while  in  some  regions  they  are  different 
(e.g.  Region  1).  For  example,  for  Region  1,  the  camera 
head  is  pointed  to  the  center  (by  adjusting  the  pan  and  tilt 
angles  of  the  camera  head),  the  right  camera  is  pointed  to 
the  upper  section  of  Region  1,  and  the  left  camera  is 
pointed  to  the  lower  section. 

4.  Camera  Fixation 

Vision  is  often  necessary  for  robots  that  work  with  and 
around  humans.  The  active  vision  system  locates  a  face 
from  stereo  images  and  fixates  upon  it.  Fixation  is  the 
process  of  centering  each  camera  on  a  target  point  (a  face 
in  this  example).  This  behavior  is  similar  to  that  of  a 
human  turning  their  head  as  someone  walks  in  front  of 
him  or  her.  An  active  vision  system  solves  many  vision 
problems  [12].  In  particular,  the  tracking  of  an  object  in 
stereo  facilitates  segmentation  and  distance  estimation 
while  greatly  reducing  image  clutter. 


PAN 


Figure  6.  Vision  System. 

The  active  vision  system  (Figure  6)  is  composed  of  a  4 
degree  of  freedom  camera  head  and  2  color  CCD  cameras. 
Figure  6  shows  a  simple  schematic  illustrating  the  axes  of 
movement  of  the  camera  head.  The  4  DOF  are  pan,  tilt, 
left  verge,  and  right  verge. 

The  union  and  intersection  of  the  detectors  give  a 
location  area  in  which  the  person  is  located  (Section  3). 
The  cameras  then  move  to  a  predetermined  location  in 
which  the  cameras  are  looking  to  the  center  of  the  given 
area.  At  this  point  a  human  tracking  behavior  takes  over 
control  of  the  pan  tilt  head.  It  is  desirable  to  fixate  upon 
the  human  and  look  them  eye-to-eye  (or  at  least  camera- 
to-eye). 

The  goal  of  the  color  module  is  to  help  locate  a  face  in 
a  set  of  stereo  color  images  and  guide  the  camera  head  to 
center  it  in  both  cameras.  To  accomplish  this,  skin  tone 
color  models  were  created.  The  color  segmentation 
routine  separates  pixels  into  candidate  skin-tone  pixels 
and  non-skin-tone  pixels.  Pixels  are  segmented  by 
determining  if  the  fall  within  a  pre-defined  RGB  color 
model  space  [13].  From  this  process  a  binary  image 
mask  is  created. 

Given  the  location  of  the  skin-tone  centroid  in  the 
mask  image,  the  camera  tracker  then  moves  the  camera 
head  to  guide  the  centroid  towards  the  center  of  the  image. 
The  cameras  move  according  to  a  direction  vector 
generated  by  the  distance  from  the  skin-tone  centroid  to 
the  center  of  the  image  view.  The  amount  of  the  cameras’ 
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movement  is  proportional  to  the  magnitude  of  the 
direction  vector.  The  y-component  of  the  direction  vector 
controls  the  tilt  motor  and  the  x-component  controls  the 
verge  motors.  Once  the  target  has  reached  the  center,  the 
tracker  is  acknowledged  that  the  target  has  been  fixated 
and  then  stops  moving  camera  head.  Once  a  skin  color 
blob  is  fixated  upon  we  check  for  faces  by  using  a 
template  matching  routine  [14].  A  confidence  measure  is 
used  to  indicate  the  success  of  the  fixation  routine. 

5.  System  Performance 

The  signals  from  the  sensor  configuration  were 
sampled  every  2  seconds.  The  results  of  tracking  one 
person  were  very  good.  The  person  was  always  in  the 
view  range  of  at  least  one  of  the  cameras.  The  cameras 
tracked  the  person  with  100%  accuracy  in  all  regions 
except  for  regions  5  and  1 1  of  which  the  boundaries  are 
not  sharp.  When  there  were  two  persons  in  the 
impossibility  regions,  the  results  were  satisfying.  One  of 
the  cameras  looked  at  one  person  while  the  other  looked 
at  the  other  person.  When  the  people  were  outside  the 
impossibility  regions,  the  results  were  not  so  good  since 
there  are  some  regions  in  which  it  is  not  possible  to  be 
able  to  track  two  persons.  For  example,  if  one  of  the 
people  is  in  Region  1  while  the  other  is  in  Region  2,  the 
two  cameras  are  going  to  look  at  Region  2  since  the  first 
and  second  detectors  are  activated  simultaneously. 

6.  Conclusion 

An  adjustable  passive  infrared  sonar  configuration 
has  been  described  by  which  ISAC  can  track  one  or  more 
persons  by  calculating  the  camera  angles  according  to  the 
qualitative  data  coming  from  this  configuration.  The 
system  performance  can  be  improved  by  increasing  the 
number  of  detectors  and  the  angles  between  the  detectors. 
If  a  circular  configuration  is  used  with  a  large  number  of 
detectors,  then  ISAC  can  track  a  person  moving  around  it. 
Moreover,  if  different  kinds  of  detectors  that  have  greater 
range  are  used  in  the  configuration,  then  the  system  can 
be  extended  to  be  used  in  videoconferencing  systems. 
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Abstract 

Although  acoustic  sensors  are  inexpensive  and 
convenient  to  use  in  time-of-flight  (TOF)  ranging 
systems ,  they  have  some  important  limitations  one  of 
which  is  their  low  angular  resolution  that  makes 
localization  difficult.  In  this  paper ,  an  adaptive  multi¬ 
sensor  sonar  system  is  introduced  to  compensate  for  the 
low  angular  resolution  and  improve  the  localization 
accuracy.  The  sensor  configuration  is  used  to  estimate 
the  radius  of  curvature  and  location  of  cylindrical 
objects .  Sensitivity  analysis  of  the  curvature  estimate 
with  respect  to  some  measurement  errors  and  certain 
system  parameters  is  provided.  Two  methods  of  TOF 
estimation  are  used  and  compared:  thresholding  and 
curve-fitting  methods.  Theory  and  simulations  are 
verified  by  experimental  data  from  a  real  sonar  system. 
The  adaptive  configuration  improves  the  estimates  by 
35-45%  and  the  curve  fitting  method ,  compared  to  the 
thresholding ,  brings  an  improvement  of  about  30%  in 
the  absence  of  noise  and  50%  in  the  presence  of  noise. 
The  radius  of  curvature  estimation  is  shown  to  be  useful 
for  target  discrimination. 

1.  Introduction 

One  of  the  major  limitations  of  acoustic  sensors  is 
their  high  beamwidth  that  reduces  the  localization 
accuracy.  An  adaptive  sonar  sensor  configuration 
consisting  of  three  transmitter/receiver  ultrasonic 
transducers  is  used  to  increase  the  localization  accuracy. 
With  this  configuration,  the  radius  of  curvature  and 
location  of  cylindrical  objects  are  estimated.  The  radius 
of  curvature  estimate  can  be  used  to  differentiate  basic 
types  of  reflectors.  For  large  values  of  radius,  the  target 
can  be  assumed  to  be  a  planar  wall,  and  for  values  close 
to  zero,  it  can  be  assumed  to  be  an  edge. 

Sonar  sensors  have  been  widely  used  in  robotics 
applications  including  underwater  robotics.  Several 
researchers  used  different  sensor  configurations.  Kuc 
used  a  system  which  adaptively  changes  its  position  and 
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configuration  in  response  to  the  echoes  it  detects  [1].  In 
[2],  Kleeman  and  Kuc  classified  target  primitives  as 
plane,  corner,  edge,  and  unknown.  Flynn  combined 
infrared  and  sonar  sensors  to  compensate  for  the  high 
beamwidth  of  sonar  sensors  [3].  In  [4],  Sabattini 
illustrated  that  advanced  filtering  techniques  are  required 
for  making  sonar  data  more  accurate  and  reliable. 
Peremans  et  al  used  a  linear  configuration  with  three 
ultrasonic  transducers  [5]. 

When  the  reflection  point  of  the  object  is  not  along 
the  line-of-sight  (LOS)  of  the  ultrasonic  transducer, 
there  is  an  exponential  decline  in  the  amplitude  of  the 
reflected  sonar  signal  which  decreases  the  signal-to- 
noise  (SNR).  Depending  on  the  orientation  of  the  target, 
the  transducers  of  the  configuration  are  rotated  towards 
the  target  to  obtain  more  accurate  estimates. 

In  Section  2,  the  adaptive  sensor  configuration  is 
described  and  the  reason  to  use  this  configuration  is 
explained.  The  radius  of  curvature  and  location 
estimation  algorithm  is  presented  in  Section  3.  Two 
methods  of  TOF  are  described  in  Section  4.  Section  5 
provides  sensitivity  artalysis  of  curvature  estimation.  The 
simulation  results  are  presented  in  Section  6.  In  section 
7,  the  experimental  results  are  illustrated.  Finally,  some 
conclusions  are  drawn  in  Section  8. 

2.  Multi-sensor  Sonar  Configuration 

For  a  cylindrical  target  at  range  z  and  making  an 
angle  a  with  the  LOS  of  an  ultrasonic  transducer,  the 
received  time  signal  reflected  by  the  target  is  a 
sinusoidal  enveloped  by  a  Gaussian  which  is  given  by 
[6]: 

^/2  _£i 

S!Jt)  =  pr  °2"e  2af  sin[2tf.(/-/.)](l) 

where  z  is  the  distance  between  the  transducer  and  the 
surface  of  the  object,  pc  is  the  reflection  coefficient 
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which  increases  with  the  radius  of  curvature,  Am„x  is  the 
maximum  amplitude,  zmin  =a2  /  X  (a  is  the  radius  of  the 
transducer  aperture),  a  is  the  deviation  angle  from  the 
LOS,  oa  =  ao  1 2  ( is  the  half  beamwidth  angle),  to  is 
the  time-of-flight,  /o  is  the  resonant  frequency,  and 

o,=\/f0.' 

Equation  (1)  shows  that  when  the  deviation  angle 
between  the  object  and  sensor  is  not  zero  ( a  *  0 ),  there 
is  an  exponential  decline  in  the  amplitude  of  the 
reflected  sonar  signal.  Hence,  as  the  deviation  angle 
increases,  SNR  decreases.  Therefore,  sonar  data  is  most 
reliable  when  the  target  lies  along  the  LOS  of  the 
transducer,  and  at  nearby  ranges  due  to  the  l/z3/2term 
in  Equation  (1).  In  this  study,  a  sensor  configuration 
composed  of  three  ultrasonic  transducers  is  used  as 
shown  in  Figure  1.  First,  linear  configuration  of  these 
sensors  are  used  and  some  measurement  are  taken,  and 
then,  the  transducers  are  rotated  around  their  center 
according  to  these  measurements  to  make  them 
perpendicular  to  the  target. 


Figure  1.  The  sensor  and  target  configuration. 

3.  Algorithm 

A  cylindrical  object  with  radius  R  and  orientation  6 
is  considered.  The  radius  of  curvature  (/?),  the  distance 
between  the  central  transducer  and  the  center  of  the 
target  (r  =  h  +  /?),  and  the  deviation  angle  of  the 
central  transducer  (6)  are  estimated  as  follows: 

•  The  following  measurements  are  taken  by  the 
transducers: 


r  ct 

h  =  — =  r-R  +  n 
2 

hr  =  -2drsin0  -  R  +  nr  (2) 

h,-  —  =  Vr2  -\-d2  +  2drsin0  - R  +  n. 

7  2  7 

where  ho ,  hr>  and  h,  are  the  distances  between  the 
surface  of  the  object  and  the  central,  right,  left 
transducers  respectively.  ta,tr,  and  t}  are  the  TOFs  for 
the  central,  right,  and  left  transducer  respectively,  d  is 
the  transducer  separation.  no ,  nr ,  and  are  noise  terms 
that  can  be  modeled  as  spatially  uncorrelated  Gaussian 
noise  since  the  noise  correlation  coefficient  is  small  for 
the  acoustic  transducers  (most  of  the  noise  on  the 
sensors  is  dominated  by  the  thermal  noise)  [5]. 

•  The  probability  density  function  of  the  measurement 
vector  z  is  given  as: 

Pi  =  (3) 

where  the  vectors  z,  h(r,6,R) ,  and  the  error  correlation 
matrix  C  are  given  by 


z  = 


ho' 

r-R 

1 

hr 

,  h(r,&,  R)  = 

^Jr2  +d2  -2drsin  0  -R 

h, 

^r2  +d2  +2dr  sin  6  -R 

o 

o 

V 

C  = 

o 

o 

i 

o 

o 

Q 

1 _ 

(4) 


(5) 


The  r,6,and  R  values  maximizing  Equation  (3)  are 
the  maximum  likelihood  estimates  which  can  be 
found  by  taking  the  inverse  of  z  =  h(r,  6 ,  R)  as: 

P_  2d2  +  2(/z,  +hr)h,  -2 h2o-h2-h; 
2h2r+2h;-4h; 

4  d[h'  +  R  ) 


0  =  sin' 


k  fe+/i;)-2fc+rf0 

4A  -2[hr  +h, ) 


(6) 

(7) 


(8) 
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•  The  deviation  angles  for  the  right  and  left 
transducers  are  given  by 


/ 

V 

Ml 

’  ~[h+R 

>♦«■] 

I+r) 

2d 

{h,  +  R) 

{h  +  R 

Mj 

V 

2d{hr  +  R) 

) 

(9) 

(10) 


5.  Sensitivity  Analysis 

To  illustrate  how  some  measurement  errors  and 
certain  system  parameters  affect  the  radius  of  curvature 
estimate,  a  sensitivity  analysis  has  been  provided.  A 
small  perturbation  is  added  to  a  variable  and  the  change 
in  the  radius  of  curvature  estimate  has  been  observed. 
For  example,  the  error,  Ah,,  in  the  left  transducer 
measurement  affects  the  curvature  estimate  as: 


•  The  left,  central,  and  left  transducers  are  rotated  by 
0t ,  6,  and  6r  respectively  and  r,  6,  and  R  are 
estimated  again. 

4.  TOF  Estimation 

In  this  study,  two  methods  of  TOF  estimation  are 
used.  The  first  one,  thresholding ,  is  the  most  common 
method  in  TOF  ranging  systems.  When  a  pulse 
transmitted  by  the  sensor  encounters  an  object,  an  echo 
is  produced  and  the  TOF  of  the  pulse  is  considered  to  be 
the  first  time  value  at  which  the  amplitude  of  the  echo 
exceeds  a  preset  threshold  value.  Although  the 
thresholding  method  is  very  fast,  it  is  not  accurate 
enough  for  some  applications.  The  second  method  of 
this  study,  curve-fitting ,  is  employed  to  decrease  the 
error  in  TOF  estimation  by  thresholding.  In  this  method, 
a  parabolic  curve  of  the  form  ao(t  —  tc)2  is  fitted  to  the 
onset  of  the  sonar  echo.  First  initial  estimates  of  the  two 
parameters  ao  and  to  are  obtained  by  using  samples  of 
the  signal  around  the  thresholding  point.  Initial  estimate 
for  to  is  found  by  simple  thresholding,  and  ao  is 
estimated  from  the  second  derivative  approximation 
around  the  thresholding  point  [7].  The  iterative 
Lavenberg-Marquart  nonlinear  least-square  algorithm  is 
initialized  by  these  values.  To  estimate  ao  and  to ,  50 

samples  of  echo  around  the  threshold  point  have  been 
used  in  simulations  and  experiments.  The  methods  are 
illustrated  in  Figure  2. 


(curvw— <lt)  (Ihiavheld) 


Figure  2.  TOF  estimate. 


AR  (h2r+(h,+Ahf\-2(hi+,r-)  fc+V)-2fcw;)  (11) 
4/i„-2  (V+/i,+A/i,)  4h.-2{hr+h,) 


ilho  (mm) 

(a) 


(c) 

Figure  3.  Sensitivity  analysis. 
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For  sensitivity  analysis,  a  stationary  cylindrical  object 
at  0  =  0°  with  radius  5  cm  is  considered.  Figure  3(a) 
illustrates  the  effect  of  an  error,  varying  between  0-0.4 
mm,  in  ho.  The  transducer  separation  is  set  to  10  cm. 

The  error  in  R  increases  linearly  with  A ho  and 
nonlinearly  with  ha  itself.  Also,  a  positive  error  in 
ho  leads  a  positive  error  in  R  since  for  constant  hr  and 
h, ,  increasing  ho  causes  an  overestimate  as  shown  in 
Figure  1.  Figure  3(b)  and  (c)  display  the  effect  of  d  on 
the  R  estimate  when  the  constant  errors  A ho  =0.18  mm 
and  A hr  =  0. 1 8  mm  are  added  respectively.  For  small 
values  of  djho ,  the  error  is  high  since  the  resolution 
provided  by  the  differential  TOF  information  between 
the  central  and  surrounding  transducers  will  not  be 
sufficiently  large  to  estimate  the  curvature  reliably. 
Hence,  as  range  increases,  the  transducer  separation 
should  also  be  increased  to  achieve  a  higher  resolution. 

6.  Simulation  Results 

The  reflected  sonar  signal  is  modeled  by  Equation  (1) 
for  the  simulations  with  Amax=l,  zmin  =5.8cm, 
pc  =  0.45/? -0.022,  /.  =  49.4  kHz,  c  =  343.5  m/s.  For 
simulations,  100-iteration  Monte-Carlo  simulation  study 
is  employed  and  the  curve-fitting  TOF  estimate  method 
is  used.  First,  the  linear  configuration  of  the  transducers 
are  used  to  estimate  the  range,  deviation  angle,  and 
radius  of  curvature  by  Equations  6-8  respectively.  Then, 
the  central,  left,  and  right  transducers  are  rotated  around 
their  centers  by  the  angles  found  by  Equations  7,9,  and 
10  respectively.  Finally,  the  estimates  for  rotated 
configuration  are  calculated  again  by  using  Equations  6- 
8.  From  now  on,  the  first  estimate  corresponds  to  the 
estimate  taken  by  the  linear  configuration,  and  second 
estimate  corresponds  to  the  estimate  taken  by  the 
rotated  configuration.  In  all  simulations,  dashdot  (or  dot) 
and  dashed  lines  indicate  the  mean  of  the  estimate  and 
±o  (standard  deviation)  for  the  linear  and  rotated 
positions  respectively. 

Figures  4(a)  and  (b)  illustrates  the  radius  of  curvature 
estimates  when  the  thresholding  method  is  used  in  the 
absence  and  presence  of  noise  respectively.  The  first 
estimate  worsens  after  d-  37  cm  in  the  absence  of  noise 
while  second  estimate  continues  improving.  The  error  in 
the  first  estimate  is  22.8%  while  it  is  9.2%  for  the 
second  estimate.  In  the  presence  of  noise,  the  second 
estimate  provides  approximately  40%  better  results. 
Also,  the  standard  deviation  is  less  for  the  second 
estimate.  Figures  5(a)  and  (b)  shows  the  same  results 
when  the  curve-fitting  method  is  used  to  estimate  TOF  in 
the  absence  and  presence  of  noise  respectively.  In  the 


absence  of  noise,  the  first  estimate  gets  worse  after  43cm 
(it  was  37  for  the  thresholding  method).  The  error  is 
1 1 .2%  for  the  first  estimate  and  it  is  0.4%  for  the  second 
estimate.  In  the  presence  of  noise,  the  second  estimate  is 
better  than  the  first  estimate.  Comparison  of  Figure  4(b) 
and  5(b)  shows  that  the  curve-fitting  method  provides 
better  estimates  than  that  of  the  thresholding  method. 
The  improvements  are  approximately  60%  and  20%  for 
the  first  and  second  estimates  respectively. 
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Figure  4.  R  estimate  versus  d  with  thresholding  in  the 
absence  (a)  and  presence  (b)  of  noise. 


Figure  5.  R  estimate  versus  d  with  curve-fitting  in  the 
absence  (a)  and  presence  (b)  of  noise. 
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Figure  6(a)  illustrates  the  range  estimate  r  versus 
transducer  separation  d.  The  curve-fitting  method  is 
employed  for  TOF  estimates.  The  first  estimates 
improve  until  d  =  12  cm  and  then  it  worsens  since  now 
the  target  is  located  at  a  very  low  SNR  region.  R  =  5  cm, 
0=5°  are  considered.  Figure  6(b)  shows  the  range 
estimate  r  versus  deviation  angle  6.  /?  =  5  cm  and 

d  =  10cm  are  considered.  When  0  =  6°,  the  left  and 
when  0  =  12°,  the  right  transducer  start  measuring 
wrong. 
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Figure  6.  r  estimate  versus  (a)  d  (b)  6 . 

7.  Experimental  Results 
7.1.  Experimental  Set-up 

The  set-up  is  constructed  for  3D  applications.  The 
unit  consists  of  five  transmitter/receiver  Polaroid 
transducers  with  the  resonant  frequency  49.4  kHz.  The 
central  transducer  is  flanked  by  four  transducers 
symmetrically  as  shown  Figure  7.  Only  the  left,  central, 
and  right  transducers  are  used  for  the  experiments.  A  4- 
channel  DAS-50  A/D  card  with  12-bit  resolution  and  1 
MHz  sampling  frequency  is  used  to  sample  the  echo 
signals. 


Figure  7.  The  experimental  set-up. 


7.2.  Results 

Table  1  displays  some  experimental  results  for  the 
radius  of  curvature  estimate  of  a  cylindrical  object  with 
radius  50  mm  when  the  transducer  separation  is  10  cm. 
The  curve-fitting  method  is  employed  for  TOF  estimate. 
In  the  table,  Rx  and  ^correspond  to  the  radius  of 
curvature  estimates  of  the  linear  and  rotated 
configurations  respectively.  It  is  concluded  that  the 
estimates  are  improved  by  the  adaptive  configuration. 


Table  1.  Experimental  results  for  a  cylinder  of 

R  -  50  mm  and  d  =  7.5  mm. 


K 

(mm) 

E(R,) 

(mm) 

(mm) 

E(R2) 

(mm) 

°ki 

(mm) 

300 

48.27 

6.84 

48.93 

4.10 

400 

51.13 

10.08 

50.27 

6.25 

500 

49.17 

15.23 

49.44 

9.29 

600 

52.33 

22.43 

51.79 

13.24 

700 

44.96 

27.61 

46.63 

16.57 

800 

44.55 

35.02 

48.36 

21.52 

900 

50.64 

45.42 

50.25 

26.80 

1000 

59.87 

57.41 

55.48 

33.39 

8.  Conclusion 

In  this  study,  radius  of  curvature  and  location 
estimates  of  cylindrical  objects  by  a  multi-sensor  sonar 
system  has  been  investigated.  Two  methods  of  TOF  has 
been  used  and  it  has  been  shown  that  the  curve-fitting 
method  improves  TOF  estimate.  Moreover,  the  adaptive 
configuration  decreases  the  effect  of  low  angular 
resolution  of  sonar  sensors  considerably.  The  results  are 
useful  for  target  discrimination  especially  for  primitive 
types  such  as  walls  and  edges.  For  high  values  of 
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curvature,  the  target  is  assumed  to  be  a  planar  wall,  and 
for  values  close  to  zero,  the  target  is  assumed  to  be  an 
edge.  The  method  can  be  generalized  to  spherical 
objects  as  well  [8]. 
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Abstract 

Coordination ,  the  process  by  which  an  agent  reasons 
about  its  local  actions  and  the  anticipated  actions  of 
others  to  try  and  ensure  the  community  acts  in  a  coherent 
manner,  is  perhaps  the  key  problem  of  the  discipline  of 
multi-agent  systems.  In  this  paper  we  use  the  Q-learning 
method  to  guide  the  agents  to  act  cooperatively  in  multi¬ 
agent  systems.  A  special  Q-function  consists  of  two  parts, 
one  denotes  agent's  individual  pursuit  and  the  other 
denotes  the  cooperative  requirement,  is  proposed.  And  we 
use  the  pursuit  problem  as  our  testbed  system.  In  the 
testbed  system ,  agents  have  been  created  which  choose 
their  actions  by  maximizing  the  Q-values  of  them.  The 
result  of  our  simulation  is  good. 

1.  Introduction 

Modem  applications  of  computing  arise  most  fre¬ 
quently  in  environments  that  are  open,  heterogeneous, 
distributed,  dynamic,  large,  and  with  autonomous 
components.  For  these  reasons,  they  require  solutions  that 
marry  artificial  intelligence  (AI)  and  traditional  tech¬ 
niques  to  yield  extensibility  and  flexibility.  Agents  are  an 
issue  of  this  marriage.  Currently,  many  agent  approaches 
are  centralized  in  a  single  agent.  However,  centralization 
has  obvious  shortcomings  in  accommodating  the  above 
properties  of  open  environments.  Consequently,  there  has 
been  increasing  interest  in  multi-agent  systems,  which  can 
yield  the  benefits  of  intelligent  agent  while  preserving 
openness  and  scalability. 

What  sets  multi-agent  systems  apart  from  single  agents 
is  that  they  require  the  agents  to  behave  in  a  coordinated 
manner,  agents  must  follow  some  protocol  even  to 
compete  effectively.  Therefore,  the  designer  of  a  multi¬ 
agent  system  must  handle  not  only  the  application- 
specific  aspects  of  the  various  agents,  but  also  their 
interactions  with  one  another.  Current  approaches  to 
constructing  multi-agent  systems  offer  no  special  coordi¬ 
nation  support  to  tstalematehe  designer,  who  must 
manually  ensure  that  the  agents  interact  appropriately. 
This  can  lead  to  unnecessarily  rigid  or  sub  optimal 
designs,  wasted  development  effort,  and  sometimes  to  the 
autonomy  of  the  agents  being  violated.  These  limitations, 


in  effect,  subvert  many  of  the  features  that  make  multi¬ 
agent  systems  attractive  in  the  first  place. 

Researchers  in  the  field  of  Distributed  Artificial 
Intelligence  (DAI)  have  developed  a  variety  of  agent 
coordination  schemes  under  different  assumptions  about 
agent  capabilities  and  relationships,  such  as,  intelligent 
tutoring  systems[l],  traffic  or  air-traffic  control[2],  virtual 
theater[3],  realistic  virtual  training  environments^], 
virtual  interactive  fiction[5].  Most  of  those  researches 
hitherto  have  been  focused  on  enabling  individual  agents 
to  cope  with  the  complexities  of  those  dynamic  domains. 
One  promising  approach  that  has  emerged  is  the  use  of 
hierarchical  reactive  plans.  Reactive  plans  are  qualified  by 
preconditions,  which  help  select  plans  for  execution  based 
on  the  agent's  current  high-level  goals/tasks  and  beliefs 
about  its  environment. 

Millind  Tambe[6]  studied  Multi-Agent  System  based 
on  the  concept  of  team  not  on  individuals.  His  model  is 
based  on  joint  intention  framework,  which  implemented 
cooperation  using  the  concept  of  teamwork. 

This  paper  is  focus  on  applying  Q-leaming  method  on 
the  coordination  problem  among  the  agents  of  multi-agent 
system. 

2.  Coordination 

Multi-agent  systems  are  a  particular  type  of  DAI 
system,  in  which  autonomous  intelligent  agents  inhabit  a 
world  with  no  global  control  or  globally  consistent 
knowledge.  These  agents  may  still  need  to  coordinate 
their  activities  with  others  to  achieve  their  own  local  goals. 
It  is  widely  believed  that  a  group  of  cooperating  agents 
engaged  in  problem  solving  can  solve  a  task  faster  than 
either  a  single  agent  or  the  same  group  of  agents  working 
in  isolation  from  each  other. 

Coordination  is  central  to  multi-agent  systems,  without 
it  any  benefits  of  interaction  vanish  and  the  group  of 
agents  quickly  degenerates  into  a  collection  of  individual 
with  a  chaotic  behavior.  Agents  could  benefit  from 
receiving  information  about  what  others  are  doing  or  plan 
to  do,  and  from  sending  them  information  to  influence 
what  they  do.  But  the  capabilities  needed  for 
collaboration  cannot  be  patched  on  but  must  be  designed 
in  from  the  start. 
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Almost  all  of  the  coordination  schemes  developed  to 
date  assume  explicit  or  implicit  sharing  of  information.  In 
the  explicit  form  of  information  sharing,  agents  com¬ 
municate  partial  results,  speech  acts,  resource  availabi¬ 
lities,  etc.  to  other  agents  to  facilitate  the  process  of 
coordination.  In  the  implicit  form  of  information  sharing, 
agents  use  knowledge  about  the  capabilities  of  other 
agents  to  aid  local  decision-making.  We  believe  that  the 
less  an  agent  depends  on  shared  information,  and  the 
more  flexible  it  is  to  the  on-line  arrival  of  problem¬ 
solving  and  coordination  knowledge,  the  better  it  can 
adapt  to  changing  environments. 

As  flexibility  and  adaptability  are  key  aspects  of 
intelligent  and  autonomous  behavior,  we  are  interested  in 
investigating  mechanisms  by  which  agents  can  acquire 
and  use  coordination  knowledge  through  interactions  with 
its  environment  (that  includes  other  agents)  without 
having  to  rely  on  shared  information. 

Coordination  strategy  among  the  agents  of  multi-agent 
systems  can  be  classified  to  explicit  strategy  and  implicit 
strategy.  In  explicitly  cooperative  systems,  the  agents 
interact  and  exchange  information  or  perform  actions  so 
as  to  benefit  other  agents.  On  the  other  hand,  implicitly 
cooperative  agents  perform  actions  that  are  a  part  of  their 
own  goal-seeking  process  but  these  actions  affect  the 
other  agents  in  beneficial  ways. 

We  implement  the  cooperation  among  the  agents  by  an 
implicit  strategy,  which  is  based  on  a  kind  of  co-learning 
method.  Each  agent  sends  signal  to  relative  agents,  and 
the  agent  can  calculate  the  cooperative  value  by  the  signal 
sent  from  its  partner. 

3.  Reinforcement  Learning 

Reinforcement  learning,  learning  procedure  of  which  is 
driven  by  scalar  values  given  to  the  learner  to  reinforce  its 
behavior,  is  a  promising  area  of  machine  learning. 

In  reinforcement  learning  problems[7],  reactive  and 
adaptive  agents  are  given  a  description  of  the  current  state 
and  have  to  choose  the  next  action  from  a  set  of  possible 
actions  so  as  to  maximize  a  scalar  reinforcement  or 
feedback  received  after  each  action.  The  learner's 
environment  can  be  modeled  by  a  discrete  time,  finite 
state,  Markov  decision  process  that  can  be  represented  by 
a  4-tuple  ( S,  A,  P  ,r)  , where  P  :  SXSXA-+  [0  ;  1] gives 
the  probability  of  moving  from  state  s}  to  ,s2on  performing 
action  a  ,  and  r:  SX  A  is  a  scalar  reward  function. 
Each  agent  maintains  a  policy  ,  n ,  that  maps  the  current 
state  into  the  desirable  action(s)  to  be  performed  in  that 
state.  The  expected  value  of  a  discounted  sum  of  future 
rewards  of  a  policy  n  at  a  state  x  is  given  by 

Vy  =  E{Y(  ^Y  rrt)  >  where  r*t  is  the  random 

variable  corresponding  to  the  reward  received  by  the 
learning  agent  t  time  steps  after  if  starts  using  the  policy  n 


in  state  s ,  and  y  is  a  discount  rate  (0  <  y  <  1).  Various 
reinforcement  learning  strategies  have  been  proposed 
using  which  agents  can  develop  a  policy  to  maximize 
reward  accumulated  overtime. 

Watkins  firstly  applied  the  Q-  leaming[8]  algorithm  to 
multi-agent  system.  The  Q-leaming  algorithm  is  designed 

to  find  a  policy  n*  that  maximizes  V*  (  s)  for  all  states 

s  e  S.  The  decision  policy  is  represented  by  a  function, 

Q:SXA->m 

which  estimates  long-term  discounted  rewards  for  each 
state-action  pair.  The  Q  values  are  defined  as 

e;(s,a)=vra’*(s) 

where  a;n  denotes  the  event  sequence  of  choosing 
action  a  at  the  current  state,  followed  by  choosing  actions 
based  on  policy  n.  The  action,  a ,  to  perform  in  a  state  s  is 
chosen  such  that  it  is  expected  to  maximize  the  reward, 

V**  (5)  =  max  QT  (s,  a )  for  all  s  e  S. 

1  aeA  ' 

If  an  action  a  in  state  s  produces  a  reinforcement  of  R 
and  a  transition  to  state  s\  then  the  corresponding  Q  value 
is  modified  as  follows: 

Q(s,  a)  <-(l-P)Q(s,a)  +  P(R+  ymaxg  (s', a')) 

a'eA 

The  update  rule  above  is  used  to  guide  the  agents’ 
selections  of  action. 

There  are  also  some  other  works  related  to  learning  in 
multi-agent  system.  Tan  [9],  and  Sen  and  Sekaran[10] 
represent  works  in  multi-agent  reinforcement  learning 
systems. 

Shoham  and  TennenhoItz[l  1]  discuss  co-learning  and 
the  emergence  of  conventions  in  multi-agent  systems  with 
simple  interactions. 

We  address  the  coordination  problem  in  multi-agent 
system.  The  Q-Ieaming  method  is  employed  and  a  special 
Q-function  is  defined.  The  calculation  of  Q-value  depends 
on  not  only  how  to  achieve  the  agent’s  local  goal  quickly 
but  also  how  to  maintain  the  coordination  with  its  partner. 
Agents  select  actions  according  to  the  Q-value,  what  can 
lead  to  good  Q-value  is  selected  in  every  turn. 

4.  Implementation 

Some  previous  works  on  using  learning  techniques  to 
coordinate  multiple  agents  have  mostly  relied  on  using 
prior  knowledge,  or  on  cooperative  domains  with 
unrestricted  information  sharing.  Even  previous  proposals 
for  using  reinforcement  learning  for  coordinating  multiple 
agents[9]  have  relied  on  explicit  information  sharing.  We 
concentrate  on  systems  where  agents  share  no  problem¬ 
solving  knowledge  and  each  agent  is  using  reinforcement 
learning  techniques  to  optimizing  its  own  environment 
reward. 

Agents  employing  Q-leaming  method  decide  its  actions 
according  to  the  Q-value.  To  consider  the  coordination  of 
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agents,  the  Q-value  should  not  calculate  only  based  on  the 
agents’  local  benefit.  Otherwise  the  coordination  of  agents 
will  not  achieved.  So  we  combine  the  cooperative  factor 
and  local  factor  to  calculate  the  Q-value.  The  special  Q- 
function  in  this  paper  consists  of  two  parts.  One  part 
(presented  as  Vperceive)  denotes  the  value  calculated  by  the 
perceptual  information  (i.e.  local  factor),  and  the  other 
denotes  the  value  (presented  as  Vcoordinate)  calculated  by  the 
information  sent  by  the  partners  (i.e.  cooperative  factor). 
The  formula  used  to  calculate  Q-value  is: 

Q-  ^Vperceive  (l“S)Vcoor(jjnate 
where, 

8  e  (0,1). 


5.  Experiment-pursuit  problem 

We  use  the  pursuit  problem  as  our  simulation  example. 
The  pursuit  problem  is  a  well-known  tested  problem  from 
the  Distributed  Artificial  Intelligence  literature.  We  based 
our  implementation  on  the  description  of  the  game  and 
definitions  of  performance  measures  that  are  described  in 
literature  [12]. 

The  example  we  consider  here  is  the  situation  that  four 
predators  pursuit  one  prey  in  a  grid  world.  The  prey 
chooses  randomly  between  its  possible  actions:  staying 
where  it  is,  or  moving  one  of  the  four  directions(  or  less, 
if  some  directions  are  blocked).  Diagonal  moves  are  not 
allowed.  The  predators  are  placed  at  random  positions.  At 
each  time-step,  the  prey  may  move  first.  Then  the 
predators  move.  They  move  one  by  another,  thus  avoiding 
collisions.  Possible  outcomes  are  capture,  stalemate  and 
escape.  A  capture  occurs  when  the  four  positions  around 
the  prey  (left,  right,  above  and  below)  are  occupied  by 
predators. 

Each  predator  can  perceive  the  environment  and  can 
calculate  the  distance  between  itself  and  prey,  and  receive 
the  signal  from  its  neighbors.  We  believe  that  the  capture 
situation  can  not  be  attained  if  the  predator  select  their 
actions  only  based  on  how  to  minimize  the  distance 
between  themselves  and  prey.  It  is  also  important  that  the 
predators  employ  the  cooperative  action  at  each  time-step. 
So  we  use  a  special  Q-function  that  compose  of  each 
predator’s  individual  goal  (i.e.  minimizing  the  distance 
between  prey  and  itself  )  and  the  coordination  factor(i.e. 
maintaining  the  suitable  angle  between  predators)  to 
calculate  the  Q-value  that  guide  each  predator’s  selections 
of  actions.  To  our  4-predators  example,  the  proper  angle 
for  capture  between  the  predators  is  tc/2. 

Let  us  use  Als  A2,  A3  and  A4  denote  four  predators  and 
P  denotes  prey,  and  suppose  that  A,  has  two  neighbours 
A2  and  A3.  Then  the  Q  function  which  A!  use  is  as  the 
following: 

Q  =  sD  +  (  l-s)V 

where , 
ee(0,l); 


D=l/Distance(A1,P);  And  Distance^,  P)  denotes  the 
distance  between  prey  and  Aj; 

V  =  (Distribution^  ,  A2 )  +  Distribution^,  ,A2))/2; 


Distribution (At  9  A.)  =  1  - 


j-Phi(At,Aj) 


The  formula  contains  an  important  parameter  s.  This 
had  to  be  fixed  first.  This  parameter  8  is  initially  set  to 
zero.  With  such  a  parameter,  the  agents  judge  the 
coordination  with  their  neighbors  to  be  useful  when  they 
maximize  their  distribution  around  the  prey.  This  yields 
no  prey-following  behavior,  but  causes  the  agents  to  try  to 
maintain  angles  of  90  degrees.  Then  let  the  parameter  s 
increase  gradually.  Even  at  low  value  of  the  parameter  8 
(e.g.  8  =  0.08),  the  predators  move  towards  the  random 
moving  prey.  Sometimes,  the  agents  would  converge  to 
the  orthogonal  positions  around  the  prey  (i.e.  left,  right, 
above,  and  below  it),  a  capture  situation  occurs,  but  in 
many  cases  they  converged  to  the  diagonal  positions,  a 
stalemate  situation  occurs,  as  is  shown  in  figure  1 . 


P 

A, 

A, 

Figure  1  A  stalemate  situation;  the 
parameter  e  is  too  low 

As  the  angles  between  the  agents  are  also  optimal  in 
this  situation,  it  is  another  stale  configuration.  However, 
in  the  4-connected  pursuit  problem  we  investigate  here, 
only  the  first  one  is  a  capture  situation(i.e.  the  four 
orthogonal  positions  around  the  prey  are  occupied  by 
predators).  It  turns  out  that  raising  the  parameter  8  yields 
just  result.  With  the  parameter  value  of  0.15,  we  tested  the 
performance  of  the  system.  This  turned  out  to  be 
satisfactory  for  our  purpose. 

Figure  2  shows  another  stalemate  situation.  The  reason 
for  this  stalemate  is  opposite  to  the  stalemate  situation 
described  above.  Here,  the  parameter  8  was  too  high(e=l). 
We  went  to  lower  this  parameter  gradually  to  increasing 
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the  influence  of  distribution  factor  until  this  stalemate 
situation  disappeared.  This  first  occurred  at  e  =  0.95. 


Figure  2  Another  stalemate  situation;  the 
parameter  e  is  too  high 

We  do  our  simulations  at  an  IBM  comportable  PC, 
programming  with  Microsoft  Visual  C++  on  the  platform 
of  Windows  95. 

Our  simulations  run  100  trials  in  the  various  grid 
worlds,  including  40X40  grid,  50X50  grid,  60X60  grid. 
In  all  of  these  situations,  we  firstly  suppose  that  the  prey 
moves  to  the  position  that  is  furthest  away  from  the 
nearest  predator  and  rests  in  10%  of  its  moves,  instead  of 
randomly  choosing  between  the  allowed  options.  The  four 
predators  act  one  by  one,  and  they  always  select  the 
action  that  can  maximize  the  Q-value  at  every  time-step. 
The  final  results  show  that  the  success  ratio  is  1 00%. 

6.  Conclusion 

In  this  paper  we  use  a  Q-learning  method  with  special 
Q-function  to  discuss  the  coordination  in  multi-agent 
system.  The  main  idea  of  this  paper  is  to  build  the  Q- 
function  that  combine  the  local  factor  and  cooperative 
factor,  and  try  to  make  the  agents  employing  Q-leaming 
technique  with  such  Q-function  coordinate  their  actions. 
The  experiment  results  show  that  this  idea  is  effective. 
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Abstract 

In  this  paper ,  we  propose  a  method  to  improve  Genetic 
Algorithm  using  Boltzmann  selection  which  Michael  has 
suggested.  But  Michael  uses  temperature  schedule  (the 
initial  temperature,  the  cooling  rate),  which  can  be 
applicable  only  to  the  limited  range  of  problems. 

We  propose  a  new  method  to  find  the  critical 
temperature  and  the  cooling  rate  as  parameters  of  the 
temperature  schedule.  The  critical  temperature  can  be 
derived  from  the  distribution  of  each  individual's  fitness. 
Through  the  application  of  the  island  model  where  each 
island  has  differing  cooling  rate,  it  is  proved  that  it  is 
unnecessary  to  find  the  optimal  cooling  rate.  The 
simulation  on  the  TSP ’s  with  various  city  sizes  proves  the 
proposed  critical  temperature  correct. 

1.  Introduction 

To  solve  the  optimization  problem,  researches  based 
on  natural  phenomena(the  physical  or  biological  process) 
has  been  conducted.  For  example,  the  GA(Genetic 
Algorithm)  and  the  SA  (Simulated  Annealing)  take  their 
idea  from  the  evolution  and  the  annealing  process,  each 
other.  The  GA[8][9][10]  is  a  searching  algorithm  that 
has  its  origin  on  the  evolution  by  “the  natural  selection”. 
The  GA  uses  operators  like  reproduction,  crossover,  and 
mutation,  etc.  During  the  process,  it  forces  individuals 
with  relatively  high  fitness  survive.  The  SA[6][7] 
simulates  the  annealing  process  of  solid  using  statistical 
physics  method.  By  giving  some  probability  to  transit  to 
higher-energy  states,  it  can  escape  from  local  minima. 

The  SA  is  naturally  serial  processing  algorithm  and 
difficult  to  implement  on  parallel  systems  with 
appreciable  linear  speedup.  Many  approaches  have  been 
tried  to  make  SA  applicable  to  parallel  system. 

Hyeon-Joong[2]  introduced  a  population-oriented  SA, 
where  each  individual  in  the  population  can  intelligently 
have  its  own  annealing  schedule  in  an  adaptive  fashion. 

Saul  [5]  decomposed  annealing  algorithms  of  a 
multiprocessor  design  into  subtask,  which  could  be  run 
on  an  experimental  shared-memory  multiprocessor. 


To  solve  job  shop  scheduling,  Krishna[l]  developed 
distributed  SA  algorithms  (the  temperature  modifier,  the 
locking  edges  and  the  modified  locking  edges 
algorithms). 

Recently,  researches  about  hybrid  approaches  that 
combined  the  power  of  the  GA  and  the  SA  have  been 
conducted. 

Hao  Chen[4]  applied  GSA(Genetic  Simulated 
Annealing)  to  SIMD(Single  Instruction  Stream-Multiple 
Data  stream)  machine  to  solve  the  TSP  and  the  Error 
Correction  Code  Design. 

Michael[9]  suggested  the  Boltzmann  selection  for  the 
GA.  He  showed  that  the  translation  invariance  property 
of  the  Boltzmann  selection,  and  introduced  the 
temperature  as  the  convergence  control  parameter,  and 
explained  why  the  Boltzmann  selection  outperforms  the 
general-used  proportional  selection  on  some  problems. 
But  he  determined  the  temperature  schedule  by 
performing  experiments  on  just  one  problem. 

In  this  paper,  a  new  temperature  schedule  for  the  GA 
using  the  Boltzmann  selection  to  improve  GA 


2.  GA  using  Boltzmann  selection 
2.1.  Terms 

A  population  consists  of  many  individuals.  Generally, 
the  GA  uses  a  fixed  population  size. 

Each  individual  represents  one  instant  of  given 
optimization  problem  and  has  a  chromosome  having 
information  of  the  instant. 

A  chromosome  composes  of  a  fixed  size  of  alleles, 
which  is  usually  a  bit  or  an  integer. 

In  the  TSP(Travel  Salesman  Problem)  which  is  the 
test  problem  used  in  this  paper,  the  chromosome  is 
simply  a  list  of  the  positions  about  N  cities.  For  example 
of  10-city,  954213867  means  that  the  visiting  order 
is  9,  5,  4,  so  forth. 

The  fitness  of  an  individual  in  a  GA  is  the  value  of  the 
objective  function  for  its  phenotype.  The  GA  objective  is 
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to  find  maximum  fitness  in  the  solution  space  according 
to  the  given  problem. 

2.2.  GA  using  Boltzmann  selection 

1.  Initialization 

The  initial  population  consists  of  a  number  of 
individuals  with  random  allele  values.  In  the  TSP, 
random  visiting  orders  are  used. 

2.  Calculation  fitness 

Before  selection  procedure,  the  fitness  of  individual 
should  calculated. 

3.  Selection 

The  selection  probability  of  the  Boltzmann  selection  is 
as  follows.  [9] 


£l 


k 


Pj  :  the  selection  probability  of  the  j-th’  individual 
fj  :  the  fitness  of  the  the  j-th’  individual 


In  selection  procedure,  two  individuals  are  selected  for 
crossover,  the  next  procedure. 

4.  Crossover 

In  crossover  procedure,  the  selected  individuals 
exchange  its  information  with  the  probability  of  the 
parameter  ‘crossover  rate’  by  the  crossover  operator  and 
generate  child  solutions. 

In  the  TSP,  the  simple  standard  crossover  operator 
would  be  almost  guaranteed  to  generate  invalid  solution. 
The  PMX[8]  method  ensures  that  all  child  solutions 
generated  are  legal. 

Consider  the  following  example  for  10  city. 

A:984|567|I320 

B:87  1|230|9546 

->  By  the  crossover  operator(PMX) 

A’:984|2  3  0|1657 

B’:801|567|9243 

where,  the  A  and  the  B  are  chromosomes  of  two 
selected  individuals.  The  section  bounded  by  the  two  |  is 
the  crossover  section,  which  is  randomly  chosen.  The 
PMX  operator  performs  position-wise  exchanges,  in  the 
bounded  crossover  range.  In  the  example,  the  ‘5’  in  the  A 
exchanges  the  ‘2’  in  the  A  not  the  B. 


5.  Mutation 

After  the  crossover  operation,  with  the  probability  of 
the  parameter  “mutation  rate”  alleles  of  the  child 
individual  are  mutated(changed).  In  the  TSP,  a 
subsequence  of  the  tour  is  chosen  and  the  order  is 
reversed. 


6.  Loop 

The  1-5  procedures  are  repeated  until  all  individuals 
converged  during  the  given  iteration  number. 


3.  Temperature  schedule 
3.1.  Temperature 


The  Boltzmann  selection  does  not  the  selective 
pressure  in  high  temperature.  In  a  sufficiently  high 


temperature, 


L 

T 


is  approximately  zero.  So  P. 


l/#of 


individuals.  The  selection  probabilities  of  each  individual 
are  same,  and  there  is  little  selective  pressure[9]. 

But  in  a  sufficiently  low  temperature,  the  Boltzmann 
selection  would  select  individual  with  the  maximum 
fitness.  Other  individuals  may  have  relatively  low 
probability.  In  this  case,  the  selective  pressure  is  high. 

Therefore,  the  temperature  acts  like  the  parameter  that 
controls  the  degree  of  the  selective  pressure  (the 
convergence  control  parameter). 

In  high  temperature,  low  selective  pressure  lets  an 
individual  (solution)  search  the  solution  space  freely.  As 
the  temperature  becomes  lower,  the  selective  pressure 
becomes  higher,  so  ‘good  individual’  with  higher  fitness 
has  higher  probability  to  survive(selected)  and  crossover. 


3.2.  Critical  temperature 

The  following  temperature  schedules  are  generally 
used  for  an  annealing  algorithm. 


T,=(Te 


cooling  rate 


)% 


-(2) 


The  temperature  schedule  (2)  has  two  parameters:  the 
initial  temperature  and  the  cooling  rate.  These 
parameters  are  found  usually  by  ‘trial  and  error’  or  many 
experiments. 

As  explained  above,  in  high  temperature  at  initial 
stage,  there  is  no  difference  between  the  selection 
probabilities  of  ‘good  individual’(with  higher  fitness)  and 
‘bad  individual’  (with  lower  fitness).  Therefore 
individuals  in  the  population  are  inclined  to  search  the 
solution  space  freely  by  the  mutation  pressure.  As  the 
temperature  becomes  lower,  the  selection  pressure  is 
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higher.  At  a  certain  temperature,  the  mutation  pressure 
becomes  relatively  much  lower  than  before.  This 
temperature  is  called  the  critical  temperature. 

Using  this  critical  temperature  as  the  initial 
temperature,  wasteful  annealing  time  in  high  temperature 
can  be  removed  and  the  problem  of  the  premature 
convergence  in  low  initial  temperature  can  be  solved. 
But,  Michael  ignored  such  an  important  critical 
temperature.  He  chose  the  temperature  through  which  the 
best  individuals  can  be  selected  only  by  repeated 
experiments  on  just  one  problem  as  initial  temperature 
and  applied  it  to  other  problems.  [9]. 

In  this  paper,  to  derive  the  critical  temperature,  a  new 
selection  constant  a  is  introduced  as  follows. 


a  = 


-(3) 


a  :  the  selection  constant 

Pmax  :  selection  probability  of  the  maximum  fitness 
individial 

Pmm :  selection  probability  of  the  minimum  fitness 
individial 

a  is  a  ratio  of  selection  probabilities  between  the 
maximum  and  the  minimum  fitness  individual.  The  eq. 
(3)  and  the  eq.  (1)  can  be  combined  into  eq.  (4) 

fm  ax 

D  p  T  fmzx.  ~/min 

a=_i=  =e  T  -(4) 

D  /min 

e  T 


If  the  difference  between  the  maximum  and  the 
minimum  fitness  is  known,  using  the  eq.  (4),  the 
selection  constant  a  can  be  found. 

When  the  selection  constant  a  reaches  a  certain 
value,  the  selective  pressure  meets  the  mutation  pressure 
and  the  mean  values  of  the  fitness  begins  to  decrease. 
This  is  the  critical  temperature  point.  Thus,  if  the  a 
value  at  the  critical  temperature  is  known,  the  critical 
temperature  can  be  easily  found  using  the  following  the 
eq.  (5),  which  is  easily  derived  from  the  eq.  (4). 


-(5) 


Tc  :  the  critical  temperature 


/max  “/min  :  difference  between  the  maximum 

and  the  minimum  of  fitness 

ac  :  the  a  value  at  the  critical  temperature 

From  the  simulation  performed  later,  valid  ac  value 
is  e . 

3.3.  Island  model  &  cooling  rate 

The  cooling  rate  is  defined  as  the  rate  of  temperature 
decreasing  per  one  generation.  The  range  of  cooling  rate 
is  between  0.9  and  1.  If  the  cooling  rate  is  near  1,  the 
temperature  would  be  decreasing  slowly,  and  individuals 
search  the  solution  space  sufficiently  for  a  long  time,  and 
the  annealing  time  becomes  sufficiently  long.  Therefore 
optimum  solution  can  be  found  easily.  But  the  time  of  the 
convergence  is  too  long. 

Otherwise,  the  temperature  would  be  decreasing 
quickly,  and  without  searching  the  sufficient  range  of 
searching  space,  it  can  easily  get  into  the  local  minima. 
So  the  bad  solution  can  be  obtained.  But  in  this  case,  the 
solution  can  be  found  quickly. 

The  island  model  using  several  cooling  rates  can  solve 
this  tradeoff.  The  island  model  is  a  good  evolution  model 
using  several  isolated  islands.  And  in  this  model  the 
individuals  migrate  through  the  islands  with  a  certain 
ratio.  [10] 

In  this  paper,  cooling  rates  applied  to  each  island  are 
3  values:  small,  medium,  near  l(for  example,  0.9,  0.99, 
0.999).  In  each  island,  1%  of  its  population  per 
generation  migrate  to  the  island  with  the  next  smaller 
cooling  rate  per  one  generation  [3].  At  island  with  the 
lowest  cooling  rate,  no  migration  from  the  island  to 
other  island  occurs. 

The  island  with  the  lowest  cooling  rate  would 
converge  fastest  but  good  solution  is  not  guaranteed.  The 
individual  from  the  island  with  higher  cooling  rate 
migrates  and  the  individuals  which  migrate  from  the 
island  with  higher  cooling  rate  to  the  island  lower 
cooling  rate  can  be  selected,  crossovered  and  escape  from 
local  minima.  Therefore,  using  the  island  model  with 
differing  cooling  rates  can  show  fast  convergence  and 
good  solution. 

3.4.  The  proposed  algorithm 

The  algorithm  using  the  proposed  temperature 
schedule  as  following. 
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Figure  1.  the  flowchart  of  the  proposed  GA 
algorithm 


1.  initialize 

1.1.  calculate  the  initial  temperature  by  the  eq.  (5) 

1.2.  generate  random  individuals  in  the  population  of 
the  each  island 

1.3.  set  the  crossover  rate,  the  mutation  rate 

2  evolve  each  island  by  one  generation 

2.1  select  two  individuals  using  the  Boltzmann 
selection 

2.2  crossover  the  individuals  with  the  probability  of 
the  crossover  rate 

2.3  repeat  the  2. 1-2.2  processes  until  generate  all 
individuals  in  the  population 

2.4  mutate  all  individuals  in  the  population  with  the 
probability  of  the  mutation  rate. 

3.  multiply  each  cooling  rate  by 

T  T  T 

cooling  rate,  fast  cooling  rate, normal  1  cooling  rate, slow 

9  9 

4.  migrate  randomly  selected  1  %  from  each  island  to 
next  lower  cooling  rate  island 

5.  The  1~4  procedure  are  repeated  until  all  individuals 
converged  or  in  the  given  iteration  number. 


To  show  the  proposed  temperature  correct,  the  TSPs 
with  100  individuals  are  simulated.  In  this  case,  no 
island  model  applied  to  the  simulations. 

The  figure  2  is  graphs  for  10,  50,  100,  200,  1000  city 

with  the  initial  temperature  ~  ^max  ~^nin^ 
and  the  cooling  rate  is  0.9995  and  iteration  number  is 
50,000.  Three  kind  of  fitness  (the  maximum,  the 
minimum,  and  the  mean  value  about  the  population)  is 
plotted.  The  direction  of  the  generation  is  from  right  to 
left. 


Figure  2.  the  fitness  graphs  versus  the 
temperature  for  the  10,  50, 100,  200, 1000  city 
TSPs 


Following  table  is  the  mean  values  of 
while  first  1000  iteration. 


f  ~f 

J  max  J  m 


4.  Simulation 


4.1.  Parameters 


Parameters  used  in  the  simulation  are  as  follows. 
Number  of  individuals  per  island  ;  100 
Mutation  rate  :  0. 1 
Crossover  rate  :  0.6 


T  ■  O  Q 

cooling  rate ,  fast  ' 

^ cooling  rate, normal  ’ 

T  *  O  QQQ 

cooling  rate, slow  * 


4.2.  Critical  temperature 


Table  1.  the  mean  values  of  f  -  f.  with 

J  max  J  min 

_  various  city  sizes _ 


Number 
of  City 

Mean  of 

./max 

Mean  of 

/ 

J  min 

Mean  of 

f  ~  f 

J  max  J  min 

10 

-3.17718 

-6.09914 

2.92196 

50 

-23.01106 

-30.77153 

7.76047 

100 

-47.12069 

-57.45719 

10.3365 

200 

-96.13731 

-110.47499 

14.33768 

1000 

-502.2578 

-534.57958 

32.32178 

Vertical  arrow-lines  are  drawn  to  the  figure  according 
to  the  mean  values  of  the  table  1.  The  critical 
temperature  with  rapid  decrease  and  vertical  lines  are 
near  agreed.  This  fact  shows  that  experimental  value  of 
CL  is  e  .  Thus,  the  eq.  (5)  can  be  simplified  as  follows. 


220 


1  c  J  max  J  min 

For  the  10  city  TSP,  the  critical  temperature  from  the 
eq.  (6)  is  somewhat  incorrect,  lower  than  the  temperature 
shown  by  the  graph.  In  the  case  of  small  number  of  city, 
while  high  temperature,  the  probability  of  finding  good 
solution  is  higher  than  bigger  city  size  case.  So,  fitness 
values  become  larger  in  the  higher  temperature  than  the 
temperature  calculated  by  the  eq.  (6). 

4.3.  Cooling  rate 


Table  2.  compare  the  proposed  temperature 
schedule  with  other  various  schedules 


Cooling 

rate 

Initial  temperature 

100 

10 

1 

General 

temperat 

ure 

schedule 

0.9 

3289,- 

8.5143 

3512,- 

8.57301 

3231,- 

8.54775 

0.99 

3840,- 

8.49933 

4416,- 

8.49171 

3167,- 

8.53157 

0.999 

9904,- 

8.91667 

9686,- 

8.41409 

8768,- 

8.49519 

Proposed 

method 

Prop 

osed 

cooling 

rate 

5112,  -8.36853 

(the  eq.(6)  is  used  for  the  initial 
temperature) 

Above  table  summaries  mean  values  of  the  result(the 
iteration  number  when  occur  best  fitness  and  its  fitness 
value)  of  15  repeated  simulation  on  the  100-city  TSP  and 
10000  iteration  using  general  schedules  and  the  proposed 
schedule. 

As  shown  by  the  table  2,  the  general  schedule  with  the 
initial  temperature  10  and  the  cooling  rate  0.999  shows 
the  best  fitness,  but  the  calculation  time  is  long.  When 
the  initial  temperature  is  10  and  the  cooling  rate  is  0.99, 
the  fitness  of  the  solution  is  ranked  the  second  and  the 
calculation  time  is  half  of  best  case.  According  to  the 
application,  if  fast  convergence  is  important,  the  latter 
schedule  is  proper,  if  optimum  fitness  is  important  and 
the  calculation  time  is  unimportant,  former  schedule  can 
be  used.  The  decision  of  the  temperature  schedule  like 
above  processing  should  be  performed  whenever  the 
Boltzmann  selection  applies  to  the  application 

Even  through  the  proposed  method  have  no  need  to 
take  time  to  find  the  temperature  schedule  above,  it 
shows  the  best  fitness  in  the  table  2.  But  the  calculation 
time  is  not  yet  best. 

4.4.  Examples  of  the  TSP  results 


The  figure  3  shows  results  of  proposed  algorithm  with 
10,  50,  100,  200  city  TSPs  and  10000  iteration. 


Figure  3.  the  example  of  10,  50, 100,  200-city 


5.  Conclusion 

This  paper  presents  the  temperature  schedule  of  the 
GA  using  the  Boltzmann  selection.  The  proposed 
schedule  consists  of  the  critical  temperature  derived  from 
the  distribution  of  the  individuals’  fitness  and  the  island 
model  having  different  cooling  rates. 

For  more  than  100  city  TSP,  the  critical  temperature 
is  nearly  correct.  The  proposed  schedule  shows  better 
fitness  than  other  general  schedules  by  the  simulation. 
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Abstract 

In  this  paper  we  are  interested  with  human 
expression  and  communication  through  Man 
Machine  Interaction  and  with  the  trends  that 
improve  human-friendly  interfaces  for  new 
processes  taking  into  account  kansei 
technologies.  These  technologies  are  a  central 
part  of  the  development  of  new  interfaces. 
Besides  the  design  of  new  tools  and  new 
* intelligent  *  functions,  specially  based  on 
gesture  interaction ,  is  showing  the  importance  of 
co-designing  the  systems  (hardware  part  and 
software  part). 

Human  wants  to  be  able  to  work  his  way. 
Man-Machine  Customization  is  a  Mechatronics  field. 


1.  Introduction 

Making  and  using  the  computers  and  all 
computer-assisted  machines  and  devices  is  more  and 
more  a  mechatronics  field.  The  introduction  of 
multiparameterised  capture  tools  has  brought  new 
possibilities  in  various  domains  such  as  robotics,  CAD 
systems,  art,  computer  assisted  painting. 

Though  many  people  think  that  computer  graphics 
possibilities  are  now  sufficiently  convenient  for  artists 
and  designers,  such  as  car  designers,  we  think  that 
there  are  still  many  devices  missing  allowing  natural 
gesture  and  intelligent  interaction. 


Let  us  give  a  simple  gesture  example  such  as  a 
Japanese  word  written  with  calligraphy,  which  means 
‘silk*.  There  are  rules  to  write  this  kind  of  word. 
(Fig.  1) 


Fig.  1  :  ‘Silk’  in  Japanese. 


With  a  computer  environment  you  cannot  realize  that 
sort  of  symbol  and  drawing  with  rapid  and  simple 
direct  brush  strokes;  it  is  no  more  possible  using  latin 
letters  as  following  (fig.  2).  The  reason  is  that  to 
render  that  kind  of  writing  or  painting,  the  gesture 
and  the  brush  have  to  be  working  together  allowing  a 
vivid  stroke. 


Fig.  2  :  Calligraphy  mode 
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In  various  cultures  calligraphy  expressed  the 
need  of  shared  knowledge  [1],  with  art  and  beauty, 
using  specific  brushes  in  slightly  different  manners. 
There  are  precised  rules  for  all  the  cycle  :  in  the 
preparation  of  the  tools,  in  the  use  and  in  the  drawing. 

We  intend  to  realize  such  a  kind  of  brushes  and 
more  generally  capture  tools  with  ‘intelligent’ 
possibilities,  in  design,  in  painting  and  in  robotics  [2]. 
In  a  general  way,  we  want  to  have  in  hands  tools  easy 
to  use,  gesture  sensitive,  customizable  and  efficient. 

We  shall  see  successively  the  problem  position, 
the  state  of  the  art,  the  notion  of  customization  and 
kansei,  the  proposed  ‘intelligent’  devices,  and  the 
‘SMART  PEN’  presentation. 

2.  The  problem  position 

Nowadays,  new  technologies  are  everywhere,  in 
personal  and  in  working  life.  We  may  consider  with 
Aart  BIJL  [3]  several  aspects  : 

-  what  do  people  have  to  know  in  order  to  use  these 
new  technologies? 

-  and  how  does  that  fit  with  whatever  else  people 
know  and  do? 

We  may  consider  as  well  : 

-  what  does  it  bring  to  people?  Such  as  easiness, 
pleasure  in  using  functions,  possibility  of  creation  for 
architects,  artists,... 

-  which  new  functions  are  being  created,  and/or  have 
to  be  created  that  will  allow  new  customs. 

In  man-machine  interaction,  some  of  these 
functions  are  based  on  the  use  of  new 
multiparameterised  intelligent  capture  tools,  allowing 
to  take  into  account  gesture  interaction. 

We  believe  that  many  new  devices  are  still  to  be 
created  for  gesture  rendering.  Workshops  are  now 
concerned  with  only  gesture  man  machine  interaction 

[4]. 

Some  systems  allow  mixed  interaction  with  voice 
and  gesture  for  example,  as  we  say  ‘give  me  a  pen’ 
and  then  write. 

One  of  the  new  trends  is  to  allow  the  art  people 
to  use  their  senses  and  their  sensibility  in  man- 
machine  interaction. 


3*  The  state  of  the  art 

3.1.  Twenty  years  for  a  mouse 

Capture  tools  are  not  easy  to  be  developed.  So, 
it  took  twenty  years  to  the  mouse  invented  by 
Douglas  Englebert  to  become  of  ordinary  use. 

Some  devices  are  being  created,  that  are  no  more  on 
the  market  place.  It  was  the  case  for  the  Max 
digitiser,  with  seven  degrees  of  freedom  [5].  We  think 
that  various  reasons  such  as  the  cost  of  devices  and 
applications,  the  lack  of  drivers  available,  the  lack  of 
power  of  the  computer  machines  a  few  years  ago, 
were  stopping  the  realization  of  interesting  devices. 
Internet  technologies  will  allow  easier  integration. 

3.2  In  Graphics  domain 

Already  fifteen  years  ago,  126  levels  in  half  a 
centimeter  flexibility  of  a  pen  pressure  were  proposed; 
they  could  not  be  discernible. 

The  effect  of  just  an  added  parameter  to  x-y  selection 
could  not  be  calculated  in  realtime,  with  usual 
environments,  so  it  was  unpleasant  and  inefficient  to 
use  the  pressure  of  such  a  pen. 

Other  parameters  were  first  simulated  with 
potentiometers,  then,  added  inside  the  pen  for  gesture 
rendering  [2],  associated  with  graphic  tablets,  or 
captured  with  CCD  cameras. 

In  man  machine  interaction  (MMI),  and  specially  in 
Graphics  computer  arts,  there  are  two  kinds  of 
processes  :  those  calculated  in  real  time,  and  those 
whose  effect  is  delayed.  The  brush  stroke  should  leave 
a  trace  in  real  time,  even  if  that  trace  is  changing  with 
the  time,  such  as  in  real  water  painting  ;  the  result  is 
different  when  the  paper  is  drying! 

Graphic  palettes  such  as  Quantel,  are  giving  very  rich 
possibilities  using  image  treatment  and  synthesis,  but 
gesture  subtlety  is  not  yet  sufficiently  expressive. 

A  special  domain  is  graphic  creation  for  2D  and  3D 
cartoons. 

3.3  In  Robotics 

Many  multiparameterized  tools  are  designed  for 
robots  activities.  Robots  now  use  multimodal  input 
devices  [6]and  emotional  effects.  Teams  are  working 
on  friendly  robots  to  improve  human-robot 
communication  [7]. 

Next  rEEE/ Conference  on  Intelligent  Robots  and 
Systems  (IROS  99)  interest  is  on  ‘Human  and 
Environment  Friendly  Robots  with  High  Intelligence 
and  Emotional  Quotients4  [8], 
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3.4.  The  general  context 

These  new  interfaces  for  personal  and 
cooperative  work  take  into  account  various 
developments  in  many  domains  : 

-  in  mechatronics  design, 

-  in  techniques  for  creating  and  using  3D,  in  real, 
Virtual  and  Augmented  Reality  worlds,  [9,10] 

-  in  context-based  systems,  4 intelligent’ systems  using 
methods  such  as  fuzzy  logic,  clustering,  soft 
computing,  reinforcement  learming, 

,  in  the  interaction  and  cooperation,  using  cognitive 
sciences,  affective  engineering, 

-  in  techniques  for  handwriting  and  typewriting 
recognition  (OCR,  online  character  recognition),  for 
latin,  japanese  and  Chinese  writing.  They  should 
benefit  of  multiparameterised  capture  tools. 

-  in  techniques  bringing  new  possibilities  of  custo¬ 
mization ,  in  communications  and  cooperation,  using 
telecommunications,  the  Internet  technologies,  such 
as  VRML  extentions. 

These  studies  allow  the  design  and  use  of  new 
‘intelligent’  devices  for  human  interfaces. 

4.  Customization  and  Kansei 

We  have  presented  previously  the  necessity  of 
customization  in  MMI  [11].  It  is  a  topic  among  the 
streams  that  carry  out  the  new  interfaces  and 
processes.  -  For  exemple,  hall  of  the  people  who  buy 
the  new  Macintosh  Inc.  IMAC  get  a  Internet 
connection  the  same  day,  surely  because  of  the 
connection  mode  simplicity  -  not  a  complicate 
specialist  operation. 

So,  if  men  try  to  improve  the  cycle  impression- 
expression,  besides,  analyses  and  modelisation 
transfer  partly  human  impression  and  decision  to 
machine  control.  This  is  one  of  the  objectives  of 
kansei  technologies. 

These  two  streams  will  tend  to  improve  M-M 
interaction. 

4.1  ‘DEVICE’  :  a  human-friendly  interface 

With  Sheffield  university  we  have  proposed 
‘DEVICE’  (Design  Environment  with  Visual 
Interaction  and  Creative  Expression)  [12],  a  human- 
friendly  interface  :  a  user-definable  menu  allows  to 
link  gesture  multiparameterised  capture  and  its 
interpretation  for  driving  processes  in  design  and 
painting  and  robotics.  The  user  is  able  to  enter  in  a 
personal  way  and  simultaneously  information  on  the 
objects  and  on  the  way  to  decode  them. 


In  Robotics,  the  method  allows  to  give 
importance  to  natural  aptitudes,  and  knowhow  and  to 
build  ‘testbeds’  for  trying  new  devices  [6]. 

4.2.  Three  levels  for  customization 

In  designing  and/or  carrying  out  computer 
assisted  processes,  we  have  three  levels  interacting 
where  customization  intervenes  :  construction,  use 
and  control  or  decision. 

4.2.1.  customization  through  the  construction  of 
the  tools,  starting  from  a  multiparameterised  capture 
one  builds  the  models  of  the  command,  the  task  and 
the  context.  Construction  is  defined  by  the  user 
graphically,  without  programming  [2]. 

In  painting,  we  define  paper  or  surfaces,  inks  and 
colors,  and  brushes  or  pens  allowing  gesture  multiple 
capture;  we  build  their  specifications  and  rules  of 
action  and  interaction. 

The  user  sensitivness  is  required.  He  tries  and  if  the 
effect  is  not  satisfying,  he  or  she  can  begin  again  the 
construction,  as  a  painter  is  choosing  his  tools  and 
palettes. 

4.2.2.  customization  in  the  way  of  using  the  tools. 
We  gave  in  [6]  the  qualities  required  for  man  machine 
interaction  tools;  the  tools  should  be  ‘well  in  hand’.  If 
you  change  the  way  you  are  moving  or  activating  a 
brush  or  a  tool,  the  result  should  be  different,  as  with 
real  mechanical  or  simple  tools:  robots,  hammers, 
knives,  .... 

So,  playing  the  violin  can  give  an  idea  of  the 
fact  that  ‘easy  to  use’  or  ‘easy  to  interpret’  are  full  of 
meaning.  Personal  qualities  and  training  are  the  base 
for  a  good  result.  The  violin  interpretation  takes  into 
account  many  physical,  mental,  emotional,  fashionable 
considerations. 

The  ‘hyperviolin’  (fig.3)  was  designed  to  measure 
human  performance  (testbed),  to  build  ever-more- 
powerful  entertainment  interactive  systems  for  the 
general  public.  It  is  a  kansei  analyse  [13]. 


Fig.3  :  A  kansei  ‘ hyperinstrument ’ 
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4.2.3.  customization  for  the  control.  In  context- 
based  systems,  with  many  degrees  of  freedom  (DOF) 
and  sensors,  the  control  is  permanent;  the  part 
depending  of  the  man  decision  is  reduced  by  the 
machine  control.  Context  parameters  should  be 
accessible  to  help  the  user  in  his  decision. 

Driving  cars,  planes  or  online  robots, 
painting,  playing  music,  are  self-controlled  activities. 
They  use  conscious  and  unconscious  levels  of  control; 
with  left  side  and  right  side  of  the  mind  [14]. 

4.3.  ‘DEVICE’  and  Kansei  technology 

In  each  of  these  three  levels  of  the  ‘Device’ 
method,  perception  and  sensitivity  may  guide  first  the 
customers. 

The  capture  with  several  degrees  of  freedom 
emphasizes  the  gesture  and  facilitates  personal 
expression.  It  is  as  well  a  mean  to  analyse  this 
expression,  like  with  the  hyperinstruments. 

Furthermore,  the  ‘DEVICE’  method  can  be 
extended  to  the  case  of  multimodal  input  :  co-verbal 
gesture,  or  fusion  of  gesture  with  other  modalities, 
vision  and  other  senses  [4], 

The  architecture  of  the  systems,  the  level  of  fusion, 
the  temporality,  the  priorities,  the  representation  of 
this  multimodality  have  to  be  studied. 

Through  a  user-definable  menu,  this  environment 
taking  into  account  in  a  general  way  sensibility  for 
personal  creation  of  functions  [15],  activated  by 
gesture  and  other  senses,  allows  the  user  to  enter  in 
the  machine  his  analyse  and  to  precise  preferences  in 
the  construction  of  the  tools  and  in  their  use;  we  are 
in  phase  with  Kansei  engineering  technology. 

"KANSEI  is  a  human  ability  to  derive  images 
from  stimulation,  and  to  express  them.  Also,  KANSEI 
is  affected  by  human  knowledge  and  emotion."  [13]. 

This  kansei  technology  tries  to  analyse  the  way 
people  use  their  senses  to  realise  or  to  do  something  - 
find  the  defects  in  a  production  of  glasses,  pronounce 
vowels,  playing  the  violin,  why  not  painting... 

Human  wants  to  be  able  to  work  his  way,  when 
possible.  In  this  study  it  implies  that  human  likes  to 
feel  he  has  some  influence  on  what  he  is  doing;  he  is 
free  to  do  things  the  way  he  wants;  we  try  to  give  him 
the  means  to  do  so  in  specific  domains. 

Even  with  CAD  systems,  people  take  different  ways 
to  realise  the  same  object. 


5.  The  proposed  ‘intelligent5  devices. 

We  have  designed  several  input  devices.  They 
are  being  developped  with  Lille  and  with  Toulouse 
Universities.  They  are  needed  for  paiting  and  to 
extend  our  cursive  handwriting  simulation  using 
personally  designed  alphabet  and  symbols.  The  letters 
are  joined  with  splines  [15].  The  cursive  rendering  had  to 
be  improved  in  a  user-definable  way  :  the  distance  between 
two  letters,  solutions  to  avoid  monotony... 

In  previous  papers,  we  have  proposed  several  kinds  of 
‘intelligent’  devices  [16,17]. 

6.  The  system  proposed  by  TOULOUSE 

The  system  proposed  by  the  Electronic 
Laboratory  of  the  Ecole  Nationale  Sup&ieure 
d'Electronique  d'Electrotechnique  dlnformatique  et 
d'Hydraulique  de  Toulouse  (ENSEEIHT)  is  based  on 
the  registering  by  triangulation. 

On  a  pen  ‘SMART  PEN’  with  which  one  can  register 
and  reproduce  movement,  we  fixed  three  laser  diodes 
in  a  pyramidal  geometry,  (fig.  4). 


Fig.  4  :  Three  laser  diodes  on  SMART  PEN 

These  three  diodes  create  three  traces  on  a 
translucent  screen.  A  video  camera  records  the  traces 
images.  The  center  of  gravity  is  calculated  by  a  system 
of  image  treatment;  a  localisation  calculation  allows 
to  get  the  position  and  orientation  of ‘SMART  PEN’. 

This  3D  localisation  system  using  laser  marking  is 
the  object  of  a  european  patent  (ref.  n°EP  0  385  528 
Bl).  Localisation  has  been  tho  object  of  many  studies  in 
both  cognitive  and  technical  points  of  view  [10,18,19]. 
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The  registration  of  the  hand  real  movement,  in 
position  and  in  orientation,  allows  to  reproduce  for 
example  the  graphics  created  by  a  pen  that  should  be 
a  the  tip  of  the  ‘SMART  PEN’  ,  in  the  same  way  as 
this  drawing  was  created. 

The  Electronic  Laboratory  of  the  ENSEEIHT 
and  the  Microelectronic  System  Design  team  are 
working  on  the  project. 

-  on  the  one  hand,  on  the  optimization  of  the  image 
process  for  a  real  time  process,  this  for  the  3D 
localisation  data  collection  and  process, 

-  on  the  other  hand,  on  the  real  time  reproduction  of 
the  created  drawing. 

The  modularity  of  the  process  will  allow  the 
optimisation  following  the  technological  evolution, 
because  the  team  is  working  on  the  part  ‘optimisation 
of  the  process’  to  do  the  best  between  ‘the  algorithm 
and  the  circuit’. 

The  quick  evolution  of  the  FPGA  (Fields 
Programmable  Gates  Array),  allows  to  face  quick 
adaptations  cheaper  than  those  imposed  by  the  ASIC 
(Applications  Specific  Integrated  Circuits). 

We  are  in  the  realisation  phase  of  a  prototype 
which  will  allow  us  to  validate  our  proposed  concepts 
in  C.A.D,  on  the  3D  localisation  part  (AUTOCAD), 
and  on  the  electronic  design  part  (circuits  CAD). 

7.  Conclusion 

M-M  interaction  is  a  mechatronics  field.  One  of 
its  new  trends  concerns  customization.  Next 
IEEE/Conference  IROS  99  is  on  ‘Human  and  Friendly 
Robots  with  high  Intelligence  and  emotional  quotients4. 

To  realise  customisation,  we  have  proposed 
previously  ‘DEVICE’,  a  user  definable  menu  based  on  the 
interpretation  of  gesture  multi-parameters  and  the  design  of 
new  tools  and  new  ‘intelligent’  functions. 

We  analyse  our  proposition  face  to  kansei 
technologies.  The  generalisation  of  our  method  to 
various  senses  is  going  in  the  way  of  these  kansei 
technologies  applied  to  computer  interactive 
systems.  They  are  a  central  part  of  the 
development  of  new  multimodal  interfaces. 

We  present  a  device,  SMART  PEN  allowing 
the  reproduction  of  a  created  drawing,  for  the  real¬ 
isation  of  computer  painting  with  live  brush  strokes. 
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Abstract 

In  this  study ,  vibration  signals  of  roller  ball  bearings , 
gears  and  drill  bits  are  studied  in  the  frequency  domain  to 
determine  the  condition  of  them.  Signal  generation 
models ,  characteristics  of  a  specific  fault  type  and 
signature  of  wearing  are  investigated  Power  spectrum  of 
the  signals  produced  by  these  models  are  computed  that 
are  typical  for  a  given  fault  type.  By  use  of  these  signal 
generation  models,  an  algorithm  is  developed  for 
predicting  a  fault  type  and  wearing. 


1.  Introduction 

Vibration  monitoring  is  an  act  of  extracting 
information  by  using  vibration  signals  of  a  machinery, 
which  contain  information  about  the  condition  of  the 
machinery’s  components.  As  these  signals  carry 
information  about  the  forces  that  created  them  and  about 
the  structural  path  through  which  they  propagated,  they 
can  be  used  for  predicting  the  condition  of  the 
components.  The  advantages  of  vibration  and  acoustic 
monitoring  are  as  follows:  it  is  nondestructive  and 
nonintrusive,  could  be  implemented  on-line,  and 
instrumentation  can  be  done  by  off-the-shelf  reliable 
devices.  A  review  about  these  techniques  can  be  found  in 
[1]  and  [2]. 

The  study  presented  here  is  formed  of  three  application 
fields:  diagnosis  of  faults  in  roller  ball  bearings,  diagnosis 
of  faults  in  gears,  and  diagnosis  of  wearing  in  drill  bits. 
The  outline  of  this  paper  is  as  follows:  in  the  first  part,  the 
roller  ball  bearings  are  considered,  and  the  following 
signal  generation  models  are  obtained  following  the  works 

[4] »  [6]>  [7]:  rotational  speeds  and  frequencies  of  inner  and 
outer  races,  the  cage  and  the  balls,  the  frequency 
generation  model  of  a  localized  defect  in  a  race,  effect  of 
loading  distribution,  effect  of  the  ellipticity  of  the  races, 
effect  of  unequal  rotating  elements,  and  effect  of  a 
misaligned  race.  In  the  second  part,  the  gears  are  studied, 
and  the  following  signal  generation  models  are  considered 

[5] ,  [8]:  spectral  distribution  of  tooth-tooth  variation, 


eccentricities  of  the  gears,  effect  of  misaligned  or  bent 
shaft  The  methodologies  followed  for  the  analysis  of 
signals  of  gears  and  roller  ball  bearings  are  identical  as 
follows:  firstly,  signal  generation  models  are  derived  and 
then  simulated  by  use  of  the  Matlab  software;  the 
simulation  results  are  compared  with  the  predictions 
based  on  analytical  models  and  with  some  experimental 
data.  In  the  third  part,  following  the  work  [3]  drill  bit 
wearing  is  considered  in  frequency  domain.  Finally,  a 
diagnosis  program  written  in  die  Matlab  that  analyzes  the 
frequency  spectrum  of  a  given  signal  to  detect  the  possible 
faults  and  wearing  is  presented.  Vibration  monitoring  is 
commonly  used  for  prediction  of  fruits;  however,  it  is 
generally  attempted  to  establish  connections  between  the 
peaks  in  the  power  spectrum  or  cepstrum  and  rotational 
frequencies  of  the  elements.  In  our  approach,  the  signal 
pattern  of  a  fruit  is  searched  in  the  power  spectrum  and 
cepstrum,  which  is  more  suitable  for  automation. 

2.  Vibration  generation  models  for  roller-ball 
bearings 

In  this  section,  following  [4],  [6]  and  [7],  simplified 
models  for  vibration  generation  mechanisms  in  a  roller 
ball  bearing  are  presented.  For  the  roiling  motion, 

generated  signals  have  the  following  frequencies 

fc=j4(i-r)+jfor(i+r)  (i) 

fb=^-(1-yK1+y)(for-fir)  (2) 

-up 

where  y  =  (db  / dp)cosa,  db  and  dp  are  the  ball  and  pitch 
diameters,  a  is  the  contact  angle,  and  fc,  fb,  for  and  fir 
are  respectively  the  rotational  frequencies  of  the  cage, 
rollers,  and  inner  and  outer  rings. 

Vibration  generation  model  for  a  single  localized 

defect  i  can  be  expressed  as  follows 

xi(t)  =  2]gi(t-jTi)u(t-jTi)  (3) 

j 

where  j  is  the  index  of  the  period,  gj(t)  is  the  vibration 
response  upon  the  impacting  with  the  defect  and  Tj  is  the 
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repetition  period.  For  instance,  for  a  defect  on  the  outer 
ring,  for  =0  and  K  rolling  elements,  -  l/(Kfc).  Note 
that  Xj(t)  has  components  with  discrete  frequencies  of 
n  /  Tj,  n  =  1,2,3,... .  On  the  other  hand,  the  intensity  of  the 
impulse  generated  by  the  localized  defect  is  proportional 
to  the  load  on  the  rolling  element  at  the  time  of  impact. 
Therefore,  the  loading  distribution  is  multiplied  by  the 
intensity 

Q  =  Qmax(l-^(l-C0S<t,)]  (5) 

where  S  is  the  load  distribution  factor,  m  is  a  factor  in  the 
range  1-1.5  depending  on  the  type  of  rolling  element,  and 
the  variations  of  the  load  as  a  function  of  the  number  of 
rolling  elements  in  the  loading  zone  are  neglected.  For  a 
rotating  inner  ring,  <(>  =  2nfA  as  long  as  it  stays  within 
the  loading  zone,  that  results  in  a  time-dependent  periodic 
loading  function  Q(t).  For  example,  for  an  impact 
frequency  f  and  rotating  outer  ring  at  frequency  fOT,  we 
obtain  the  signals  having  the  following  frequencies:  nf 
for  impacting  frequencies,  and  nf{  ±  mfor  for  the 
sidebands,  whose  magnitudes  depend  on  the  load. 

In  case  of  unequal  rotating  elements,  the  contact  force 
will  have  the  frequencies  nfr  and  nKfr  whose  magnitude 

proportionality  factors  are  respectively  (Pf -P0)/(P0n2) 

and  (l/nK):.  On  the  other  hand,  a  misaligned  race  will 
produce  signal  components  of  the  frequencies  nKfr  and 
nKfr±2fr  whose  magnitude  proportionality  factors  are 
(1  /  nK):  and  p/(nK±2)\  Finally,  an  elliptic  race  would 
yield  signals  of  the  nKfr  and  nKfr  ±  fr  whose  magnitude 
proportionality  factors  are  (l/nK)2  and  p/(nK±l)\  In 
these  formulas,  P0  is  the  equal  contact  forces  on  an 
element,  -  P0  is  the  unequal  element  force,  p  is  a 
constant  to  represent  defect  severity.  Note  also  that  a  good 
bearing  generates  frequencies  of  nKfr  due  to  balls 
entering  and  leaving  the  loading  zone  whose  magnitude 
proportionality  factor  is  (1/  nK)\ 

3-  Vibration  generation  models  for  gears 

In  this  section,  vibration  generation  models  of  a  gear 
pair  are  presented  following  the  works  [5],  [8].  Note  that 
there  exist  forces  with  the  basic  period  1  /  fb  due  to  tooth 
meshing,  where  1  /  fb  is  the  time  necessary  for  N  rotations, 
defined  by  the  minimum  number  of  rotations  of  the 
smaller  gear  necessary  to  achieve  the  said  recurrence. 
Hence,  N  is  defined  as  the  numerator  of  the  fraction  (>1) 
describing  the  gear  ratio  after  dividing  out  all  common 
factors  from  the  numerator  and  denominator  [2].  In  sum, 
fb  is  the  frequency  of  tooth-tooth  variation  of  the  force, 


and  fm  is  the  mesh  frequency. 

On  the  other  hand,  eccentricities  in  the  driving  and 
driven  gears  1  and  2  having  the  rotating  frequencies  fx 
and  f2  would  generate  signals  of  the  following 
frequencies 


that  have  peaks  around  the  multiples  of  fffl,  where  k„  k2, 
1,  and  12  are  all  integers.  Note  also  the  following 
frequency  components:  mechanical  looseness  2fr , 
damaged  gear  and  imbalance  fr ,  and  misaligned  or  bent 
shaft  f,  and  often  2fr . 

4.  Effects  of  wearing 

It  is  observed  in  experimental  data  that  due  to  wearing 
the  magnitude  of  the  power  spectrum  of  the  vibration 
signals  increases  up  to  full  acoustic  emission  range.  For 
instance,  typical  power  spectrum  plots  that  belong  to  the 
vibration  signals  in  drilling  of  AISI  1045  steel  by  using  a 
9.5  mm  HS  drill  are  given  in  Figures  1  to  3.  Observe  that 
the  average  power  spectrum  level  of  vibration  signals  of 
drill  bits  increases  almost  uniformly  in  all  frequency 
ranges  as  wearing  develops.  Thus,  drill  bit  wearing  can  be 
determined  by  monitoring  the  vibration  or  acoustic  signals 
and  analyzing  them  in  the  frequency  domain. 

5.  Frequency  domain  signatures  of  signal 
models 

The  frequency  domain  signatures  of  signal  generation 
models  presented  in  Sections  2  and  3  will  be  given  in  this 
section  through  examples. 

5.1.  Roller  ball  bearing  models 

Consider  a  roller  ball  bearing  ORS  6413  in  a  gear  box 
having  the  following  specifications:  nr  =  1450  rpm, 

K=t2,  dp  =160  mm,  db  =29  mm,  a  =  30°,  and  the 

outer  race  is  constant  while  the  inner  race  is  rotating  with 
the  shaft.  Accordingly,  (l)  and  (2)  yields  that  fjr  =  0, 

f  =f,  and  y  =  (db  / djcosa  =  0.15735,  fc  =  10.182 
Hz,  fb  =2.141  Hz;  hence,  corresponding  power  spectrum 
would  have  two  peaks  at  f  =  10.182  Hz  if  a  local  defect 
exists  on  the  outer  race  and  f  =  2.141  Hz  if  a  local  defect 

o 

exists  on  a  ball.  On  the  other  hand,  a  good  bearing  would 
have  frequencies  at  nKf  =  145n  Hz  having  the  magnitude 

proportionality  factor  l/(nK)\  n=  1,2,3,...,  whose  time 
vs.  vibration  signal  plot  and  corresponding  power 
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spectrum  are  respectively  shown  in  Figures  4  and  5. 

For  unequal  element  forces,  P.  =  103  kN  and 
P.  =  25.75  kN.  Then,  Corresponding  frequencies  are  nKfr 
and  nfr  whose  magnitude  proportionality  factors  are 
respectively  l/(nK)2  and  (p. -P0)/(P0n2),  n=  1,2,3,.... 
Its  time  vs.  vibration  signal  plot  and  corresponding  power 
spectrum  are  respectively  shown  in  Figures  6  and  7. 

For  a  misaligned  race,  corresponding  frequencies  are 
nKf  and  nKf  ±2fr  whose  magnitude  proportionality 
factors  are  respectively  l/(nK)2  and  p/(nK±2)2, 
n  =  1,2,3,....  Its  time  vs.  vibration  signal  plot  and 
corresponding  power  spectrum  are  respectively  shown  in 
Figures  8  and  9.  Similarly,  the  frequencies  of  an  elliptic 
race  are  nKf  and  nKf  ±ff  whose  magnitude 

proportionality  factors  are  respectively  l/(nK)2  and 
P  / (nK±  l)2,  n  =  1,2,3,....  Its  time  vs.  vibration  signal  plot 
and  corresponding  power  spectrum  are  respectively  shown 
in  Figures  10  and  11. 

5.2.  Gear  models 

Consider  a  gear  pair  in  a  gear  box  having  the  following 
specifications:  nr  =  1450  rpm,  number  of  teeth  of  die 
driving  gear  zx  =  20  and  number  of  teeth  of  the  driven 
gear  z2  =100.  Accordingly,  the  reduction  ratio  i  =  5, 
fb  =  24.166  Hz.  and  fn  =  ifb,  whose  power  spectrum  is 
shown  in  Figure  12.  Notice  tooth-tooth  variations  and 
modulation  effect. 

Eccentricity  in  one  of  the  gears  would  generate  a  signal 
in  time  domain  shown  in  Figure  13  whose  power  spectrum 
is  given  in  Figure  14.  Finally,  the  signal  originating  from  a 
local  defect  in  a  tooth  and  its  power  spectrum  are 
respectively  given  in  Figures  15  and  16. 

6.  The  expert  system  for  fault  diagnosis 

In  this  section,  an  expert  system  for  fruit  diagnosis  that 
utilizes  the  signal  generation  models  presented  in  Sections 
2  and  3  to  predict  faults  is  presented.  The  expert  system 
program  developed  in  Matlab  works  in  the  following 
manner:  first,  given  the  necessary  information  about  the 
system,  the  potential  frequency  components,  i.e.,  the 
location  of  peaks  in  frequency  domain,  are  calculated  for 
each  fault  type.  By  using  this  result,  the  program  fits  a 
spline  between  the  two  peaks  of  the  signal  generation 
model  for  a  fault  candidate  plus  a  priori  white  noise  input. 
Then,  these  peaks  are  searched  and  the  deviation  from  the 
spline  is  calculated  in  the  power  spectrum  or  cepstrum  of 
the  vibration  signals.  If  it  matches  with  the  experimental 
data  and  the  value  of  the  peak  is  higher  than  a  threshold 
value,  the  program  gives  a  warning.  In  order  to  detect 
wearing,  the  average  of  spectrum  of  the  raw  data  is 


calculated  and  compared  with  a  priori  threshold  values  to 
determine  the  condition  of  the  component. 

7.  Conclusions 

In  this  work,  signal  generation  models  in  roller  ball 
bearings  and  gears  are  derived,  and  simulated  by  use  of 
the  Matlab  software.  The  simulation  results  are  compared 
with  the  predictions  based  on  analytical  models  and  with 
some  experimental  data  in  literature.  It  is  found  that  the 
signal  generation  models  are  sufficient  to  be  employed  for 
detecting  associated  signal  sources,  which  explain  the 
underlying  mechanisms  behind  the  signals  and  their 
results  match  with  the  experimental  results.  Then,  an 
algorithm  for  detecting  the  fruits  considered  in  this  study 
is  tested  on  experimental  data.  Unlike  common  vibration 
monitoring  applications  that  usually  attempt  to  establish 
connections  between  the  peaks  in  die  power  spectrum  or 
cepstrum  and  rotational  frequencies  of  the  elements,  the 
signal  pattern  of  a  fault  is  searched  in  the  power  spectrum 
and  cepstrum  in  our  approach,  which  is  more  suitable  for 
automation. 

It  is  observed  that  the  proposed  approach  works  well  if 
the  signals  associated  with  the  faults  are  distinct  in  the 
frequency  domain  and  signal-to-noise  ratio  is  high.  In 
addition,  using  power  cepstrum  rather  than  power 
spectrum  of  the  experimental  data  for  diagnosis  yield 
better  results.  If  two  fruits  have  the  same  frequency 
component,  time  domain  techniques  are  necessary  for 
detecting  the  faults.  On  the  other,  it  is  possible  to  detect 
the  fruits  by  using  the  signal  generation  models:  however, 
a  good  knowledge  about  the  sample  space  and  threshold 
values  of  frequency  components  is  needed  to  decide  about 
a  machinery  whether  it  is  faulty  or  not. 
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Abstract 

An  approach  to  the  construction  of  a 
intelligent  environment  for  a  real-time 
systems  development  based  on  a  object- 
oriented  technology  have  been  considered ' 
Multi-layer  loigc  is  selected  as  the 
knowledge  representation  formalism.  The 
intelligent  tool  for  real-time  systems 
development  on  the  stages  of  analysis  and 
design  have  been  constructed. 

1.  Introduction 

At  the  present  time  the  designing  of  the 
environments  for  real-time  systems  development  is  a 
necessity.  This  environment  must  be  integrated 
system  intended  for  automating  the  entire  life  cycle 
of  the  real-time  systems  from  the  analysis  stage  to 
that  of  maintenance.  The  intelligent  approach  to 
design  of  the  real-time  systems  development 
environment  based  on  a  object-oriented  technology 
is  now  the  most  perspective  one. 

We  will  consider  one  approach  to  the  design  of 
the  an  intelligent  environment  for  the  development  of 
the  real-time  systems. 

2.  The  Approach  to  Design  of  Real-Time 

Systems  Development  Environment 

We  will  consider  the  life  cycle  of  the  real-time 
systems  development  based  on  the  object-oriented 
technology [1,2].  The  object  classes,  the  object,  the 
relations  between  them  and  the  object  classes 
behaviour  are  describe  on  the  analysis  stage  of  the 
object-oriented  systems  development  life  cycle.  The 
structures  of  the  objects  and  them  activities  are  give 
on  the  design  stage  of  the  considering  life  cycle.  The 
program  code  of  the  object-oriented  system  is  create 


on  the  programming  stage  of  the  considering  life 
cycle,  also  the  testing  of  the  program  code  is  make. 
The  inculcatment  and  the  renovatment  of  the  object- 
oriented  system  is  make  on  the  maintenance  stage. 

At  present  time  the  designing  of  the 
environment  for  the  development  of  the  real-time 
systems  based  on  the  object-oriented  technology  is  a 
necessity.  The  creation  of  an  intelligent  environment 
has  a  number  of  advantages  compared  with  the 
traditional  approach  to  the  development  of  a 
workbench,  among  which  we  can  distinguish  the 
following: 

•  the  opportunity  to  represent  knowledge  that 
constitutes  a  generalization  of  the  developer 
experience  accumulated  in  the  process  of  the 
object-oriented  systems  design,  and  access  to  this 
knowledge  in  the  process  of  the  object-oriented 
systems  development; 

•  the  creation  of  new  knowledge  about  from 
knowledge  represented  in  the  workbench,  which 
can  help  to  developer  in  designing  the  system; 

•  the  opportunity  to  verify  consistency  of 

knowledge; 

•  the  opportunity  to  obtain  answers  to  questions 
relating  to  the  states  of  the  developing  object- 
oriented  system  (for  example,  which  the  object 
classes  and  object  have  been  designed,  and  which 
have  not,  what  kinds  of  errors  have  occurred, 
etc.)  and  its  workbench  (what  kinds  of  failures 
have  occurred  in  the  hardware  and  software 
products,  what  kind  of  emergencies,  etc  ),  which 
facilitates  the  the  object-oriented  system  design 
procedure; 

•  the  opportunity  to  predict  the  results  of  design 
operations; 

•  the  opportunity  to  select  repeatedly  used  the 
object  classes  that  satisfy  specified  requirements. 

The  central  moment  of  the  real-time  systems 
development  is  problem  domain  modelling.  At  the 
present  time  is  a  necessity  the  modelling  of  the 
hierarchical  problem  domains,  having  complex 
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control  processes.  For  a  modelling  of  the  modem 
problem  domains  the  choice  the  knowledge 
representation  models,  which  is  available  for  the 
description  of  them,  is  necessary. 

3.  A  Formal  Model  of  a  Problem  Domain 

As  a  fonnal  model  of  a  modern  problem  domain  we 
are  proposing  a  hybrid  model  that  combines  the 
"program  engineering”  paradigm  [1,2, 3, 4]  and  the 
"knowledge  engineering”  one[5,6],  i.e.  an  object- 
oriented  approach  [7,8]  and  procedures,  and  a  model 
for  representing  knowledge  about  a  problem  domain, 
should  be  used  in  order  to  represent  the  formal 
description  of  the  problem  domain  in  the  intelligent 
tool  of  modelling  [9,10,11,12]. 

We  are  proposing  the  hybrid  model,  based  on 
logical  model  and  product  model,  as  a  knowledge 
representation  in  a  modern  tool  of  the  processes 
modelling.  A  logical  model  based  on  multi-layer 
logic  (briefly  MLL)  [13,14]  should  be  used  as  a 
formalism  of  the  modelling  of  a  static  aspect  of  a 
problem  domain.  A  production  model  should  be  used 
as  a  formalism  of  the  simulation  of  a  dynamic 
aspect  of  a  problem  domain. 

MLL  is  an  integration  of  the  logical  approach 
and  an  approach  based  on  semantic  networks.  It  may 
be  considered  as  an  object-oriented  first-order 
predicate  calculus  that  describes  knowledge 
structuring  and  aggregation. 

To  represent  the  IS-A  and  Part-of  hierarchies  in 
MLL  we  use  the  hierarchical  abstraction  [13],  which 
is  given  in  the  form  of  a  graph  whose  vertices 
correspond  to  classes  of  objects  or  their 
representatives,  and  whose  edges  correspond  to  the 
"class-subclass”  relations  or  to  "part-whole” 
relations.  A  hierarchical  abstraction  consists  of 
several  levels.  The  concept  of  a  level  in  a 
hierarchical  abstraction  corresponding  to  the  "class- 
subclass”  relation  is  based  on  the  inheritance  of 
properties.  In  the  hierarchical  abstraction 
corresponding  to  the  "part-whole”  relation,  the 
concept  of  a  level  is  based  on  the  principle  of 
decomposition.  The  class  of  objects  and  the  classes  of 
which  a  domain  consists  are  arranged  on  different 
levels.  The  attributes  of  classes  of  objects  or  their 
representatives  (objects)  and  the  relations  between 
classes  of  objects,  excluding  the  structure  relations, 
may  be  described  in  the  hierarchical  abstraction 
either  as  individual  predicates  or  as  well -formed 
formulas  (WFF). 

To  represent  the  Part-of  hierarchies  of 
representatives  in  MLL  we  use  the  hierarchical 
structure  [13].  As  in  the  hierarchical  abstraction,  the 
concept  of  a  level  is  based  on  the  inheritance  of 
properties.  Classes  of  objects  in  relation  that  inherit 
properties  are  arranged  on  different  levels.  Thus, 
classes  of  objects  and  their  representatives,  linked 


together  by  the  relation  "Element-of  ’  ,  are  arranged 
on  different  levels,  while  classes  of  objects  that  are 
linked  together  by  the  relation  "Component-of’  are 
all  on  the  same  level,  since  they  do  not  exhibit  any 
inheritance  of  properties. 

The  hierarchical  abstraction  allows  to  describe  a 
control  object  on  the  various  control  levels.  The  owns 
methods  of  the  object-oriened  modelling  of  real-time 
systems  and  the  owns  visual  tools  are  used  on  the 
every  control  level:  Us,  the  hierarchical  abstraction 
is  the  integrated  center  between  the  various 
modelling  methods  and  the  various  visual  tools, 
allowed  to  made  the  transitions  from  one  of  the 
control  level  to  other  or  from  ones  of  the  modelling 
methods  to  others.  This  approach  allows  to  increase 
the  visual  power  of  the  modelling  tools  and  . 
therefore,  to  increase  the  control  efficacy. 

By  a  slash  we  will  mean  a  kind  of  delimiter  used 
in  prefix  of  a  formula.  Thus,  the  simple  slash  (Qx/X) 
is  used  to  denote  that  x  is  an  element  of  the  set  X  (x 
e  X),  the  simple  "thick"  slash  (Qx/  X)  denotes 
that  x  is  defined  on  a  set  whose  elements  are  the 
components  of  the  object  X  (XV  x).  while  the 
double  slash  (Qx//X)  denotes  that  x  is  defined  on  a 
set  whose  elements  are  part  of  the  object  X  (X  >  x) 
[13]. 

We  was  developed  the  extension  of  MLL  syntax, 
which  allows  to  increase  efficacy  of  the  deductive 
inference.  The  extented  MLL  syntax  is  represented. 

Alphabet  : 

(1)  constants:  a.b.c . X.Y.Z  (constant  sets).... 

(2)  variables:  x,y,z,... 

(3)  function  symbols:  f,g,h.... 

(4)  predicate  symbols  :  P,Q,R..., 

(5)  quantifier:  V,  3 

(6)  logic  connectives:  — i,  &.  V.  <-> 

(7)  auxilliaiy  symbols:  #.*,/.  /.//.[.}.(.) 

Terms: 

(1) .  Any  constant  and  variable  are  terms. 

(2) .  If  f  is  a  n-ary  function  symbol  and 
tvt2,...,tn  are  terms,  then /(/,,  t2, .. . ,  tn)  is 
a  term. 

(3) .  All  terms  are  obtained  by  applying  the  rules 
(1)  and  (2). 

Rules  of  designing  WFF  for  extented  MLL 
syntax  is  represented. 

FL  If  P  is  a  n-ary  predicate  symbol  and 
t]f  t2, . . . ,  /„are  terms,  then  P(t{,  t2, ...  ,  1  n)  is 
WFF. 

F2.  If  F  and  G  are  WFF. 
-i F, F& (IF  v  G\ F->G  are  WFF. 

F3.  If  F  is  a  WFF  and  x  is  an  object  variable. 

then 

(1).  (Vx  /  y)F  and  (3x  /  y)F  are  WFF.  where  y 
is  a  constant  or  variable. 


235 


(2) .  (Vx  /  y)F  and  (3x  /  y)F  are  WFF,  where  y  is 
a  constant  or  variable. 

(3) .  (Vx  //  y)F  and  (3x  /  /y)F  are  WFF,  where  y 
is  a  constant  or  variable. 

(4) .  (V(x  !Z)H  y)F  and  (3(x/Z)  /  /y)F  are  WFF, 
where  y  is  a  constant  or  variable,  Z  is  a  constant  set. 

F4.  There  are  no  other  rules  of  designing  WFF. 
An  inference  algorithm  for  extented  MLL 
syntax  has  been  developed. 

At  the  present  time  KM  (Knowledge  Model) 
tool  as  teal-time  systems  development  tool  on  the 
stages  of  analysis  and  design  based  on  deductive 
inference  in  MLL  was  designed. 

4.  A  Hierarchical  Abstraction  and  a 
Production  Model 

To  represent  the  control  process  in  hierarchical 
problem  domain  we  use  the  hierarchical  abstraction. 
The  hierarchical  abstraction  allows  the  production 
rules  to  break  on  the  blocks  in  corresponding  to  the 
elements  of  the  abstraction  and  to  use  the  mechanism 
of  the  production  rules  inheritance.  This  mechanism 
is  consider.  Let  us  the  hierarchical  abstraction,  which 
describe  a  u  Airport”  problem  domain,  is  given.  This 
hierarchical  abstraction  consists  of  the  3-th  layers  of 
the  partitioning,  a  namely: 

•  1-th  layer  -  “airport”  class; 

•  2-th  layer  -  “radiolocation  station”  (RLS)  class, 
“airplane”  class; 

•  3-th  layer  -  RLS  class  representatives,  “ariplane” 
class  representatives. 

The  production  rules  is  broken  on  the  blocks  in 
corresponding  to  the  layers  in  the  hierarchical 
abstraction,  a  namely: 

•  the  production  rules,  which  describe  the 
management  process  of  the  whole  airport,  is 
included  in  the  block  of  the  1-th  layer; 

•  the  production  rules,  which  describe  the  RLS 
class  management  process  and  the  “airplane” 
class  management  process,  is  included  in  the 
block  of  the  2-th  layer; 

•  the  3-th  layers  blocks  number  is  defined  by  the 
number  of  the  representatives  of  RLS  class  and 
"airplane”  class. 

The  production  rules,  which  is  given  specify 
management  laws  by  the  airplane  and  RLC,  is 
included  in  the  blocks  of  the  3-th  layer.  The  other 
production  rules,  which  is  given  common 
management  laws  by  the  RLSs  and  airplanes,  can  be 
received  owning  inheritance  mechanism. 

Thus,  the  hierarchical  abstraction  allows  to 
describe  the  production  rules  hierarchy  and  to  use  the 
inheritance  principle  of  the  production  rules,  similar 
the  mechanism  of  inheritance  of  properties  in  ISA- 
hierarchy. 


5.  KM  as  a  Intelligent  Tool  of  Real-time 
Systems  Development  on  Analysis  and 
Design  Stages 

KM  as  a  intelligent  tool  of  real-time  systems 
development  on  analysis  and  design  stages  is 
developed.  KM  tool  allows : 

•  to  construct  the  ISA-hierarchy  of  object 
classes; 

•  to  define  the  object  classes  attributes  and  the 
relations  between  object  classes; 

•  to  construct  the  Part-of  -hierarchy  of  object 
classes; 

•  to  define  the  classes  representatives; 

•  to  construct  the  representatives  Part-of  - 
hierarchy; 

•  to  browse  the  class  objects  and  the 
hierarchies; 

•  to  describe  the  logic  formulas; 

•  to  receive  answer  to  queries. 

The  main  components  of  the  system  KM  are: 

•  subsystem  of  the  problem  domain  models 
controlling; 

•  subsystem  of  modelling; 

•  subsystem  of  browser; 

•  knowledge  base; 

•  subsystem  of  the  deductive  inference. 

The  example  of  the  logic  formula,  represented 
in  the  knowledge  base,  is  considering. 

Example. 

A  program  component  x,  including  in  the 
control  support  system  (CSS)  #P,  provides 
landing  an  airplane  y,  assigned  to  the  airport 

•  A,  if  there  are: 

•  PC  t,  in  which  x  operates: 

•  a  radio-location  station  (RLS)  s,  connected 
with  PC  t; 

•  information  flow  #11,  containing  message 
flow  tl,  accepted  by  RLS  s  and  containing 
message  class  rl,  describing  an  airplane  y 
and  processing  by  a  program  component  x; 

•  information  flow  #12,  produced  by  a 
program  component  x  and  accepted  by  an 
airplane  y,  containing  message  flow  t2. 
transferring  RLS  s,  in  which  there  is  a 
message  class  r2,  containing  information 
about  airplane  landing  y. 

The  structures  of  the  problem  domain  is 
represented  in  Fig.l. 

Formalization  in  MLL  with  extented  syntax: 

(3  (x/program_component)//#P) 

( V (y/ airplane)// #0)  . 

(3(s/RLS)//#A)(3(t/PC)//#A) 

(3  (t  1  /message_flow)//#I  1 ) 

(3  (r  1  /message_class)//t  1 ) 

(3  (t2/message_flow)//#I2) 
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(3  (r2/message_class)//t2) 

Operate(x,t)  &  Connect(s,t)  &  Accept_RLS(s,tl) 
&  Describe^  l,y)  &Process(x,rl)  & 

Transfer_RLS(s,t2)  &  Produce(x,r2)  & 
Accept_airplane(y,r2)  ->  Landing(x,y) 

The  subsystem  of  deductive  inference,  based  on 
inference  in  MLL,  is  served  for: 

•  logical  verification  of  information 
represented  in  the  knowledge  base; 

•  obtaining  attribute  values  and  extensions  of 
relations  what  allows  to  “compress”  the 
extensional  component  of  the  knowledge 
base: 

•  receiving  the  new  knowledge  from 
knowledge,  represented  in  the  knowledge 
base: 

•  obtaining  answers  to  queries. 

The  conceptual  language  for  modeling  of 
hierarchical  problem  domain  was  developed.  The 
translator  of  queries  with  conceptual  language  to 
MLL  has  been  designed. 

The  system  KM  supports  “a  free  connection” 
[15]  of  KB  and  DB  under  Paradox  DBMS.  Such 
implementation  of  the  system  KM  provides  using  all 
opportunities  Paradox  DBMS  such  as  distributed 
processing,  high  performance,  complex  verification, 
supporting  of  integrity  and  safety  of  data,  failure  and 
recover}7  of  DB,  supporting  of  very  large  DB. 

The  system  KM  is  running  under  Windows’95. 
The  development  language  of  the  system  KM  is 
Borland  C++. 

6.  Acknowledgments 

We  would  like  to  thank  our  Teacher  prof.  Vadim 
Nikolaevitch  Vagin  for  big  help  in  this  work. 

7.  Conclusions 

We  have  considered  a  approach  to  constructing  an 
intelligent  tool  of  real-time  systems  development  on 
the  analysis  and  design  stages.  The  hybrid  model, 
based  on  logic  model  and  production  model,  for 
description  of  management  processes  in  a 
hierarchical  problem  domain  is  proposed.  The 
intelligent  tool  based  on  the  hybrid  model  is 
developed. 

This  work  was  supported  by  the  Russian  Fund 
of  Fundamental  Researches  (project  code  99-01- 
00049). 


8.  References 

[1]  Cooling  J.E.  Software  Design  for  Real-time  Systems.  - 
Chapman  and  hall  (University  and  Professional  Division), 
1991. 

[2]  Peter  A.Ng.,  Raymond  T.  Yeh  eds.  Modem  Software 
Engineering.  Foundation  and  Current  Perspectives..  - 
VAN  NOSTRAND  REINHOLD,  New  York,  1 990. 

[3]  Harel  D.  et  al.  “Statemate:  a  Working  Environment  for 
the  Development  of  Complex  Reactive  Systems.  ”  IEEE 
Transactions  on  Software  Engineering ,  Vol.  16.  No.  4, 
1990,pp.403-413.  ’  - 

[4]  Robert  V.  R.,  Walker  J.,  Golin  E.J.  '‘Early  Experience 
with  the  Visual  Programmer's  Workbench.  'IEEE 
Transactions  on  Software  Engineerings  1990,  vol.  16, 
No.  10,pp.  32-56. 

[5]  Eisenstadt  M.,  Domingue  J.,  Rajan  T.,  Motta  E. 
'‘Visual  Knowledge  Engineering. "'IEEE  Transactions  on 
Software  Engineering,  1990,  Vol.  16,  No.  10  (Oct.). 

[6]  Symods  A.J.  '‘Creating  a  Software-Engineering 
Knowledge  Base.”  hi  Software  Development  Computer- 
Aided  Software  engineerig  (CASE),  Chirofskv  E.J  ed.- 
EEEE  Computer  Society  press  technology  series,  1 992. 

[7]  Gibson  E.  “Objects  -  Bom  and  Bred”.-  BYTE .  1990. 
October. 

[8]  Shlaer  S.  and  Mellor  S.J.  "  Object  Lifecycles:  Modeling 
the  World  in  State.?",Prentice-Hall,  Englewood  Cliffs. 
N.J.,1992. 

[9]  Eisenstadt  M.,  Brayshaw  M.  “A  Knowledge 
Engineering  Toolkit.”  BYTE,  WO,  vol.  1 5,  No.  10,12. 

[10]  Yokoyamat  “An  object-oriented  and  constraint-based 
knowledge  representation  system  for  design  object 
modeling.”  I  COT  Research  Center  ,Tokyo,  Japan,  1989. 

[11]  Vagin  V.N.,.  Golovina  E.Yu,  Salapina  N.O. 
“Intelligent  CASE  for  Decision  Making  Systems",  hi 
Proceedings  of  JCKBSE'96.  Second  Joint  Conference  on 
Knowledge-Based  Software  Engeneering.  (Sozopol, 
Bulgaria.  September  2 1-22),  1996,pp.  122-127. 

[12]  Vagin  V.N.,  Viktorova  N.P.,  Golovina  E.Yu.  “Multi¬ 
layer  Logic  as  a  Knowledge  Representation  Model  in  the 
CASE  System.”  Journal  of  Computer  and  Systems  Sciencis 
International- 1 995,Vol.33,  No.3,pp.72-83. 

[13]  Ohsuga  S.,  Yamauchi  H.  “Multi-layer  logic  -  a 
predicate  logic  including  data  structure  as  knowledge 
representation  language.”  New  generation  computing , 
1985,  Vol.3,-N0.4,pp.45 1-485. 

[14]  Ohsuga  S  “Toward  inlelligent  CAD 
systems. ’Y’owpwfcr  Aided  Desing ,  1989,  Vol.21,-NO,5,pp. 
315-337. 

[15]  Yamauchi  H.,  Ohsuga  S.  “Loose  coupling  of  KAIJS 
with  existing  RDBMSs.”  Data  $  Knowledge  Engineering . 
5, 1991, pp.227-251. 


237 


SUN  1000  SUN  10000  Pentium  133 

RLS 1 


-  “Component-of  ”  relation  (  V  ) 

-  “Element-of  ”  relation  (e  ) 


RLS2  RLS3 


-  “Part-of  ”  relation  ( ►  )  Figure  1 .  The  structures  of  the  problem  domain 


238 


The  2nd  International  Conference  on  Recent  Advances 
in  Mechatronics,  Istanbul,  Turkey,  May  24-26, 1999 


©  UNESCO  Chair  on  Mechatronics 
BogaziQi  University,  Istanbul,  Turkey 


Decentralized  Controllers  for  SCARA  Robot  Trajectory  Tracking:  an 

Experimental  Comparison  * 

Antonio  Visioli^  Bruno  Zappa§  Giovanni  Legnani^ 


^Dipartimento  di  Elettronica  per  l’Automazione 
University  of  Brescia  -  Italy 
e-mail :  visioli@bsing.ing. unibs . it 


Abstract 

We  present  an  experimental  comparison  between  dif¬ 
ferent  decentralized  controllers  for  the  trajectory  track¬ 
ing  of  a  two  degrees- of- freedom  SCARA  robot ,  which 
is  one  of  the  most  adopted  in  industrial  settings,  espe¬ 
cially  for  assembly  tasks.  Specifically,  a  nonlinear  three 
term  control  law  and  the  Discontinuous  Integral  Con¬ 
trol,  which  is  a  kind  of  Variable  Structure  Control,  are 
examined  and  compared  with  the  typical  PID  control. 
The  parameters  tuning  aspect,  as  well  as  the  choice 
of  the  control  frequency  in  the  digital  implementation 
and  the  chattering  effect  have  been  analyzed.  Results 
show  how  the  Discontinuous  Integral  Control  is  a  good 
candidate  in  replacing  PID  control  in  industrial  envi¬ 
ronments. 

1  Introduction 

It  is  well-known  that  dexterous  and  skilled  motions 
in  industrial  manipulators  need  reliable  and  robust 
control  in  order  to  assure  an  accurate  path  tracking. 
Although  in  general  a  robot  manipulator  is  an  highly 
nonlinear  and  coupled  system,  nonlinear  model-based 
controllers  are  not  generally  adopted  in  industrial 
environments,  because  the  estimation  of  a  sufficiently 
accurate  model,  that  guarantees  high  performances, 
is  a  very  hard  task  and  adaptive  control  laws  [1]  seem 
difficult  to  be  applied  because  their  performances  are 
limited  by  the  presence  of  noise  and  of  unstructured 
uncertainties. 

Hence,  in  spite  of  the  great  effort  provided  by  re¬ 
searchers  and  of  the  number  of  innovative  solutions, 
concerning  centralized  controllers,  that  have  been  pro¬ 
posed  in  the  last  decade,  independent  joint  controllers 
appear  at  the  moment  to  be  the  most  preferred  choice 
in  industrial  settings,  because  of  their  simplicity  and 
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also  for  their  fault  tolerance  features,  since  in  the  case 
of  a  component  failure  only  one  joint  is  affected  and 
the  robot  can  be  retrieved  in  a  safe  position  by  means 
of  the  others.  In  this  context,  several  decentralized 
control  schemes  have  been  presented  in  the  literature 
(see  e.g.  [2,  3,  4,  5,  6]).  However,  the  most  adopted  is 
still  the  linear  Proportional-Integral-Derivative  (PID) 
controller.  This  seems  to  be  due  to  the  fact  that 
PID  controllers  provide  satisfactory  performances 
for  point-to-point  motions  and  industrial  operators 
have  great  experience  on  their  use  and  on  the  tuning 
of  the  parameters.  Moreover,  many  theoretical  and 
simulation  works  have  been  published  on  decentral¬ 
ized  control  schemes  different  from  PID,  but  not 
many  experimental  results  have  been  discussed  in 
the  literature  [7,  8]  and  there  is  a  lack  of  a  detailed 
framework  regarding  the  use  of  the  new  control  laws. 
The  aim  of  this  paper  is  to  analyze  and  compare,  from 
an  experimental  point  of  view,  different  decentralize 
controllers  in  order  to  provide  information  on  their  use 
and  the  results  they  can  achieve  in  trajectory  tracking 
of  a  two  degrees-of-freedom  SCARA  robot,  which  is 
widely  adopted  in  industry  for  assembly  tasks.  The 
selected  control  laws  appear  to  be  simple,  easy  to 
implement  and  suitable  for  industrial  applications; 
they  do  not  require  the  knowledge  of  the  robot 
dynamics  and  the  stability  proof  is  obtained  under 
mild  conditions.  These  controllers  are:  a)  a  nonlinear 
three-term  controller  proposed  by  Tarokh  [9];  b)  the 
Discontinuous  Integral  Control,  which  is  a  kind  of 
sliding  mode  variable  structure  control,  both  in  the 
continuous  and  discretized  version  [10,  11,  12].  Their 
performances  have  been  compared  with  the  typical 
PID  control,  which  has  been  implemented  in  different 
configurations.  Specifically,  both  the  scheme  with  a 
single  digital  position  loop  and  the  scheme  with  an 
inner  velocity  control  loop  (both  in  the  analog  and 
digital  version)  and  an  outer  digital  position  loop 
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have  been  implemented.  All  the  controllers,  with 
the  exception  of  the  PID  with  analog  velocity  loop, 
require  motor  drives  configured  in  torque-mode. 


2  Decentralized  control 

The  dynamic  model  for  a  n-joint  manipulator  can  be 
written  as  follows: 

M(q)q  +  C(q,  q)  +  g(q)  +  f  (q)  +  u d(t)  =  u(t) 

where  q  is  the  n  x  1  joint  angle  vector,  u(i)  is  the 
n  x  1  input  torque  vector  (control  variable),  M(q)  is 
the  n  x  n  inertia  matrix,  C(q,q)  is  the  n  x  1  vector 
representing  the  centrifugal  and  Coriolis  terms,  f(q) 
is  the  n  x  1  vector  of  the  frictional  terms,  g(q)  is  the 
nxl  vector  of  the  gravity  terms,  and  urf(£)  is  a  n  x  1 
vector  representing  unknown  disturbances  and  unmod¬ 
elled  dynamics.  Using  a  decentralized  or  independent 
joint  control  scheme  means  that  the  torque  to  be  gen¬ 
erated  by  the  motor  is  evaluated  depending  only  on  its 
position  error  e(t)  =  q d(t)  -  q (t)  and  its  time  deriva¬ 
tives,  that  is: 

ui  =  Ui(ei(t),ei(t)),  i  =  l,...,n. 

The  coupling  among  the  links,  as  well  as  variations  of 
the  inertia  and  other  parameters,  are  seen  as  distur¬ 
bances  to  be  attenuated. 

2.1  PID  control 

The  well-known  general  expression  of  a  PID  controller 
is  the  following: 

u(t)  =  Kp(e(t)  +  ±j\(T)dT  +  Td^y  (1) 

where  Kp  is  the  proportional  gain  and  T*  and  Td  are, 
respectively,  the  integral  and  derivative  time  costants. 
Many  tuning  rules  have  been  devised  along  the  last 
fifty  years  [13]  and  the  characteristics  of  the  over¬ 
all  controller  are  generally  well  understood  by  control 
practitioners.  In  typical  industrial  settings,  PID  con¬ 
trol  for  DC  and  brushless  motors  actuating  a  robot 
manipulator  consists  of  two  nested  control  loops,  that 
is  an  inner  loop  for  velocity  control  and  an  outer  loop 
foT  position  regulation.  The  velocity  control  loop  is 
generally  implemented,  either  analogically  or  digitally, 
directly  in  the  drive  of  the  motor,  and  it  is  tuned  before 
the  position  control  loop,  which  is  always  implemented 
by  a  digital  microprocessor.  The  output  of  the  veloc¬ 
ity  control  loop  is  the  motor  torque  reference  signal. 
The  motor  torque  is  also  controlled  by  a  closed-loop 
system  whose  tuning  is  however  not  available  to  the 


user.  In  this  paper  we  implemented  the  velocity  con¬ 
trol  loop  both  in  an  analog  and  a  digital  configuration 
(in  the  latter  case  the  control  frequency  is  ten  times  the 
control  frequency  of  the  position  control  loop).  Fur¬ 
thermore,  a  control  scheme  with  a  single  position  loop 
control  has  also  been  examined. 

2.2  Tarokh’s  nonlinear  three  term  con¬ 
trol 

The  nonlinear  control  law  proposed  by  Tarokh  in  1996 
[9] ,  is  described  by  the  following  three-term  expression 
of  the  torque  to  be  applied  to  the  motor  of  the  i- th 
joint: 


Ui  ( t )  =  kiTji  ( t )  +  PM  ( t )  f*  77?  (r)dr+ 

+qisgn{rti(t))  f*  |^(r)|dr  i  =  l,...,n 

U  (2) 

where  u{  is  the  torque  applied  to  the  motor  of  the  i- 
joint,  ki,  pi  and  qt  are  constant  scalar  gains,  and  77,  (t) 
is  defined  as 


r}i(t)  —  A ,ej(f)  +  e.i{t)  i  =  1, . . . ,  n 


where  A;  is  a  positive  scalar  constant.  The  asymp¬ 
totic  stability  of  tracking  error  is  demonstrated  by  us¬ 
ing  the  Lyapunov  stability  theorem.  Theoretical  and 
simulation  results  show  that  this  control  system  is  ro¬ 
bust  to  torque  disturbances  and  unmodelled  dynamics 
and  hence  it  appears  particularly  suitable  to  be  ap¬ 
plied  in  practical  cases.  It  has  to  be  noticed  that  the 
last  control  torque  component  in  (2)  contains  a  signum 
function  that  can  result  in  an  undesirable  chattering 
effect.  This  can  be  avoided,  preserving  the  asymptotic 
stability  [14],  by  replacing  the  signum  function  with  a 
saturation  function  of  the  form: 


rn/e, 

sgn{vi), 


M  <  £ 

otherwise 


where  e  is  the  boundary  layer  thickness. 

2.3  Discontinuous  integral  control 

Variable-structure  sliding  mode  control  of  nonlinear 
multivariable  systems  has  been  widely  investigated  in 
the  last  decades  because  of  its  inherent  robustness  to 
external  disturbances  [15].  In  this  context,  the  re¬ 
cently  developed  discontinuous  integral  control  (DIC) 
[10,  11,  12]  appears  to  be  particularly  suitable  to  be 
applied  to  mechanical  systems.  An  integrator  with  a 
nonlinear  switching  input  is  added  in  the  control  law  to 
track  the  unknown  disturbances.  In  this  way,  the  chat¬ 
tering  introduced  by  the  characteristic  switching  term 
of  the  variable  structure  controller  is  reduced  since  it 
has  only  to  compensate  the  difference  between  the  real 
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disturbance  and  its  estimation.  The  following  typical 
sliding  surface  is  chosen: 

a(t)  =  ce(t)  +  e(t) 

where  c  >  0.  Then,  for  the  continuous-time  case,  the 
expression  of  the  torque  command  can  be  written  as 
(i  =  1, . . .  ,n): 

m(t)  =  ka{t)  +  kiSgn{ai{t))  +’4>i{t) 

=  hi  f*  sgn(o-j(r))dr 

where  Z*,  ki  and  hi  are  constant  parameters  and  is 
an  estimation  of  an  external  disturbance  i/Ji(t)  which  is 
assumed  to  be  continuous  and  satisfying  the  following 
conditions: 


\A(t)\  <  A0;  \Tpi(t)\  <  Ai- 
If  Zj,  ki  and  hi  are  chosen  in  such  a  way  that 
li  >0;  ki  >  0;  h{  >  Ai;  hikh  >  A?(l  -  In  2); 


then  the  controlled  system  is  demonstrated  to  be  glob¬ 
ally  stable  and  after  a  finite  time  a  sliding  mode  arises 
on  the  surface  ai ,  so  that  the  error  asymptotically  con¬ 
verges  to  zero.  In  a  digital  implementation,  the  dis¬ 
cretisation  of  expression  (3)  cen  be  slightly  modified 
in  order  to  prevent  oscillations  when  the  estimation  of 
the  disturbance  is  constant  [12],  It  follows  that  the 
torque  command  for  the  discretised  discontinuous  in¬ 
tegral  control  (DDIC)  can  be  written  as: 


Ui(n)  =  ki$gn(ai(n))  -f^;(n) 

^i(n)  =  yji(n  -  1)  +  kei  +  ^sgn(<ji(n  -  l))j 

(4) 

where  Aa*(n)  =  <jj(n)  —  <jj(n  —  1),  Ts  is  the  sampling 
time,  ke  is  a  new  design  parameter  and 


+  fc»sgn(<7i(n  -  1)) 

S 


=  7 i(n) 


can  be  viewed  as  a  disturbance  estimation  error. 
Denote  by  A^,  the  discrete  derivative  of  the  distur¬ 
bance.  Choosing  the  amplitude  of  the  switching  term 
ki  such  that  ki  >  A ^Ts  a  discrete  sliding  mode  occurs 
in  the  system. 


3  Experimental  setup 

The  previously  described  decentralized  controllers 
have  been  adopted  for  the  trajectory  tracking  of  a 
SCARA  (Selective  Compliance  Assembly  Robot  Arm) 
robot,  shown  in  Figure  1,  which  is  widely  used  in  in¬ 
dustrial  environments  for  assembly  and  pick  and  place 
tasks.  The  manipulator,  denominated  ICOMATIC 


Figure  1:  The  SCARA  ICOMATIC  03  situated  in  the 
Applied  Mechanics  and  Robotics  Laboratory  of  the 
University  of  Brescia. 


SCARA  03,  was  built  by  ICOMATIC  of  Gussago 
(Brescia,  Italy)  and  it  is  installed  in  the  Applied 
Mechanics  and  Robotics  Laboratory  of  the  University 
of  Brescia.  It  has  three  degrees-of-freedom,  but  only 
the  first  two  has  been  utilized  in  the  experiments  since 
the  third,  which  is  devoted  to  move  the  end-effector 
up  and  down,  is  completely  decoupled  from  the  others 
from  a  dynamic  point  of  view. 

The  two  rotoidal  joints  are  actuated  by  two  direct 
current  motors  with  Harmonic  Drive  reduction  gears, 
which  assure  a  good  dynamic  decoupling  between  the 
two  links  (whose  length  is  0.33m).  For  a  detailed  anal¬ 
ysis  of  the  robot  kinematics  and  dynamics,  see  [16]. 
Position  measurements  are  obtained  by  incremental 
encoders  (one  encoder  step  is  equal  to  7r  •  10~5rad), 
and  velocity  measurements  by  tachometers  or  simply 
deriving  the  encoder  signal.  The  system  is  controlled 
by  means  of  a  build  on  purpose  controller  which 
consists  of  a  PC  Pentium  133  with  I/O  boards.  The 
control  software  runs  under  QNX  operating  system, 
which  assures  real-time  functionalities  and  a  control 
loop  frequency  of  1kHz  (in  the  digital  PID  velocity 
control  loop  the  control  frequency  is  10kHz). 

For  the  evaluation  of  the  tracking  error  performances, 
two  kinds  of  trajectory,  which  cover  a  large  portion 
of  the  robot  workspace,  has  been  utilized:  a  circular 
trajectory,  centered  in  (0,0.5)  and  with  diameter 
equal  to  0.2m,  to  in  accomplised  in  3s,  and  a  linear 
trajectory,  from  (-0.3,0.5426)  to  (0.6,0.1574),  to  be 
accomplished  in  6s.  The  trajectories  are  represented, 
in  the  Cartesian  space,  in  Figures  2  and  3.  It  is  worth 
stressing  that  they  require  significantly  different 
torque  levels  to  the  motors. 
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Figure  2:  Cartesian  space  representation  of  the  circular 
trajectory,  to  be  accomplished  in  3s. 


Figure  3:  Cartesian  space  representation  of  the  linear 
trajectory,  to  be  accomplished  in  6s. 

4  Results 

4.1  Tuning  of  the  parameters 

Between  the  decentralized  control  laws  presented  in 
Section  2,  a  well-settled  tuning  procedure  exists  only 
for  the  PID  control  law.  On  the  contrary,  for  the 
Tarokh’s  controller  and  for  the  Discontinuous  Integral 
Control  (both  in  its  continuous  and  discretized  ver¬ 
sion),  the  physical  meaning  of  the  different  terms  of 
the  controller  expression  are  not  very  clear  and  prac¬ 
tical  tuning  procedures  are  not  provided  in  the  litera¬ 
ture. 

For  this  reason,  a  set  of  experiments  has  been  per¬ 
formed  for  these  three  controllers,  in  order  to  deter¬ 
mine  a  method  for  the  tuning  of  their  parameters. 

For  obviuos  safety  reason,  it  is  necessary  to  guarantee 
that  actuators  limits  are  not  exceeded  and  that  no  sig¬ 
nificant  overshoot  occurs  in  a  step  response.  However, 
under  this  latter  point  of  view,  it  has  been  found  that 
only  the  PID-based  controllers  are  critical.  In  fact,  no 
overshoot  is  present  in  a  step  response  adopting  the 
other  analyzed  controllers,  disregarding  the  values  of 
their  parameters. 

For  the  PID,  the  inner  velocity  loop,  when  it  is  present 
in  the  control  scheme,  has  been  tuned  before  the  outer 
position  loop.  The  values  of  the  proportional,  integral 
and  derivative  constants  have  been  found  by  starting 


from  the  well-known  Ziegler-Nichols  rule  and  then  re¬ 
fining  them  by  hand.  Specifically,  the  derivative  ac¬ 
tion  in  the  velocity  loop  (when  it  is  implemented  in 
software)  and  the  integral  action  in  the  position  loop 
have  been  set  to  zero  and  the  proportional  and  the 
derivative  actions  in  the  position  loop  have  been  sig¬ 
nificantly  increased.  It  has  to  be  remembered  that  the 
velocity  loop  is  based  on  the  velocity  measure  made  by 
the  tachometer,  which  provides  a  noisy  signal,  and  the 
noise  can  be  undesirably  amplified  by  the  derivative 
action  so  that  it  is  opportune  to  switch  it  off.  Regard¬ 
ing  the  single  loop  PID  control,  all  the  three  actions 
have  to  be  employed  to  assure  the  best  tracking  per¬ 
formance. 

For  the  Tarokh’s  controller,  the  PD  part  krj  has  been 
tuned  at  the  beginning  analogously  to  the  position  loop 
in  the  PID  control  scheme.  Then,  the  two  other  critical 
parameters  p  and  s  have  been  considered.  The  most 
significant  for  the  adopted  systems  is  p  that  have  been 
tuned  increasing  it  until  the  performances  worsen.  The 
same  method  has  been  applied  to  s.  Finally,  parame¬ 
ter  e  has  to  be  chosen  taking  into  account  the  trade-off 
between  the  accurate  tracking  and  the  chattering  ef¬ 
fect,  althogh  it  is  not  very  crucial.  A  chattering  level 
close  to  the  one  achieved  by  the  PID  control  has  been 
allowed. 

In  the  Discontinuous  Integral  Control,  again  the  PD 
term  can  be  tuned  as  usual.  Then,  it  is  necessary  to 
increase  h  as  much  as  possible,  until  the  performances 
stop  improving.  A  high  value  of  h  permits  the  re¬ 
duction  of  the  amplitude  k  of  the  switching  term  and 
therefore  to  limit  chattering.  Hence,  in  the  tuning  of 
k  the  trade-off  between  the  chattering  and  the  accu¬ 
rate  tracking  has  to  be  considered,  but  the  presence  of 
the  integral  term  cause  an  excessive  increasing  of  k  to 
be  unuseful  with  an  evident  benefit  on  the  mechanical 
system. 

In  the  discretized  version  of  the  DIC,  the  parameter  is 
fce,  unlike  h  in  the  continuous  version,  has  to  be  kept 
sufficiently  small  in  order  to  avoid  chattering  again. 
From  a  practical  point  of  view  ke  can  be  increased, 
starting  from  very  small  value,  until  the  tracking  error 
does  not  decrease. 

4.2  Trajectory  tracking  errors 

After  the  controllers  have  been  tuned,  they  have  been 
used  for  the  tracking  of  the  trajectories  presented  in 
Section  3.  The  performances  have  been  evaluated  with 
respect  to  different  indexes.  For  each  motion,  the  max¬ 
imum  and  mean  absolute  errors  (in  encoder  steps)  have 
been  calculated  for  each  motor  separately.  Then,  ac¬ 
cording  to  the  ISO  9283  standard  [17],  the  tracking 
error  of  the  end-effector  (in  mm)  has  been  calculated 
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Cont. 

Circular 

Linear 

max 

mean 

e.i. 

max 

mean 

e.i. 

PIDS 

0.105 

0.017 

0.065 

0.081 

0.017 

0.059 

PIDD 

0.054 

0.010 

0.038 

0.049 

0.009 

0.030 

PIDA 

0.703 

0.293 

0.977 

0.374 

0.184 

0.554 

TAR. 

0.145 

0.037 

0.117 

0.102 

0.027 

0.087 

DIC 

0.105 

0.021 

0.089 

0.053 

0.012 

0.039 

DDIC 

0.081 

0.021 

0.066 

0.073 

0.020 

0.065 

Controller 

Joint  1 

Joint  2 

max 

mean 

max 

mean 

PIDS 

6 

0.954 

9 

1.400 

PIDD 

4 

0.774 

12 

1.233 

PIDA 

78 

16.18 

131 

22.53 

TAROKH 

13 

3.706 

20 

6.234 

DIC 

4 

1.106 

20 

3.275 

DDIC 

7 

1.828 

18 

4.340 

Table  1:  Joint  tracking  errors  (in  encoder  steps)  for  Table  3:  End-effector  tracking  errors  (in  mm)  for  the 
the  circular  motion  of  3s.  two  trajectories. 


Controller 

Joint  1 

Joint  2 

max 

mean 

max 

mean 

PIDS 

6 

1.057. 

9 

1.471 

PIDD 

11 

0.961 

5 

0.894 

PIDA 

48 

15.75 

44 

14.02 

TAROKH 

12 

4.680 

16 

5.714 

DIC 

7 

1.507 

7 

1.822 

DDIC 

9 

2.284 

11 

3.222 

Table  2:  Joint  tracking  errors  (in  encoder  steps)  for 
the  linear  motion  of  6s. 

as  the  minimum  difference  (with  sign)  between  the  real 
end-effector  position  and  the  reference  trajectory.  Fur¬ 
thermore,  an  additional  parameter  has  also  been  uti¬ 
lized;  it  is  an  error  index  defined  as: 

e.i.  =  e  4-  3 ae 

where  e  is  the  average  value  of  the  absolute  error  and 
ae  is  the  standard  deviation  of  the  absolute  value  of 
e.  The  definition  of  e.i.  is  inspired  by  the  concept  of 
accuracy  and  repeatibility  (ISO  9283  standard  [17]).  It 
has  to  be  stressed  that  the  end-effector  tracking  errors 
are  determined  by  applying  the  direct  kinematic  to  the 
joint  positions,  so  that  no  direct  measurements  of  the 
gripper  position  has  been  performed.  Hence,  we  refer 
to  the  tracking  errors  due  to  the  control  action  only, 
disregarding  elasticities  in  the  joints  and  calibration 
errors.  Results  are  exposed  in  Tables  1,  2  and  3,  where 
PIDS  denotes  the  single  PID  position  loop  control,  and 
PIDD  and  PIDA  denote  the  double  loop  control  with 
digital  and  analog  velocity  control  respectively  . 

4.3  Discussion 

From  the  above  results  it  appears  how  the  tracking  er¬ 
rors  obtained  with  the  traditional  PID  control  with  an 
analog  velocity  control  loop  are  significantly  improved 
by  using  a  full  digital  controller.  This  can  be  due  to 
the  difficult  in  tuning  the  analog  loop  in  the  drive. 
In  this  context,  tracking  performances  achieved  by  us¬ 
ing  a  PID  control  and  DIC  are  very  similar,  whilst 


Tarokh’s  controller  seems  not  as  good  as  the  others. 
However,  it  has  to  be  said  that  from  other  experiments 
on  a  different  system  with  a  more  significant  coupling 
between  the  links  ([18]),  it  comes  out  that  the  Discon¬ 
tinuous  Integral  Control  (both  in  the  continuous  and 
discretized  version)  is  the  most  robust  to  external  dis¬ 
turbances. 

In  order  to  better  evaluate  the  performances  of  the 
controllers,  it  is  worth  analyzing  the  torque  signal  in 
the  different  cases.  In  fact,  it  is  well  known  that  robot 
life-span  generally  decreases  when  the  system  has  to 
cope  with  vibrations,  and  torque  vibrations  can  excite 
mechanical  vibrations  resulting  in  performance  degra¬ 
dation  and  motor  overheating.  To  give  a  measure  of 
the  chattering  amplitude,  the  torque  signals  have  been 
digitally  filtered  off-line  using  a  high  pass  filter  (with 
cut-off  frequency  of  150Hz  and  attenuation  of  40dB  at 
130Hz)  and  then  the  mean  of  the  absolute  values  has 
been  evaluated,  obtaining  a  “chattering  index”  e.i.  re¬ 
ported  in  Table  4.  It  can  be  seen  how  the  PID  with 
the  analog  velocity  loop  has  a  low  chattering  and  also 
how  the  DIC  is  better  than  the  DDIC  and  in  general 
provides  no  significant  oscillations,  compared  with  the 
other  controllers. 

A  further  study  has  been  done  in  order  to  verify  the 
dependence  of  the  tracking  errors  with  respect  to  the 
sampling  time.  It  has  been  found  that  for  all  the 
controllers  the  tracking  error  increases  when  the  sam¬ 
pling  interval  increases,  but  performances  are  still  ac¬ 
ceptable  until  5ms.  For  larger  sampling  intervals,  the 
tracking  error  can  dramatically  worsen,  for  all  the  ex¬ 
amined  controllers.  As  an  example,  the  maximum  and 
mean  absolute  errors  of  the  end-effector  for  the  Dis¬ 
continuous  Integral  Control  applied  to  the  tracking  of 
circular  motion,  as  the  sampling  interval  increases  from 
1  to  10ms,  are  plotted  in  Figure  4. 

Finally,  it  has  to  be  stressed  that  the  presence  of  the 
tachometer  is  not  strictly  necessary  to  assure  good 
tracking  performances.  In  fact,  for  all  the  implemented 
controllers,  experiments  have  been  done  substituting 
the  velocity  measure  with  a  simple  differentiation  of 
the  encoder  measure.  It  has  been  checked  that  for  all 


Controller 

c.i. 

pros 

0.146 

PIDD 

0.121 

PIDA 

0.041 

TAROKH 

0.514 

DIC 

0.088 

DDIC 

0.478 

Table  4:  Chattering  index  for  the  different  experi¬ 
ments. 


Figure  4:  Maximum  and  mean  absolute  tracking  er¬ 
rors  of  the  end-effector  with  different  control  sampling 
intervals  (DIC  for  the  circular  trajectory). 


the  controllers  there  is  not  an  increasing  of  the  track¬ 
ing  error,  so  that  the  cost  of  the  overall  control  system 
can  be  reduced  eliminating  these  sensors. 

5  Conclusions 

In  this  paper,  different  advanced  decentralized  con¬ 
trollers  for  the  trajectory  tracking  of  an  industrial  two 
degrees-of-frredom  SCARA  robot  manipulators  have 
been  compared  from  an  experimental  point  of  view. 
Despite  their  simple  structure,  which  allows  a  low-cost 
implementation,  and  the  easiness  in  the  tuning  of  the 
parameters,  results  show  how  they  can  achieve  in  gen¬ 
eral  very  good  performances  (less  than  0.1mm  of  max¬ 
imum  tracking  error  for  the  end-effector).  It  also  ap¬ 
pears  how  for  the  considered  SCARA  robot,  in  which 
the  dynamic  coupling  between  the  two  links  is  signif¬ 
icantly  reduced  by  the  use  of  Harmanic  Drive  reduc¬ 
tion  gears,  it  is  not  easy  to  improve  the  performances 
obtained  using  a  PID  controller.  However,  the  Discon¬ 
tinuous  Integral  Control  seems  to  be  particularly  suit¬ 
able  to  be  adopted  in  industrial  environments,  since, 
in  addition  to  the  very  good  tracking  performances,  it 
is  very  good  in  rejecting  external  disturbances  and  it 
keeps  the  chattering  at  a  low  level.  Furthermore,  the 
computational  effort  it  requires  is  low  with  respect  to 
the  typical  PID-based  control,  since  only  one  control 
loop  has  to  be  implemented. 
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Abstract 

A  variety  of  robot  controllers  appear  in  the  literature. 
However,  computed  torque  (or  inverse  dynamics)  with  a 
secondary  controller  (PID)  is  still  the  most  used  technique 
by  the  industry.  In  this  paper,  a  novel  sliding  mode 
approach  to  computed  torque  based  robot  control  is 
proposed.  Firstly,  the  parallelism  between  the  equivalent 
part  of  the  sliding  mode  control  and  the  inverse  dynamics 
is  investigated.  In  sliding  mode,  the  equivalent  control  has 
an  effect  identical  to  the  inverse  dynamics.  In  this  paper, 
the  corrective  term  of  the  SMC  is  used  as  the  secondary 
controller  instead  of  PID  controller  and  equivalent  part  is 
replaced  with  the  computed  torque.  The  novel  controller 
can  be  designed  for  nonlinear  systems  without 
linearization  around  an  operating  point.  Experimental 
studies  carried  out  on  a  direct  drive  robot  indicate  that  a 
the  controller  results  in  a  robust  response  and  is  suitable 
for  industrial  applications . 


1.  Introduction 


The  robot  control  problem  for  the  gross  (free)  motion  is  a 
servo  problem  in  which  the  center  of  mass  of  the  robot  end 
effector  is  made  to  track  the  reference  values  representing 
the  desired  trajectory.  The  inputs  to  the  robot  can  be 
chosen  as  the  generalized  torques  produced  by  the 
actuators,  and  the  outputs  to  be  controlled  are  the 
positions,  velocities  and  accelerations  [1].  To  make  the 
robot  track  the  desired  nominal  trajectory,  the  generalized 
torque  applied  to  the  system  should  have  the  appropriate 
(nominal)  values  that  result  in  the  desired  motion  under 
ideal  conditions.  The  controller  generating  these  values 


can  be  referred  to  as  the  primary  controller.  Thus,  it 
compensates  for  the  nonlinear  effects,  and  attempts  to 
cancel  the  nonlinear  terms  in  the  model.  Since  the 
mathematical  model  used  is  usually  not  exact,  and  since 
the  system  is  subject  to  disturbances,  undesirable 
deviations  (errors)  of  the  actual  motion  from  the  nominal 
trajectory  can  be  corrected  by  means  of  an  additional 
controller  called  a  secondary  controller  [1]. 

Variable  structure  systems  with  a  sliding  mode  were 
first  proposed  in  early  1950’s,  but  it  was  not  until  the 
seventies  that  sliding  mode  control  (SMC)  became  more 
popular.  It  nowadays  enjoys  a  wide  variety  of  application 
areas.  The  main  reason  of  this  popularity  is  the  attractive 
superior  properties  of  SMC,  such  as  good  control 
performance  even  in  the  case  of  nonlinear  systems, 
applicability  to  MIMO  systems,  design  criteria  for  discrete 
time  systems,  etc.  The  best  property  of  the  SMC  is  its 
robustness.  Loosely  speaking,  a  system  with  a  SMC  is 
insensitive  to  parameter  changes  or  external  disturbances 
[2]. 

SMC  design  that  is  based  on  the  selection  of  a 
Lyapunov  function  yields  two  parts:  equivalent  control 
and  corrective  control  [3].  In  this  paper,  the  relation 
between  the  inverse  dynamics  and  the  equivalent  control  is 
presented.  In  sliding  mode,  equivalent  control  has  an  effect 
identical  to  the  inverse  dynamics  (computed  torque).  The 
motivation  of  showing  the  equivalence  is  to  use  the 
corrective  term  of  the  SMC  as  secondary  controller.  In  this 
paper,  the  primary  controller  is  again  the  inverse  dynamics 
while  the  secondary  controller  is  the  corrective  term  of 
SMC,  instead  of  a  PID  controller. 

The  paper  concludes  with  the  presentation  of  some 
experimental  results  obtained  for  the  control  of  a  direct 
drive  scara  type  robot. 
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2.  Sliding  Mode  Control 


In  the  application  of  Variable  Structure  System  theory 
to  control  nonlinear  processes  it  is  argued  that  one  only 
needs  to  drive  the  error  to  a  “switching”  or  “sliding” 
surface,  after  which  the  system  is  in  “sliding  mode”  and 
will  not  be  affected  by  any  modelling  uncertainties  and/or 
disturbances  [2,3].  Intuitively,  VSS  with  a  sliding  mode  is 
based  on  the  argument  the  control  of  lst-order  systems 
(i.e.,  systems  described  by  lst-order  differential  equations) 
is  much  easier,  even  when  they  are  nonlinear  or  uncertain, 
than  the  control  of  general  n^-order  systems  [4], 


2.1  The  System  (Plant) 


Consider  a  nonlinear,  non-autonomous,  multi-input 
multi-output  system  of  the  form, 


xjki)  =fi(X)+  £  bjjuj  (1) 

7=1 

where  ^  indicates  the  kf1  derivative  of  X(.  The  vector 
U  of  components  Uj  is  the  control  input  vector  and  the  state 
X  is  composed  of  the  jq’s  and  their  first  (krl)  derivatives. 
Such  systems  are  called  square  systems  since  they  have  as 
many  control  inputs  as  outputs  xt  to  be  controlled  [4].  The 
system  can  be  written  in  a  more  compact  form  as  letting 


is  to  force  the  system  states  to  the  sliding  surface.  Once  the 
states  are  on  the  sliding  surface,  the  system  errors 
converge  to  zero  with  an  error  dynamics  dictated  by  the 
matrix  G . 


2.3  Sliding  Mode  Controller  Design 


In  this  section,  the  design  of  a  SMC  based  on  the 
selection  of  a  Lyapunov  function  is  presented  [3].  The 
control  should  be  chosen  such  that  the  candidate  Lyapunov 
function  satisfies  Lyapunov  stability  criteria.  As  a  result  of 
the  design,  the  SMC  controller  is  obtained  as; 

U(t)=Ueg(t)  +  Uc(t)  (6) 

where  Ueq  (t)  is  the  equivalent  control  and  written  as, 


Ueq(t)  =  -(GB) 


-1 


GF(X)- 


d(GXd) 

dt 


(7) 


and  Uc(t)  is  the  corrective  control  term  and  written  as, 


Uc(t)  =  (GB)-'Dh(S)  =  K  h(S ) 
where  h(S)  is  the  saturation  function  and  defined  as, 


h(Sj)  = 


Sj 


SI 


ign(Sj) 


if  —  1  <  S  j  <  1 
otherwise 


(8) 


(9) 


The  details  of  the  controller  design  can  be  found  in  [3]. 


4.  *  ••  ^  -  *>»  (2) 

^  =  [“1  .  ««F  (3) 

and  assume,  X  is  (nxl).  The  system  equation  becomes, 

X(t)  =  F(X)  +  BU(t)  (4) 

where  B  is  (nxm)  input  gain  matrix. 

2.2  Sliding  Surface 

For  the  systems  given  in  (4),  generally,  sliding  surface, 
S,  (mxl)  is  selected  [3]  as  given  below, 

S(X,  t )  =  G(Xd  (/)  -  X(t))  =  GE  (5) 

Xd  represents  the  desired  (reference)  state  vector  and  G  is 
(mxn)  slope  matrix  of  the  sliding  surface.  The  aim  in  SMC 


3.  Sliding  Mode  Approach  to  Computed 
Torque 

The  proposed  controller  is  a  combination  of  the 
computed  torque  control  and  the  sliding  mode  control.  The 
equivalent  control  part  of  SMC  is  replaced  by  the 
computed  torque,  based  on  the  equivalence  of  these  two 
terms  under  sliding  mode  condition.  In  this  part,  firstly, 
equivalence  condition  will  be  presented.  Secondly,  the 
structure  of  the  proposed  controller  will  be  explained  and 
at  the  end,  the  closed  loop  system  behaviour  will  be 
presented. 


3.1  Parallels  between  Inverse  Dynamics  and  the 
Equivalent  Control 

As  stated  earlier,  the  sliding  mode  control  converts  the 
n*  order  system  equations  to  the  1st  order  equations.  The 
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new  dynamics  equation  while  in  sliding  mode  can  be 
written  as, 


(10) 


By  solving  (10)  for  desired  control  signal,  one  will  obtain 
the  following, 

gj-igla.o  a,) 

dt 

With  using  (5)  and  (4),  the  equation  (1 1)  can  be  computed 
as, 


dt  dt 

Solving  (12)  for  desired  control  signal, 


Ud  =  -(GB) 


-1 


dt 


(12) 


(13) 


is  obtained.  If  the  system  is  in  the  sliding  mode,  this  means 
the  system  perfectly  follows  the  desired  trajectory,  and 
F(X d)  term  can  be  replaced  with  F(X)  which  is  the  actual 
value.  As  a  result,  the  desired  control  signal  becomes  the 
equivalent  control. 


Ud  «Ueg=-(GB)~l 


GF(X)  - 


d{GXd)' 

dt 


(14) 


A  direct  approach  to  show  the  equivalence  of  the 
inverse  dynamics  and  the  equivalent  control  is  as  follows; 
the  system  equation  given  in  (4)  can  be  solved  for  desired 
control  signal  as  given  below, 


Xd(t)  =  F(Xd)  +  BUd(t)  (15) 


B  is  not  a  square  matrix,  so  equation  (15)  is  transformed 
by  multiplying  with  a  transformation  matrix  G, 

GXd  (t)  =  GF(Xd  )  +  GBUd  (t)  (16) 


If  (16)  is  solved  for  the  desired  control,  it  will  be  obtained 
as  given  in  (13). 

As  a  conclusion,  in  sliding  mode,  the  equivalent  control 
is  an  equivalence  to  the  inverse  dynamics. 


3.2.  The  Structure  of  the  Proposed  Controller 


The  structure  of  the  proposed  controller  is  similar  to  a 
typical  robust  controller  which  is  composed  of  a  nominal 
part,  or  inverse  control  law,  and  a  corrective  term  aimed  at 


dealing  with  model  uncertainty  or  parameter  variations.  In 
this  paper,  the  corrective  term  is  selected  as  the  corrective 
term  of  the  SMC  based  on  the  equivalence  of  the  inverse 
control  and  the  equivalent  control  under  sliding  condition. 

As  is  clear  from  Fig.l,  the  controller  is  represented  as 
given  below, 

U(t)  =  Ud(t)  +  Uc(t)  (17) 

where,  Ud  (t)  is  given  in  (13),  and  Uc(t)  is  given  in  (8). 


3.3.  Closed  Loop  System  Behaviour 


If  the  controller  (17)  is  applied  to  system  given  in  (4), 
the  closed  loop  system  dynamics  become, 

X(t)  =  F(X)  +  B(Ud  +UC)  (18) 


Equation  (18)  is  multiplied  by  G  matrix,  and  Ud 
replaced  by  (13)  and  Uc  (t)  is  replaced  by  (8), 


GX(t)  =  GF(X)  +  GB\ 


GX  -  GXd  =  GF(X)  -  GF(Xa)  +  Dh(S) 


-(GB)' 


GF{Xd)~ 


d(GXd) 

dt 


+  {Kh(S)) 


(t)  is 


(19) 

(20) 


If  the  system  perfectly  follows  the  desired  trajectory, 
GF(Xd)  term  will  cancel  GF(X)  term  and  using  (5),  the 
closed  loop  behaviour  is  obtained  as  below, 

S  =  -Dh(S)  (21) 

This  is  the  same  case  as  SMC  [3],  and  the  solution  for 
(21)  implies  that 


lim  S(t)  =  0 

t->  oo 


(22) 


Although  S(t)  goes  to  zero  when  time  goes  to  infinity,  it 
goes  into  boundary  layer  at  a  finite  time. 


Figure  1.  Overall  control  system 
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4.  Robotics  Application 

In  order  to  study  the  performance  of  the  controllers 
proposed,  extensive  experimental  studies  are  carried  out 
on  a  two  degrees  of  freedom,  direct  drive,  scara  type 
manipulator,  manufactured  by  Integrated  Motion 
Corporation  (see  figure  2). 


4.1  Robot  Dynamics 


The  robot  model  is  written  as, 

M (q)q + C(q,  q)q  +fc=x  (23) 

where, 

M(q)  :  (2x2)  Inertia  matrix, 

C{q,q)  :  (2x2)  Coriolis  terms, 
r  :  (2x1)  torque  vector, 

q>q,q  :  (2x1)  position,  velocity,  acceleration  vectors, 

fc  :  (2x1)  Coloumb  friction  force, 

q  =  [0 j  02  ]T  and  0j  :  joint  angles,  (i=l,2). 

The  details  of  the  dynamics  can  be  found  in  [5]. 

The  model  in  (23)  can  be  written  in  the  state-space 
form  representation  as, 


*1  x2 

x2_  -M  \Cx2  +  fc)_ 


(24) 


where, 


[x\  x2 F=  [y  qY  -  [fy  @2]  an^  w  =  T 

Equation  (24)  is  in  the  form  of  (4),  and  the  proposed 
method  can  be  applied. 


Figure  2.  The  experimental  direct  drive  scara  robot 


4.2  The  Experimental  Setup 


The  control  workstation  has  an  open  architecture, 
enabling  the  modifications  of  the  control  algorithm.  The 
latter  can  be  written  and  complied  in  C-language  in  a 
personal  computer  equipped  with  a  80486  CPU.  The 
compiled  form  of  the  proposed  control  algorithm  is 
downloaded  to  a  DSP  based  servo-controller.  A 
TMS320C30  DSP  is  used  which  is  a  floating  point  DSP 
with  a  32-bit  architecture.  Necessary  torques  to  track  a 
desired  trajectory  are  computed  by  software  and  written  to 
DACs  of  the  board.  The  motor  driver,  functionally, 
converts  the  complex  variable  reluctance  motor  into  a 
system  that  behaves  like  a  high  torque,  low  velocity  DC 
motor.  It  also  amplifies  the  controller  output  to  a  level  that 
is  capable  of  driving  the  direct  drive  motors.  The  only 
available  feedback  signals  are  the  angular  positions  which 
are  measured  by  encoders  with  153  600  counts  per 
actuator  revolution.  The  angular  velocities  are  computed 
by  differentiating  the  measured  positions.  The  architecture 
of  the  controller  is  presented  in  Fig.  3. 


Actuation 


DSP 

TMS320C30 

Motor 

Drivers 

Position 

PC 

b_j 

486 

r 

Feedback 

(encoders) 


Robot 


Figure  3.  The  controller  structure 


4.3  Experimental  Results 


The  experimental  results  are  presented  from  Fig.  4  to  9, 
where  the  solid  and  the  dashed  curves  are  related  to  the 
base  and  elbow  link,  respectively.  The  desired  state 
trajectories  used  are  depicted  in  Fig.  4.  The  angular  errors 
are  presented  in  Fig.  5.  It  should  be  pointed  out  that  the 
initial  position  errors  are  deliberately  introduced  to  see  the 
system  behavior  (see  Fig.  6  and  Fig.  7)  when  the  system  is 
not  on  the  sliding  surface.  Theoretically,  when  the  states 
reach  to  the  origin  on  the  phase  plane,  they  should  stay 
there.  However,  in  DSP-based  discrete  time  applications, 
due  to  the  fact  that  the  system  is  open  loop  for  the  duration 
of  the  sampling  time,  they  deviate  from  origin.  When  the 
control  for  the  next  sampling  time  is  applied,  the  states 
again  move  towards  the  origin.  In  other  words,  the 
proposed  controller  establishes  a  stable  domain  of 
attraction  around  the  origin.  The  control  signals  that  are 
applied  to  the  robot  are  presented  in  Fig.  8.  The  computed 
torque  values  are  presented  in  Fig.  9.  It  can  be  seen  that 
the  proposed  controller  results  in  very  good  trajectory 
following  performance. 
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Experimental  studies  with  the  well  known  computed 
torque  with  a  PD  control  and  direct  computed  torque 
(without  PD)  are  also  carried  out.  The  results  in  terms  of 
RMS  and  absolute  value  of  the  peak  errors  are  compared 
with  the  proposed  approach,  and  summarized  in  table  1. 
This  comparison  is  made  based  on  zero  initial  condition 
errors  as  differ  from  the  presented  figures.  .  In  the  design 
of  all  controllers,  the  parameters  of  the  robot  as  given  by 
the  manufacturer,  are  used.  The  parameters  of  the  PD 
controller  are  tuned  to  obtain  the  best  results.  As  can  be 
seen  form  the  table,  the  proposed  novel  architecture  results 
in  minimum  errors. 


time  (sec) 

Figure  4.  Reference  angular  position  trajectories 


Figure  5.  Angular  errors  for  the  base  and  the  elbow  links 


Figure  6.  Phase  plane  motion  for  the  base  link 


Figure  7.  Phase  plane  motion  for  the  elbow  link 


Figure  8.  Controller  outputs 
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Figure  9.  Computed  torques 


Table  1.  A  comparison  of  the  methods  based  on  RMS  and 
_ absolute  value  of  the  peak  errors _ 


Base 

Elbow 

Methods 

RMS 

Peak 

RMS 

Peak 

Proposed 

Controller 

0.0029 

0.0057 

0.0019 

0.0037 

Computed 
Torque  +  PD 

0.0036 

0.0084 

0.0033 

0.0088 

Computed 

Torque 

0.2601 

0.4341 

0.2815 

0.5833 

3.  The  novel  controller  is  intuitively  more  robust  then 
PID  controller  but  less  then  SMC. 

Experimental  studies  carried  out  on  a  direct  drive  robot 
arm  indicate  that  the  proposed  controller  is  a  good 
alternative  to  the  computed  torque  plus  PID  controller  for 
industrial  robotic  applications. 
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5.  Conclusions 


In  this  paper,  a  novel  approach  to  computed  torque 
based  robot  control  is  proposed.  Firstly,  the  parallelism 
between  the  equivalent  part  of  the  sliding  mode  control 
and  the  inverse  dynamics  is  presented.  In  sliding  mode,  the 
equivalent  control  has  an  effect  identical  to  the  inverse 
dynamics  (computed  torque).  In  this  paper,  the  corrective 
term  of  the  SMC  is  used  as  the  secondary  controller 
instead  of  PID  controller. 

The  proposed  controller  has  the  following  advantages: 

1.  It  can  be  designed  for  nonlinear  systems  without 
linearization  around  an  operating  point, 

2.  There  is  no  need  to  measure  (or  compute)  the  actual 
values  of  the  joint  accelerations  of  the  robot,  as  the 
case  in  SMC. 
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Abstract 

In  this  paper,  two  different  well-known  neuro-controller 
algorithms  and  a  novel  approach  are  presented  and 
compared  based  on  trajectory  tracking  performance, 
computational  complexity,  design  complexity,  RMS  errors, 
necessary  training  time  in  learning  phase  and  payload 
variations.  The  control  schemes  are  simulated  on  a  two 
degrees  of  freedom  direct  drive  SCARA  robot.  In  the 
applications  of  the  neuro-controllers,  one  has  to  take  into 
account  the  difficulties  related  to  the  implementation  of  the 
efficient  neural  network  training  strategy.  In  this  paper,  it 
is  concluded  that  which  controller  is  better  for  different 
situations. 


1.  Introduction 


Nowadays,  various  control  techniques  are  available  for 
robot  manipulators  in  the  literature.  Major  problem  of 
robot  control  includes  kinematics,  dynamics,  path 
planning,  and  motion  control. 

The  different  control  schemes  try  to  solve  one  of  them 
or  many  of  them.  The  ones  examined  in  this  paper  are 
neuro-controllers.  The  major  attraction  of  neural  network 
approach  may  be  attributed  to  their  learning  and 
adaptiveness  features.  In  the  literature,  a  number  of  recent 
research  show  that  neural  networks  are  capable  to  tackle 
with  various  nonlinear  discrete  and  continuous  control 
tasks.  Currently,  neural  networks  are  used  in  two  main 
areas  of  application:  approximation  and  classification 
problems.  Numerous  network  architectures  (back- 
propagation,  radial  basis,  recurrent,  self-organizing)  and 
training  strategies  (supervised  and  unsupervised)  are  being 
considered  [1-8].  However,  a  brief  comparative 
explanation  about  their  performance  was  not  given  until 
this  paper. 

A  popular  topic  on  NN  robot  control  is  the  use  of  NN  to 
compute  the  inverse  dynamics  or  inverse  kinematics  with 
learning.  Psaltis  et  al.[4],  Guez  and  Ahmad  [5]  are  the 


workers  who  implemented  those  architectures.  Also 
Elsley  [6],  Grossberg  and  Kuperstein  [7],  Fukuda  [8] 
are  the  other  important  workers  on  this  area.  The 
proposed  control  algorithm  are  named  direct  inverse 
modelling  architecture.  Two  types  direct  inverse 
modeling  architecture  are  applied  to  robotics 
applications:  DIMA  I,  which  is  off-line  and  DIMA  II 
which  is  on-line.  They  are  both  supervised  learning 
algorithms. 

Kawato  et  al.[l]  investigate  hierarchical  neural 
network  model  for  voluntary  movement  with  applied  to 
robotics.  He  claims  that  a  computational  model  for 
voluntary  movement  deals  with  three  computational 
tasks:  (1)  trajectory  determination,  (2)  coordinates 
transformation,  and  (3)  motor  command  generation. 
The  major  part  of  the  architecture  used  by  Kawato  is 
the  inverse  dynamic  model.  Kawato  used  NN  in 
inverse  dynamic  model  and  a  PID  controller  and  the 
sum  of  these  outputs  are  send  to  the  plant  as  input. 

A  novel  approach  about  sliding  mode  control  is 
proposed  by  Ertugrul  and  Kaynak.  In  their  work,  they 
show  the  equivalence  of  the  inverse  dynamics  and  the 
equivalent  control  [2,3].  The  equivalent  part  is 
computed  by  NN  instead  of  exact  calculation. 

In  this  paper,  direct  inverse  modelling  architecture 
(DIMA)  [4],  feedback  error  learning  architecture 
(FELA)  [1]  and  a  novel  approach  which  is  a  neuro- 
sliding  mode  control  [2,3]  are  compared  according  to: 

1.  Trajectory  tracking  performance, 

2.  Computational  complexity, 

3.  Design  complexity, 

4.  RMS  Errors, 

5.  Necessary  training  time  in  learning  phase, 

6.  Payload  variations. 

Simulation  results  obtained  from  the  trajectory 
control  of  a  scara  type  direct  drive  robot  are  presented 
and  compared. 
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2.  Neuro-Controllers 


In  this  paper,  two  well-known  neuro-controllers  and  a 
novel  approach  are  compared.  The  first  one  is  direct 
inverse  modeling  architecture  with  radial  basis  functions 
network.  The  second  one  is  feedback  error  learning 
architecture  and  the  last  one  is  neuro-sliding  mode  control. 


2.1  Direct  Inverse  Modeling  Architecture  (DIMA) 


The  direct  inverse  control  aims  to  control  the  plant  by 
using  its  inverse  dynamics.  The  neural  network  receives 
the  plant  output  as  its  input  and  the  plant  input  as  output 
[4].  The  model  is  trained  by  inputting  to  the  controller  a 
sequence  of  control  signals.  Although,  there  is  a  feedback 
path  between  the  plant  output  X  and  the  vector  of  inputs  U 
of  this  control  plant  is  open  loop  in  the  sense  that  it  does 
not  take  into  consideration  the  error  between  the  plant 
output  X  and  the  desired  output.  The  main  difficulty  in 
applying  this  learning  method  is  to  choosing  the  training 
signal  U.  The  plant  must  be  brought  into  the  desired 
operating  region  where  the  controller  will  have  to  operate. 
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Figure  1.  Direct  Inverse  Modelling  Architecture  (DIMA) 


2.2  Feedback  Error  Learning  Architecture 
(FELA) 


The  structure  of  FELA  is  shown  in  Fig.  2,  which  is 
proposed  by  Kawato  et  al  [1].  In  this  architecture,  the 
neural  network  is  used  as  a  fet^- forward  controller  and 
trained  by  using  the  output  of  ^  feedback  controller  as 
error  signal.  The  total  control  input  U  to  the  robot  is  equal 
to: 

U(!)  =  UF(t)  +  UN(t)  (1) 

where  U F{t)  and  U N{t)  are  feedback  controller  and 
neuro-controller  outputs,  respectively. 


The  architecture  requires  qualitative  priori 
knowledge  about  the  plant  dynamics.  The  input  to  the 
neuro-controller  is  firstly  given  as  an  input  to  the 
functions  which  makes  the  plant  dynamics  nonlinear. 
So  the  necessary  knowledge  is  the  functions  that  makes 
the  plant  nonlinear. 

According  to  Kawato  et  al.[l],  the  FELA 
architecture  possesses  the  following  advantages  over 
the  other  architectures: 

1.  Learning  and  control  are  performed 

simultaneously, 

2.  Back  propagation  of  the  error  signal  through  a 

forward  model  of  the  plant  is  not  required, 

3.  The  learning  is  goal-directed  and  can  resolve  any 

ill-posedness  of  the  problem  at  hand. 


Figure  2.  Feedback  error  learning  architecture  (FELA) 


2.3  Neuro  Sliding  Mode  Control 


In  the  application  of  Sliding  Mode  Controllers,  the 
main  problem  which  is  encountered  is  that  a  whole 
knowledge  of  the  system  dynamics  (or  inverse 
dynamics)  and  the  system  parameters  is  required  to  be 
able  to  compute  the  equivalent  control.  This  is  actually 
very  rare  in  practice.  A  feed-forward  neural  network  is 
proposed  to  compute  the  equivalent  control  in  [2].  The 
weights  of  the  net  are  updated  such  that  the  additional 
control  term  of  the  sliding  mode  goes  to  zero.  Gradient 
decent  method  is  used  for  the  weight  adaptation. 


Figure  3.  Neuro  sliding  mode  control 
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3.  Robotics  Application 

In  order  to  study  the  performance  of  the  controllers 
proposed,  extensive  simulation  studies  are  carried  out  on  a 
two  degrees  of  freedom,  direct  drive,  SCARA  type 
experimental  manipulator,  manufactured  by  Integrated 
Motion  Corporation  (see  figure  4). 

3.1  Robot  Dynamics 

The  robot  model  is  written  as, 

M(q)q  +  C(q,q)q  +  fc=  x  (2) 

where, 

M (q)  :  (2x2)  Inertia  matrix, 

C(q,q)  :  (2x2)  Coriolis  terms, 

T  :  (2x1)  torque  vector, 

q,q,q  :  (2x1)  position,  velocity,  acceleration  vectors, 
fc  :  (2x1)  Coloumb  friction  force, 

q  -  02] T  and  @i  '  joint  angles,  (i=l,2). 

The  details  of  the  dynamics  can  be  found  in  [2]. 


Figure  4.  The  structure  of  Direct  Drive  Scara 


3.2  Simulation  Results 


The  simulation  results  are  presented  in  Figs.  5-12.  The 
solid  and  dashed  lines  belong  to  first  and  second  link, 
respectively.  A  comparison  of  the  controllers  based  on  the 
simulation  results  is  presented  in  table  1 . 


3.2.1  Neuro-SMC.  Neuro-SMC  application  algorithm 
is  applied  on  behalf  of  computation  of  equivalent 
control. 

The  algorithm  is  given  below 

1.  Initialization  of  the  weights  of  NN. 

2.  Computation  of  hidden  layer  and  net  outputs. 

3.  Computation  of  AU. 

4.  Application  of  computed  control  to  the  robot. 

5.  Measurement  of  outputs  of  the  robot. 

6.  Update  the  weights  of  neural  network. 

In  order  to  compare  the  effects  of  payload  change 
while  the  robot  is  working  40  kg  payload  was  added 
and  the  effects  of  this  payload  to  the  end  effector  were 
examined.  The  payload  was  added  at  t=10  sec  while 
the  whole  simulation  takes  20  sec.  The  simulation 
results  can  be  seen  in  Figs.  7-8 


3.2.2  DIMA.  Direct  inverse  modelling  architecture  is 
the  one  that  is  applicable  to  the  most  of  the  known 
robotic  applications.  The  simulations  are  done  under 
the  conditions  that  has  3  layers  of  NN  which  has  one 
hidden  layer  and  the  number  of  nodes  are  6-4-2 
respectively  and  the  most  important  criteria  is  the 
training  method  which  is  RBF  in  this  case. 

The  effect  of  payload  change  was  examined  with  40 
kg  payload  and  it  was  added  at  t=10  sec  while  the 
whole  simulation  takes  20  sec.  The  simulation  results 
can  be  seen  in  Figs.  9-10. 


3.2.3  FELA.  Feedback  error  learning  architecture  has 
many  successful  examples  in  robotic  applications. 
Therefore  it  is  trustful  and  also  applicable  method. 
FELA  has  an  on-line  learning  structure  and  was  trained 
using  RBF  NN  and  the  NN  has  3  layers  (one  hidden) 
and  it  has  6  neuron  in  the  input  layer,  10  neuron  in 
hidden  layer  and  2  neuron  in  output  layer. 

The  effect  of  payload  change  was  examined  with  40 
kg  payload  and  it  was  added  at  t=10  sec  while  the 
whole  simulation  takes  20  sec.  The  simulation  results 
can  be  seen  in  Figs.  11-12. 
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Figure  5.  End  effector  reference  on  X-Y  plane 


Figure  8.  Position  errors  of  Neuro-SMC  after  payload 


time  (  e  ) 


time  (  e  ) 


Figure  11.  Position  errors  of  FELA  Figure  12.  Position  errors  of  FELA  after  payload 


Table  1.  Comparison  of  the  neuro  controllers 


METHODS 

DIMA  WITH  RBF 

FELA 

NEURO  SMC  ! 

Computation  Time 

20 

X 

X 

Necessary  Knowledge 

X 

Qualitative  priori 
knowledge 

X 

Convergence  Time 

1200 

1200 

RMS(0,  ) 

0.0068 

0.0085 

0.0061 

RMS((92) 

0.0022 

0.0058 

0.0011 

Peak(0 , ) 

0.0755 

0.0755 

0.0754 

Peak(02 ) 

0.0145 

0.0145 

0.0140 

|  After  Payload  Change  || 

rms(<9,  ) 

0.0150 

0.0116 

0.086 

RMS(02) 

0.0053 

0.0059 

0.0043  ! 

Peak{0 j ) 

0.0755 

0.0755 

0.0343 

Peak(d2 ) 

0.0231 

0.0157 

0.0157 

Training  Method 

Off-line 

On-line 

On-line 

Parameters 

NN  structure 

NN  structure 

4.  Conclusion 


In  this  paper,  the  neural  network  based  robot 
controllers  are  compared.  From  the  obtained  results, 
it  may  came  to  a  conclusion  that  in  which  condition 
the  proposed  controller  is  better  than  the  others  or 
which  neuro  controllers  are  suitable  for  the  current 
situation.  Between  the  compared  controllers, 
according  to  the  table  1,  the  performance  of  the  neuro 
sliding  mode  controller  is  the  best  with  and  without 
payload  variations.  And  also  it  has  one  great 
advantage  that  is  an  on-line  controller,  which  means 
that  control  and  learning  takes  place  at  the  same  time. 
The  RMS  and  the  peak  values  are  much  smaller  than 
DIMA  ad  FELA  with  and  without  payload  (see  table 
1). 

As  a  conclusion,  the  superiority  of  the  novel 
approach  is  shown  in  a  simple  but  easily 
generalizable  case  in  the  table.  The  approach 
proposed  has  advantages  on  both  design  and 
application  phase  for  trajectory  control  of  a  direct 
drive  robotic  manipulator. 
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Abstract 

A  method  using  fuzzy  logic  and  neural  networks  for  the 
identification  and  control  of  robot  dynamics  is  presented 
in  this  paper.  Fuzzy  identifiers  are  employed  to  match 
the  centripetal ,  Coriolis  and  inertial  effects  in  the  robot 
dynamics  model  Fuzzy  logic  systems  used  are 
represented  as  three-layer  feed-forward  neural  networks. 
Fuzzy  system  parameters  are  adjusted  via  back- 
propagation .  An  on-line  tuning  algorithm  is  employed. 
Implementation  results  obtained  with  a  two  degrees  of 
freedom  SCARA  type  arm  with  special  attention  to  rule 
base  construction  are  presented. 


1.  Introduction 

In  the  trajectory  control  of  robotic  manipulators,  the 
main  difficulty  is  that  the  dynamics  involved  is  coupled 
and  nonlinear.  Two  different  main  methodologies  can  be 
applied  to  overcome  the  problems  introduced  by  this 
difficulty.  The  first  is  to  design  robust  controllers  using 
minimum  information  about  the  dynamics  and  the 
second  is  to  use  a  model  which  is  as  close  as  possible  to 
the  real  one  to  predict  and  counteract  the  various 
nonlinear  effects  inherent.  The  problem  of  the  first 
approach  is  that  the  resulting  controller  can  be  too 
conservative.  The  main  drawback  of  the  second  approach 
is  the  need  for  an  accurate  plant  model.  Once  having 
obtained  a  ‘dynamics  model,  however,  various  control 
structures  can  be  employed.  Although  linearized  models 
can  perform  well  to  some  extend,  nonlinear  model  based 
controllers  prove  to  be  more  successful  in  trajectory 
tracking.  A  method  for  obtaining  such  a  model  is 
presented  in  his  paper.  To  match  the  centripetal,  Coriolis 
and  inertial  effects  in  the  robot  dynamics  model  [1-2], 
fuzzy  logic  systems  which  are  represented  as  three-layer 
feed-forward  neural  networks  are  used.  The 
representation  used  is  given  in  [3-4],  where  back 
propagation  is  used  to  tune  fuzzy  system  parameters. 
Similar  representations  of  fuzzy  systems  as  neural 
networks  can  be  found  in  [5-8].  Various  controller 


schemes  using  a  dynamics  model  obtained  by  neural 
network  training  are  reported  [9-10].  The  control  method 
proposed  in  this  paper,  similar  to  [11-12],  is  suitable  for 
on  line  parameter  adjustment.  In  this  algorithm,  the 
dynamics  parameters  are  learned  while  the  robot  arm  is 
partly  controlled  by  a  simple  PD  control  algorithm.  The 
identification  process  is  continuously  active  to  act  against 
disturbances  and  dynamics  changes  like  in  the  case  of  a 
sudden  grasping  of  an  unknown  payload. 

One  of  the  main  objectives  considered  is  to  keep  the 
fuzzy  system  simple  with  a  small  number  of  rules  and 
free  of  redundant  inputs  to  have  applicability  in  real 
time.  Any  deficiency  in  the  rule  base  is  aimed  to  be 
compensated  by  the  fast  learning  capacity  of  the  system. 

The  next  section  considers  fuzzy  modeling  of  the  robot 
dynamics  without  going  into  tuning  issues.  Section  3 
details  the  three-layer  feed-forward  neural  network 
representation  of  the  class  of  fuzzy  systems  which  will  be 
used  in  this  paper  together  with  the  back-propagation 
algorithm  and  briefs  the  on-line  identification  method. 
Section  4  introduces  dymamics  properties  of  the  SCARA 
type  arm  and  presents  implementation  results  obtained 
on  the  two  main  joints  of  the  robot.  Conclusions  are 
given  in  the  last  section. 

2.  Robot  dynamics  representation  via  fuzzy 
logic 

Dynamics  equations  of  a  robot  can  be  represented  in 
the  following  form 

D(q)q  +C(q,q)q  +  g(q)  =  u  .  (1) 

Here  D  is  the  cumulative  inertia  matrix  of  the 
manipulator  and  the  motors,  C  is  the  matrix  for  Coriolis 
and  centripetal  forces,  friction  effects,  g  is  the  gravity 
effect,  u  e  Rn  is  the  vector  of  generalized  force  or  torque 
inputs  and  q  eRn  is  the  vector  of  joint  positions,  where 
n  is  the  number  of  the  degrees  of  freedom.  This  equation 
can  be  interpreted  as  n  functions  of  the  3 n  variables 
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<h,—,qn,qi,"-,q„,qu—>‘i„>  i-e.  the  components  of 
q,  q  and  q .  The  relations  from  joint  angular  positions, 
velocities  and  accelerations  to  torques  can  be  divided 
further  into  subsystems.  This  is  illustrated  in  Fig.  1. 

Modeling  the  whole  dynamics  as  a  single  fuzzy  system 
and  modeling  the  individual  subsystems  as  local  fuzzy 
systems  are  two  approaches  to  take.  The  first  approach  is 
favorable  when  information  about  the  subsystems  is 
lacking.  Matching  the  dynamics  for  such  a  system  is  then 
solely  through  a  learning  process.  The  number  of  rules 
sufficient  to  identify  a  model  is  experimentally  obtained. 
Modeling  subsystems  individually  is  preferable  when 
information  is  present  to  some  extend.  In  our  case  the 
general  structure  of  the  robot  is  assumed  to  be  known. 
Thus,  valuable  information  about  the  number  of  inputs  of 
the  specific  subsystems  and  the  ranges  of  the  inputs  can 
be  exploited.  This  is  where  the  human  knowledge  enters 
to  the  modeling  process.  In  fact,  incorporating  human 
knowledge  to  model  structure  design  is  the  main  reason 
why  fuzzy  systems  are  employed  for  identification. 

Although  using  fuzzy  input  output  relations  for  the 
subsystems  is  useful  in  rule  base  reduction,  going  too 
deep  to  each  and  every  term  in  the  dynamics  equation 
would  complicate  the  model  structure  and  is  most  of  the 
time  not  possible  at  all  since  exact  knowledge  about  the 
dynamics  is  not  available. 


Figure  1.  Decomposition  of  the  robot  dynamics  into 
subsystems. 


Figure  2.  Subsystems  used  for  individual  joints 


The  inertial,  centripetal  and  Coriolis  effects  for  each  joint 
are  considered  as  subsystems  to  be  modeled  in  this  paper. 
Gravity  effects  in  Fig.  2  applicable  for  a  more  general 
robot  structure  are  not  present  in  the  first  two  degress  of 
freedom  of  the  SCARA  configuration.  Friction  effects  are 
not  modeled.  The  minimum  number  of  rules  is  tried  to  be 
obtained.  The  on-line  tuning  algorithm  will  then  be 
responsible  to  compensate  for  differences  in  the 
parameters  and  the  structure  by  very  quickly  adjusting 
fuzzy  system  parameters.  This  tuning  algorithm  is 
explained  in  the  next  section. 


q  q  q 


3.  Back-propagation  tuning 

The  fuzzy  systems  which  will  be  used  in  this  paper  are 
of  the  form  (2). 


ZM  _ 


f(x)  =  ■ 


mx  exp 


A 


V  oy  J 


ZM 

l=\ 


mx  exp 


f  f  -,\A 

X .  -  X,  ' 


V  <x,  J 


(2) 


This  function  characterizes  a  fuzzy  system  with  center 
average  defuzzifier,  product  inference  rule,  singleton 
fuzzifier  and  Gaussian  membership  functions.  Here  M  is 
the  number  of  rules,  yl  stands  for  the  output  constant  of 
rule  /,  n  is  the  number  of  input  variables,  x.  is  the  ith 
input  variable,  xj  is  the  center  of  the  membership 
function  for  x,.  for  rule  /,  cj\  represents  the  width  and 
a\  the  height  of  this  membership  function.  Gaussian 
membership  functions  are  differentiable.  This  feature  is 
required  in  the  back-propagation  algorithm.  The  function 
in  (2)  can  be  represented  with  a  three-layer  feed-forward 
neural  network  structure  shown  in  Fig.  3  [4]. 
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Figure  3.  The  three-layer  feed-forward  nn  architecture 

In  Fig.  3,  n  stands  for  the  membership  functions 

described  above.  Triangles  represent  gains. 

With  the  motivation  that  systems  of  the  form  (2)  are 
universal  approximators  [4,15],  [4]  develops  a  back- 
propagation  training  algorithm  for  this  class  of  fuzzy 
systems  as  in  the  following. 

For  a  given  input-output  pair  (xp  ,d)  with  xp  eR” 
and  d  eR,  a  measure  of  the  modeling  error  of  a  fuzzy 
model  /  (x)  of  the  form  above  can  be  defined  by 

£  =  |[/(*')-rf]2-  <3) 

In  order  to  minimize  this  error,  assuming  that  all  the 
a\  terms  are  equal  to  1,  fuzzy  system  parameters  will  be 
varied  according  to  the  back-propagation  rules  below. 


Figure  4.  On-line  identification  and  control  with  the  feedback 
error  learning  scheme 


/9E 

y‘(k  +  l)  =  y'(k)-a—r 
ay 


=  y'(k)-a£-jLz< 


(4) 


dE 


xi(k+l)  =  xi(k)-a—l 


-'m  -  f~d  I-’  ,2{xf-x‘(k)) 


(5) 


al(k+l)  =  cr‘(k)-a-^j 


.  f-d,  .  s  ,l(x? -x‘{k))‘ 


(6) 


Here  a  is  a  constant  step  size.  The  variable  b  is 
defined  in  Fig.3  and / stands  for  the  function  / (xp)  in 
(3). 

This  scheme  is  used  in  the  mechanism  of  fuzzy 
identifiers  [4].  [4]  reports  a  better  performance  of  fuzzy’ 
identifiers  when  compared  with  neural  identifiers  in  [14]. 
This  training  method  will  be  used  in  the  controller 
structure  shown  in  Fig.4. 

The  controller  structure  is  similar  to  the  controller  in 
[11]  in  that  the  dynamics  identifier  is  tuned  by  the 
difference  of  the  total  torque  input  to  the  robot  and  the 
torque  output  of  the  dynamics  model.  This  difference  is 
the  output  of  a  simple  PD  controller.  The  PD  control 
signal  diminishes  as  the  dynamics  is  learned.  [11]  uses 
pure  neural  networks  whereas  fuzzy  identifiers  are  used 
in  the  present  paper  for  identification  purposes. 

4.  Implementation  results 

The  structure  and  properties  of  the  robotic  manipulator 
that  is  used  in  the  simulations  is  described  in  the 
following  and  the  difficulties  in  its  control  are  outlined. 
The  experimental  robot  is  a  two  degrees  of  freedom 
SCARA  type  arm  [15],  the  structure  of  which  is  shown  in 
Fig.5,  its  parameters  being  listed  in  Table  1  in  standard 
(m  sec  kg)  units. 

The  dynamics  equations  of  this  manipulator  are  given 
by  the  following  equation. 

M(q)q  +V(q,q)  +  F  =  u  .  (7) 
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Figure  5.  The  direct  drive  arm 


Figure  6*  Assignment  of  joint  angles 


Table  1.  Dynamics  parameters  of  the  robot 


Motor  1  Rotor  Inertia 

0.267 

Arm  1  Inertia 

0.334 

Motor  2  Rotor  Inertia 

: 

0.0075 

Motor  2  Stator  Inertia 

0.040 

Arm  2  Inertia 

0.063 

Motor  1  Mass 

73.0 

Arm  1  Mass 

9.78 

Motor  2  Mass 

3.0 

Arm  2  Mass 

4.45 

Arm  1  Length 

0.359 

Arm  2  Length 

0.24 

Arm  1  CG  Distance 

0.136 

Arm  2  CG  Distance 

0.102 

Axis  ]  Friction 

4.90 

Axis  2  Friction 

1.67 

Torque  Limit  1 

245.0 

Torque  Limit  2 

39.2 

In  this  expression  q  is  the  vector  of  joint  angles  q}  and  q2 
shown  in  Fig.  6.  u  is  the  torque  vector  applied  to  the 
joints,  M  is  the  inertia  matrix,  V  is  the  vector  of 
centripetal  and  Coriolis  forces,  F  stands  for  Coulomb 
friction.  M  and  V  can  be  written  explicitly  as 

M(q)  =  { Pl  +2P}  003(92 }  P*  cos(?2^ 

Vp2+p,coS(q2)  p2  J  (J 


v(qA) 


-q2(lqx  +q2)p3  sin (q2j 
q\pz  sin(#2 ) 


(9) 


where  p{  =  3.1877,  p2  =  0.1 168  and  /?3  =  0.1630  .  These 
values  are  computed  by  considering  the  various  mass, 
length  and  inertia  parameters  of  the  arm  and  the  direct 
drive  motors  in  Table  1.  The  expressions  (8-9)  and  the 
parameters  above  are  given  to  present  a  detailed 
description  of  the  dynamical  system  but  they  are  not  used 
in  the  controller  structure. 

A  TMS320C30  DSP  based  system  is  used  to  control 
the  arm  [15].  The  user  interface  is  on  a  80486  PC 
machine.  C  language  servo  routines  are  compiled  in  this 
environment  and  downloaded  to  the  DSP.  Sampling 
times  in  the  range  of  50-100  microseconds  can  be 
achieved  for  simple  control  methods.  The  sampling  time 
used  in  this  work  is  3  ms.  The  torque  motors  used 
provide  position  signals  with  a  resolution  of  153600 
pulses/rev. 

This  robot  exhibits  highly  nonlinear  characteristics  and 
there  exists  a  high  degree  of  coupling  between  the  links. 
Conventional  independent  joint  PD  and  PID  controllers 
therefore  prove  to  be  inadequate  when  a  high 
performance  trajectory  following  is  a  requirement, 
especially  when  the  motion  is  fast.  In  the  following, 
details  are  given  on  how  a  simple,  on-line  adapting 
inverse  dynamics  controller  is  designed  and 
implementation  results  are  presented. 

The  following  information  about  the  robot  model  is 
used  to  obtain  a  simple  rule  base.  In  general  six  inputs, 
the  two  joint  positions,  two  joint  velocities  and  two 
accelerations,  are  involved  in  the  dynamics.  For  the 
described  structure,  the  base  joint  position  does  not  enter 
dynamics  equations  and  therefore  we  have  to  deal  with  5 
inputs  only.  To  simplify  the  rule  base,  two  fuzzy  sets  are 
used  to  characterize  the  size  of  each  input  variable  and 
two  rules  per  input  variable  are  used  for  each  subsystem. 
The  elbow  position,  since  it  enters  expressions  describing 
inertial  effects  and  since  it  also  appears  in  Coriolis  and 
centripetal  force  relations,  is  represented  by  four  fuzzy 
sets.  Four  rules  are  dedicated  to  incorporate  the  effect  of 
the  elbow  position  to  the  dynamics  of  the  robot.  Although 
most  of  the  time  outputs  depend  on  more  than  one  input 
simultaneously  in  a  coupled  manner,  simultaneous 
evaluation  of  input  variables  is  not  carried  out.  In  cases 
with  more  than  one  possible  outputs,  one  of  them  is 
selected  randomly  when  generating  the  initial  rule  base. 
This  resulted  in  a  simple  rule  base,  which  could  hardly 
perform  by  itself  without  adaptation.  This  rule  base, 
however,  has  a  sufficient  number  of  rules  for  the 
adaptation  algorithm  to  work  on  and  create  outputs  to 
match  the  dynamics  of  the  controlled  plant.  Table  2 
shows  the  inputs  for  the  individual  subsystems.  In  this 
table  D1  stands  for  the  inertial  effects,  Cl  is  centripetal 
and  Coriolis  forces  for  the  base.  D2  and  C2  are  similarly 
defined  for  the  elbow  joint.  The  parameters  to  be  adjusted 
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are  the  centers  and  widths  of  the  input  membership 
functions  and  the  output  constants  for  rules. 

Table  2.  Inputs  and  parameter  numbers  for  subsystems 


Sub¬ 

system 

Inputs 

Number 
of  rules 

Number  of 
parameters 

D1 

6 

36 

Cl 

6 

D2 

6 

36 

C2 

QiAxAi 

6 

In  the  last  column  of  Table  2,  the  numbers  of  the 
parameters  to  be  adjusted  for  each  joint  are  given. 

At  this  level  of  the  design,  considering  the  computed 
robot  model,  smaller  subsystems  inside  the  four 
subsystems  can  be  defined.  However,  this  would 
complicate  the  rule  base  instead  of  making  it  simpler  and 
no  further  subsystems  inside  the  mentioned  ones  are 
assigned  in  the  presented  work.  The  initial  membership 
widths  are  all  equal  to  1.  Initial  centers  for  positions  are 
chosen  in  the  45°  neighborhood  of  the  zero  position 
indicated  in  Fig.6.  Output  constants  are  very  roughly 
initialized  in  a  “assign  zero  or  maximum”  manner  even 
disregarding  possible  sign  changes.  The  only  information 
used  about  the  dynamics  in  the  output  constant 
initialization  is  knowledge  of  the  general  structure  and  a 
rough  knowledge  of  the  link  weights.  In  that  way,  the 
performance  of  the  fuzzy  identifier  with  minimum 
information  available  will  be  tested.  The  reference  curves 
applied  for  die  joints  given  in  Fig.  7  are  applied  to  the 
joints  simultaneously. 

Due  to  lack  of  space,  only  resulting  error  and  torque 
curves  for  the  base  joint  are  presented  in  the  figures. 
Similar  performance  is  obtained  for  the  elbow.  A  step 
disturbance  torque  is  applied  to  the  base  to  test  the 
recovery  performance  of  the  proposed  method.  The 
magnitude  20  %  of  the  maximum  torque  output  which 
can  be  achieved  by  the  base  motor.  The  step  disturbance 
is  shown  in  Fig.  8.  To  present  a  comparison  with 
conventional  control  techniques,  experiments  are  carried 
out  with  a  well  tuned  PID  controller  as  well. 

Figure  9  shows  the  error  curve  obtained  by  the  PID 
controller.  The  peak  on  the  error  curve  is  due  to  the 
applied  disturbance.  In  Fig.  10,  the  error  curve  with  the 
fuzzy  identifier  method  is  given.  It  can  be  observed  that 
the  recovery  after  the  disturbance  is  faster  and  the  error 
caused  by  the  disturbance  is  smaller  when  comparing 
with  the  PID  controller. 

As  mentioned  above  and  illustrated  in  Fig.  4,  the  total 
control  input  in  the  fuzzy  identification  method  consists 


of  the  output  of  the  fuzzy  system  and  a  PD  control  input. 
Fig.  1 1  shows  that  the  fuzzy  identifier  output  converges 
to  the  total  control  input  and  the  PD  control  signal 
diminishes.  The  fuzzy  model  takes  over  the  controlling 
task  quickly  although  its  initial  parameters,  by  purpose, 
are  not  selected  suitable  for  convergence. 


Figure  7.  Reference  positions  used  for  base  and  elbow  joints. 
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Figure  8.  Step  disturbance  applied  to  the  base.  The  height  of 
the  step  corresponds  to  20  %  of  the  torque  output  capacity  of 
the  base  direct  drive  motor. 
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Figure  9.  Base  joint  position  error.  PID  control. 
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Figure  10.  Base  joint  position  error.  Fuzzy  identifier  method. 
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Figure  11.  Total  control  input  (gray  curve)  and  its  fuzzy 
control  component  (solid  black)  for  the  base  joint.  Fuzzy 
identifier  method. 


5.  Conclusions 

An  on  line  robot  dynamics  learning  and  control  method 
is  presented  in  this  paper.  The  suggested  scheme  uses 
only  limited  information  about  the  manipulator.  By  the 
use  of  fuzzy  logic,  this  information  is  incorporated  in  the 
model  structure.  Rule  base  reduction  is  carried  out  in 
design  and  equipped  with  the  back-propagation 
algorithm  presented,  the  fuzzy  logic  system  with  a  small 
number  of  rules  performs  well  in  identifying  an  inverse 
model.  The  fact  that  the  adjusted  fuzzy  system  variables 
have  clear  physical  interpretations  has  been  intensively 
used  in  the  model  development.  This  is  a  significant 
advantage  of  the  fuzzy  identifier  over  neural  identifiers. 
Experimental  work  indicated  good  identifying 
performance  of  the  method. 
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Abstract 

Manufacturers  have  been  living  under  the  pressure  of 
both  strategic  and  organisational  challenges  that  come 
from  the  Information  Technology  (IT)  all  around  the 
world  especially  for  the  last  five  years.  In  this  rapidly 
changing  world  demand  to  produce ,  expand \  develop,  and 
advance  has  become  overwhelming.  International 
competition  for  goods  and  services  has  gone  far  beyond 
national  boundaries.  Change  affects  people ,  technology, 
organisational  structures,  information  management,  and 
of  course  the  way  manufacturers  operate.  This  paper 
examines  the  importance  of  Internet  Commerce  and 
attempts  to  develop  a  model  for  its  implementation  in 
intelligence  manufacturing  management.  It  is  believed 
that  this  model  is  effective  for  manufacturing  management 
in  this  rapidly  changing  global  business  environment  and 
for  the  21st  century. 


1.  Introduction 

The  Internet-Based  Electronic  Commerce  (in  short  form 
Internet  Commerce  or  E-Commerce)  is  rapidly  emerging 
as  an  entirely  new  method  to  conduct  business  and  to 
interact  with  customers,  suppliers  and  partners.  Electronic 
Commerce  covers  many  aspects  of  buying/selling 
relationships.  Electronic  Commerce  covers  many 
operations  within  production  processes.  Manufacturers 
(from  small  to  very  large)  are  scarcely  implementing  E- 
Commerce  over  the  Internet  to  gain  strategic  advantage 
over  their  rivals  and  not  to  be  out  of  business  in  this  very 
rapidly  changing  and  competitive  global  business 
environments  (Cronin.  1996). 

The  Internet  Commerce  has  become  essential  for  the 
survival  of  manufacturing  companies  entering  the  virtual 
distribution  marketplace.  The  business-to-business  and 
business-to-consumer  segments  of  the  E-Commerce 
contain  both  common  and  differing  characteristics  that 
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must  be  understood  to  succeed  in  implementation  (Gide  & 
Soliman.  1998). 

A  number  of  dominant  industries  have  jumped  on  the 
Internet  bandwagon.  Examples  are:  the  IT  industry, 
manufacturers,  publishers,  retailers,  banks  and  financial 
institutions,  airlines  and  others  as  shown  in  the  following 
table  (Table  1). 

Table  1.  Examples  of  businesses  using  the  Internet- 
based  e-commerce 


:|:§  Application  of  Jnteroet*Based  E* : : 

Commerce  ;||f  I 

Manufacturing 

Thousands  of  manufacturers  use  E-Commerce 
from  supplier  relations  to  customer  service. 

^j^-"  Retail'  Ky;:; 

Many  retailers  m  now  using  E-Commerce  on 
the  Internet  with  thousands  ofon-fitieprociucts’ 
ftnd  services  are  offered. 

Airlines 

Hundreds  of  airlines  have  Web  sites  and  doing 
business  with  E-Commerce  from  order 
receiving  to  payment  for  the  ticket. 

There  are  more  than  200  franks  on  the  Web 
from  30  countries.  - 

Media  & 
Publishing 

Many  publishing  houses  have  developed  Web 
versions  of  traditional  print  media,  and  entirely 
new  Electronic  Magazines. 

According  to  the  inaugural  report  from  Forrester 
Research’s  Business  Trade  &  Technology  Strategies 
service,  the  value  of  goods  and  services  traded  between 
companies  over  the  Internet  is  expected  to  skyrocket  from 
$8  billion  this  year  to  $327  billion  in  the  year  2002. 
According  to  the  same  report,  businesses  are  aggressively 
adopting  inter-company  trade  over  the  Internet  because 
they  want  to  cut  costs,  reduce  order-processing  time,  and 
improve  information  flow. 

Mougayar  (1997)  noted  that  perhaps  the  most 
significant  outcome  of  E-Commerce  is  the  positive 
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change  it  brings  to  buyer-seller  relationships.  The  most 
successful  users  of  E-Commerce  have  recognized  that 
increased  familiarity  with  customers,  dealers  or  suppliers 
afforded  by  joint  systems  leads  to  collaborative 
behaviours,  which  improve  economic  performance  for 
both  partners.  Organisations  that  want  to  stay  in  business 
beyond  to  the  turn  of  the  century  must  re-evaluate  every 
aspect  of  their  strategy  and  operations  and  incorporate 
these  technological  changes. 

Full  implementation  of  Electronic  Commerce  over  the 
Internet  is  very  much  in  its  infancy  despite  there  has  been 
phenomenal  growth  in  commercial  presence  on  the 
Internet  in  recent  times.  In  the  last  2  years  the  commercial 
domain  registrations  on  the  entire  Internet  have  grown  to 
represent  some  85%  of  all  organizations.  Electronic 
trading  opportunities  offer  businesses  the  chance  to 
compete  for  an  international  scale.  These  Electronic 
Trading  Opportunities,  are  being  expanded  to  Web  sites 
and  many  trading  forums  emerging. 

E-Commerce  implementation  varies  significant  by 
industry.  According  to  a  recent  survey,  retailers  and 
wholesalers  were  far  more  likely  to  implement  E- 
Commerce  compared  with  manufacturers.  In  addition, 
consumer  goods  industries  led  industrial  goods  industries 
in  E-Commerce  implementation.  E-Commerce 
implementation  by  manufacturers  is  accelerating,  driven 
by  successes  of  early  adopters.  Costs  associated  with  E- 
Commerce  implementation  are  level  of  system  integration 
and  communication,  hardware  and  software  and  staff 
training  costs.  A  typical  E-Commerce  business 
transaction  cycle  in  a  supplier-retailer-customer 
(consumer)  channel  could  include  electronic  transmission 
of  purchase  order,  customer-buying  schedule,  advanced 
shipping  notice,  invoice  and  electronic  funds  transfer  of 
payment  (EFT)  or  e-cash. 

2.  Globalization  Effects 

The  1990s  and  beyond  will  present  both  strategic  and 
organizational  challenges  to  manufacturing  operations 
management.  On  the  one  hand,  corporations  are  being 
forced  to  develop  strategies  that  will  allow  them  to  be 
efficient,  responsive,  and  innovative.  The  global  economy 
is  rapidly  evolving  into  an  integrated  economic  system, 
which  presents  tremendous  opportunities  to  aggressive 
global  manufacturers  while  threatening  the  veiy  existence 
of  the  mediocre  manufacturer.  Competitive  survival 
depends  on  the  firm’s  ability  to  understand  the  changing 
global  environment  and  to  adopt  the  emerging  rules  of 
global  strategy  (Soliman  &  Gide,  1997). 

Globalization  has  been  a  product  of  change  caused  by 
economic,  technological,  and  competitive  factors.  Three 
principal  economic  forces  have  driven  companies  and 


industries  toward  globalization:  economies  of  scale, 
economies  of  scope,  and  national  differences  in  the 
availability  and  cost  of  resources.  These  forces  have 
caused  companies  to  specialize  and  standardize  their 
products.  This  competitive  strategy  has  became  known  as 
global  chess  and  can  only  be  played  by  companies  that 
message  their  worldwide  operations  as  interdependent 
units  implementing  a  coordinated  global  strategy 
(Alkhafaji,  1995).  One  of  the  more  salient  aspects  of  the 
new  global  marketplace  is  the  “ borderless  world ’  -  a 
world  in  which  capital  is  more  mobile  than  ever  and 
relevant  competitive  response  times  have  been 
dramatically  reduced  (Ohmae,  1989).  This  borderless 
world  has  made  the  need  for  decision  making  based  on 
coordinated  global  strategies  more  compelling  than  ever. 

Vastag  (1994)  stress  that  in  this  rapidly  changing  world, 
not  only  internal  manufacturing  processes  but  also  its 
external  logistical  infrastructure  need  to  be  changed. 
Manufacturers  require  new  supporting  infrastructures  to 
compete  successfully  in  quickly  changing  global  markets 
demanding  flexibility  and  timely  delivery.  The  movement 
of  synchronized  manufacturing  and  an  elimination  of 
buffer  stock  mandate  improved  processes  -  including  the 
procurement  of  the  right  material,  at  the  right  place,  at  the 
right  time.  Most  authors  agree  that  time  and  flexibility  will 
be  the  new  driving  forces  of  competitiveness,  and  quality 
is  becoming  more  and  more  an  order  qualifier  rather  than 
order  winner. 

According  to  Stalk  and  Hout  (1990),  time-based 
competitors  built  their  companies  around  customer  needs 
by  redesigning  and  compressing  work  processes  in  order 
to  more  directly  provide  those  needs.  Time-based  are 
effective  in  minimizing  costs  and  maximizing  the  value 
added  to  customers  because  longer  development  times, 
cycle  times,  and  lead  times  invariably  cause  higher  costs. 
Stonich  (1990)  stresses  that  implementing  a  time-based 
strategy  requires  drastic  changes  in  company  culture, 
structure,  systems,  and  the  way  the  work  is  accomplished. 
The  plan  that  he  offers  for  becoming  a  time-based 
competitor  is  centered  on  establishing,  planning,  and 
assessing  the  feasibility  of  a  time-based  strategy. 
According  to  Fawcett  (1992),  integration  of  logistics  into 
the  design  and  management  of  global  manufacturing 
networks  is  critical  to  the  success  of  a  global 
manufacturing  strategy.  Therefore,  the  role  of  the  Internet 
based  E-Commerce  in  manufacturing  can  not  be 
underestimated  and  should  be  taken  into  account  to 
survive  and  become  competitive  in  the  world  market. 

3.  E-commerce  over  the  Internet 

Full  implementation  of  Electronic  Commerce  over  the 
Internet  is  very  much  in  its  infancy.  A  limited  number  of 
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pilots  to  provide  a  trading  scenario  have  taken  place  in  the 
US  and  other  countries.  However,  there  has  been 
phenomenal  growth  in  commercial  presence  on  the 
Internet  in  recent  times.  In  the  last  2  years  the  commercial 
domain  registrations  on  the  entire  Internet  have  grown  to 
represent  some  85%  of  all  organizations.  Facts  and 
figures  from  industry  show  that: 

•  Internet-Based  E-Commerce  is  expected  to  reach 
$150  billion  by  the  year  2000  and  more  than  $1 
trillion  by  the  year  2010; 

•  Sales  generated  via  the  Web  have  grown  from  $17.6 
million  in  1994  to  nearly  $400  million  in  1995  (a 
growth  rate  of  over  2100%); 

•  The  number  of  sites  using  the  Internet  for  product 
transactions  has  increased  from  14%  in  1995  to  34% 
in  1996  and  to  a  projected  increase  of  44%  in  the  next 
3  years. 

Electronic  trading  opportunities  offer  businesses  the 
chance  to  compete  for  an  international  scale.  These 
Electronic  Trading  Opportunities,  are  being  expanded  to 
Web  sites  and  many  trading  forums  emerging.  The 
Internet  has  reduced  tine  number  of  letters,  voice  calls  and 
faxes  around  the  globe.  Thirty  per  cent  of  Internet  users  in 
one  survey  stated  that  Internet  usage  had  resulted  in  new 
business  opportunities  and  43%  said  that  it  has  increased 
productivity. 

4.  Driving  Forces  for  E-Commerce 

Companies  start  their  E-Commerce  owing  to  different 
reasons.  However,  based  on  the  literature  survey,  there 
are  some  typical  driving  forces  for  companies  to  embark 
on  E-Commerce  implementation.  According  to  the  ABC 
News  (February  1998),  the  E-Commerce  implementation 
across  the  Australian  businesses  doubling  every  six 
months  due  to  increase  of  sales  motivation.  Various 
surveys  around  the  world  and  in  Australia  show  that  sale 
focused  motivational  elements  were  the  major  driving 
forces  to  E-Commerce  implementation  (Gide  &  Soliman, 
1997).  Some  of  these  elements  are: 

•  Reduce  cost  of  operations 

•  Increase  sales  and  profit 

•  Pressure  from  major  customers’  and  suppliers’ 
request 

•  Satisfy  the  needs  of  customers 

•  Ability  to  capture  more  market  share 
•.  Pressure  from  competitors  (rivals) 

•  Reach  potential  customers 

•  Improve  company’s  image 

•  Electronic  (paperless)  work 

•  Increase  productivity 

•  Pressures  from  globalization 


♦  Achieve  competitive  advantage 

It  is  clear  that  the  pro-active  drives  in  E-Commerce 
implementation  are  related  somehow  to  the  Just-in-Time 
(JIT)  philosophy.  Thus,  E-Commerce  can  be  treated  as  an 
effective  tool,  which  contributes,  to  the  JIT  process. 

5.  Manufacturing  Benefits  of  E-Commerce 
Implementation 

According  to  Mougayar  (1997)  there  are  various  types 
of  key  measurements  that  must  be  tracked  prior  to 
embarking  on  a  full  implementation.  Some  of  the 
important  key  elements  to  measure  business  value  are: 

♦  Reducing  costs. 

♦  Process  simplification. 

♦  Improving  customer  service. 

♦  Generating  new  revenue. 

♦  Taking  faster  decisions. 

With  Internet  Commerce  already  headed  for  $8  billion 
in  1997,  up  1,000%  from  1996,  Forrester  looked  at  which 
industries  are  at  the  center  of  the  dramatic  growth.  Three 
different  company  types  were  identified  as  shown  in  the 
following  table  (Table  2). 

Table  2.  Percentage  distribution  of  usage  of  Internet- 
based  e-commerce  within  three  industries. 


1997 

1397 

Manufacturers  of 
electronics  and  aeroplane 
parts 

37.5% 

3 

..Vidors  of 

:vs 

:  •  •  3  '  ::.v ' 

Services  and  utilities 
providers 

25% 

2 

Total 

100% 

8 

6.  Current  Challenges  to  Internet-Based  E- 
Commerce 

Like  any  other  IT  tools,  E-Commerce  is  not  without 
drawbacks  and  limitations.  There  are  two  main  drawbacks 
or  challenges  in  using  Internet-Based  E-commerce,  these 
are:  security  issues  and  payment  tools. 

On  the  other  hand,  challenges  from  the  internal 
organization  caused  one  of  the  biggest  obstacles  to  the  E- 
Commerce.  This  is  because  E-Commerce  can  be  treated 
as  a  stimulus  in  a  revolution.  Some  companies  do  not 


have  a  clear  budget  on  E-Commerce  implementation. 
However,  the  managers  would  provide  the  necessary 
resources  if  E-Commerce  comes  to  the  priority  of  the 
company  targets.  The  various  internal  difficulties 
encountered  by  the  companies  during  the  E-Commerce 
implementation  process  are:  lack  time,  lack  clear  budget, 
benefits  can  not  be  evaluated,  overloading  of  work, 
compete  for  resources,  insufficient  training,  not  supported 
by  employees.  External  difficulties  came  from  three 
sources;  customers,  suppliers  and  consultants. 

Some  international  surveys  show  many  manufacturers 
found  that,  full  E-Commerce  implementation  was 
expensive,  time  consuming,  too  formalized  and 
impersonal.  The  direct  costs  of  obtaining  and  maintaining 
E-Commerce  are  the  computer  software  and  hardware  and 
IT  consultation  fees.  Some  businesses  do  not  consider  that 
the  implementation  can  induce  new  orders  although  E- 
Commerce  can  improve  the  quality  image  of  the 
company.  Accordingly,  some  manufacturers  considered 
that  the  relatively  high  initial  software  and  hardware  costs 
of  full  implementation  and  security  and  privacy  concerns 
would  prevent  them  from  starting  the  E-Commerce,  no 
other  factors  appear  to  have  a  dominant  effect.  However, 
some  other  common  drawbacks  include  the  followings: 

•  Lack  of  knowledge  on  E-Commerce 

•  The  belief  that  total  benefits  cannot  cover  total  costs 

•  Inappropriateness  of  E-Commerce  in  their  industry 

•  E-Commerce  will  not  be  supported  by  employees 

•  Extensive  IT/IS  changes  are  required 

•  Top  management  doesn’t  know  E-Commerce  and 
hence  doesn’t  recognize  its  benefits. 

Accordingly,  companies  with  larger  employee  size  are 
more  likely  to  implement  E-Commerce  or  to  be  in  the 
process  of  implementation.  This  suggests  that  the 
accessibility  to  other  sufficient  resources  in  larger 
companies  enable  them  to  start  the  implementation 
process  earlier  and  obtain  the  system  quicker  than  the 
comparatively  smaller  companies. 

7.  Managerial  Implications  of  E-Commerce 
Implementation 

To  date  the  major  benefits  from  the  Internet  include 
improved  internal  and  external  communications.  The  Web 
has  specifically  brought  a  new  marketing  medium  and 
enhanced  information  resource.  Innovative  applications 
are  starting  to  appear  which  allow  for  sales  and  database 
interrogation.  Other  benefits  such  as  e-mail  and  file 
transfer  functionality,  Web  utilization  gave  many 
companies  Internet  presence '  and  provided  them  with 
opportunities  to  develop  and  expand  new  services.  In 
manufacturing,  traditionally  Design  Engineering. 


Procurement  and  Production  Departments  communicate 
with  each  other  using  paper  based  methods.  However  the 
introduction  of  Internet-Based  E-Commerce  and  its 
superiority  of  over  traditional  EDI  is  adding  new 
dimension  to  reducing  the  cost  of  manufacturing  (Soliman 
&Gide,  1997). 

Research  shows  that  resource  availability  plays  a  vital 
role  in  enabling  manufacturers  to  start  E-Commerce 
implementation.  The  motivations  from  implementation 
are  sales  focused  and  internal  focused.  Sales  focused 
motivation  is  defensive  in  market  survival  and  offensive 
in  acquiring  competitive  advantages.  Internal  focused 
motivation  is  originated  from  the  desire  of  senior 
management  that  internal  benefits  can  be  obtained  on  the 
successful  implementation  of  E-Commerce  over  the 
Internet.  A  commitment  of  the  decision-maker  to  E- 
Commerce  facilitates  a  more  direct  and  quick  allocation 
of  resources  to  the  process.  With  the  influence  of  senior 
management,  a  more  committed  workforce  is  expected. 
This  would  ultimately  lead  to  better  results  of  the  E- 
Commerce  implementation. 

Moreover,  senior  management  of  many  organizations 
have  in  common  the  desire  enforces  the  persistent 
commitment  of  management  by  devoting  the  time  and 
providing  the  resources  to  support  the  E-Commerce 
process.  Gaining  management  commitment  and 
communicating  that  commitment  credibly  to  employees  is 
one  of  the  success  factors  for  achieving  E-Commerce 
implementation. 

8.  Developing  a  Model  of  E-Commerce 
Implementation 

To  cope  with  these  driving  forces,  people 
(executive/employee)  involved  in  the  system  are 
identified  as  a  factor  for  the  successful  implementation  of 
the  E-Commerce.  In  this  model,  external  forces  (customer 
expectation/satisfaction)  and  internal  forces  (strategic 
/competitive  benefits)  provide  the  impetus  for 
implementation.  These  are  realized  in  market/customer 
demand  and  implementation  of  processes,  both  of  which 
require  the  commitment  and  involvement  from  the 
personnel  in  the  company. 

There  is  a  need  for  improving  the  understanding  (know¬ 
how),  belief/trust  and  communication  channels  and  these 
form  the  fundamental  element  in  the  strategy  of  E- 
Commerce  implementation.  More  importantly,  the 
continuous  commitment  of  people  is  necessary  to  ensure 
that  the  implementation  can  be  maintained  in  subsequent 
review  procedures.  A  model  for  visualizing  the  success 
factor  and  providing  the  basis  for  developing  the  strategy 
is  illustrated  in  the  Figure  1  below. 
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8.1.  Success  Factors  of  E-Commerce 
Implementation 

According  to  the  research  major  forces  behind  E- 
Commerce  implementation  are  market/customer  demand 
and  benefits  derived  from  the  implementation  and 
competitive  advantage.  Most  manufacturers  started  their 
E-Commerce  implementation  in  response  to  their 
customers’  request;  the  requirement  of  the  potential 
customers.  These  provide  the  practical  evidence  that 
market/customer  demand  is  the  major  drive-initiating 
people  commitment  toward  E-Commerce.  Manufacturers 
seeking  E-Commerce  implementation  also  believe  that 
they  can  achieve  competitive  advantage  over  their 
competitors.  In  fact,  the  research  shows  that  E-Commerce 
user  manufacturers  are  picking  up  market  shares  from 
other  non-user  suppliers.  Also,  E-Commerce  user 
manufacturers  are  seeking  company  growth  by  expanding 
new  overseas  (global)  markets  in  which  E-Commerce 
usage  is  a  must  (Gide  &  Soliman,  1998). 

The  continuous  demand  of  E-Commerce  is  the  result  of 
promotion  of  the  quality  by  government,  customers, 
suppliers,  competitors,  and  industries.  However,  people 
will  only  adopt  E-Commerce,  which  is  beneficial  and 
practical  to  their  organizations.  There  are  internal  driving 
forces,  which  are  additional  to  the  external  forces  to  push 
the  manufacturers  toward  the  need  to  implement  E- 
Commerce  over  the  Internet. 

8.2.  Successful  E-Commerce  Implementation 
Processes 

The  successful  E-Commerce  implementation  strategy  in 
manufacturing  relies  on  management  and  employee 
commitment  and  active  involvement.  This  requires 
detailed  planning  of  the  activities.  The  model  in  Figure  1 
is  realized  through  the  process  elements,  which  are 
closely  related  to  critically  affecting  the  behaviour  of 
people  (executive/staff/employee). 

Management  should  make  an  informed  decision  to 
implement  E-Commerce.  This  commitment  is  essential 
because  the  E-Commerce  process  requires  not  only  the 
involvement  of  the  management,  but  also  the  resources 
that  only  the  management  can  allocate.  In  small 
manufacturers,  it  is  not  always  a  case  to  establish  a  formal 
E-Commerce  information  management  project  team  as  in 
large  corporations.  However,  the  information  manager  or 
executive/senior  manager  is  the  one  who  has 
responsibility  to  ensure  that  the  E-Commerce 
requirements  are  successfully  implemented  as  planned. 


Figure  1.  A  Model  for  successful  e-commerce 
implementation  over  the  Internet 
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Accordingly,  the  information  technology  manager/s  or 
external  IT  consultant/s  should  provide  training  sessions 
on  E-Commerce  knowledge.  Besides,  it  is  an  effective 
way  to  visualize  the  management  commitment  toward  E- 
Commerce  implementation.  The  fundamental  process 
towards  successful  implementation  of  E-Commerce  is 
shown  in  Figure  1. 

Along  with  the  adaptation  process,  the  organization 
needs  to  determine  its  starting  point  to  continuously 
measure  progress  and  evaluate  the  benefits  gained  from 
E-Commerce  implementation  (Gide  &  Soliman,  1998). 
Gaps  of  process  steps  can  be  identified  when  comparing 
the  current  system  with  the  E-Commerce  requirements.  It 
enables  the  management  to  determine  the  necessary 
efforts  and  resources  to  be  allocated  to  the  areas  of 
importance. 

9.  Conclusions 

This  research  is  designed  to  propose  a  model  for 
successful  implementation  of  E-Commerce  over  the 
Internet  in  manufacturing  industry.  This  paper  also 
examines  the  importance  of  Internet  Commerce  in  the 
context  of  implementation  strategies. 

Commerce  over  the  Internet  is  very  much  in  the  early 
stages.  Therefore,  success  in  Internet-Based  Electronic 
Commerce  depends  on  how  organizations  strategically 
position  their  products  and  services  through  other 
Internet-based  electronic  communities  and  intermediaries, 
as  well  as  on  how  they  facilitate  the  interactions  with  their 
customers,  suppliers,  and  partners. 

The  main  obstacles  to  E-Commerce  implementation  for 
manufacturers  are  security,  privacy,  payment  tools, 
cultural  and  technical.  Cultural  issues  are  challenges 
linked  to  a  major  re-engineering  exercise  accompanied  by 
resistance  to  change.  Technical  issues  are  high 
implementation  costs,  inadequate  resources  and 
insufficient  external  assistance.  However,  it  is  expected 
that  Internet  will  made  major  advances  into  redefining 
manufacturing  away  from  physical  driven  processes  and 
towards  an  information-intensive  system. 
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Abstract 

It  is  shown  that  genetic  algorithms  can  be  employed 
to  determine  the  optimal  short-run  inventory  levels  in 
unreliable  manufacturing  systems.  Furthermore,  it  is 
indicated  that  such  inventory  levels  can  be  used  as 
the  set-points  of  gain-scheduled  adaptive  controllers 
for  such  systems.  The  performance  of  such  a  gain- 
scheduled  adaptive  controller  is  illustrated  for  a 
particular  unreliable  manufacturing  system  subjected 
to  a  particular  piecewise-constant  demand.  It  is 
demonstrated  that  the  performance  of  this  adaptive 
controller  is  superior  to  that  of  a  genetically 
optimised  non-adaptive  controller  in  this  case. 


L  Introduction 

In  modem  industry,  it  is  important  that 
manufacturing  systems  be  controlled  optimally  so  as 
to  minimise  inventory  costs  whilst  meeting 
production  schedules  [1]  [2].  However,  the 
dynamical  behaviour  of  manufacturing  systems  is 
typically  very  complex  and  the  synthesis  of  optimal 
control  policies  for  such  systems  is  accordingly  rather 
difficult.  This  difficulty  is  exacerbated  in  the  case  of 
unreliable  manufacturing  systems  by  the  presence  of 
failure-prone  machines.  Indeed,  in  the  case  of  such 
unreliable  systems,  theoretical  results  are  currently 
available  only  in  the  case  of  single-machine 
manufacturing  systems  producing  either  a  single  part- 
type  [3]  [4]  or  multiple  part-types  [5].  These  results 
also  apply  only  to  the  minimisation  of  expected  costs 
incurred  in  the  long  run  for  constant  demand  rates. 
Nevertheless,  these  results  [3]  [4]  [5]  provide  very 
important  information  regarding  the  optimal 
inventoiy  levels  to  be  used  in  unreliable 
manufacturing  systems. 

In  this  paper,  it  is  shown  that  genetic  algorithms 
can  be  employed  to  determine  the  optimal  inventory 


levels  to  be  used  in  such  systems  in  the  short  nin.  It 
is  shown  that  such  optimal  short-run  inventory  levels 
depend  (for  given  maximum  production  rate,  mean 
time  between  failures,  mean  time  for  repair,  and 
inventoiy  weighting  parameters)  on  die  demand  rate 
as  well  as  on  die  initial  state  of  a  manufacturing 
system  This  genetic  stochastic  optimisation 
procedure  readily  provides  a  ’look-up'  table  diat  can 
be  used  for  the  gain-scheduled  adaptive  control  of 
unreliable  manufacturing  systems  with  variable 
demands  Thus,  consider  diat  such  a  manufacturing 
system  is  subjected  to  a  piecewise-constant  demand; 
dien,  given  die  state  of  die  system  at  die  beginning  of 
a  piecewise-constant  interval  of  demand,  die 
corresponding  optimal  short-run  inventoiy  level  can 
be  immediately  determined  from  the  relevant  'look¬ 
up'  table.  This  optimal  inventoiy  level  can  dien  be 
used  as  die  set-point  for  a  crisp-logic  (or  fuzzy-logic) 
controller  [6]  [7]  [8]  undl  the  beginning  of  the  next 
piecewise-constant  interval  of  demand.  This  process 
can  be  repeated  indefinitely  and  thus  provide  gain- 
scheduled  adaptive  control  for  unreliable 
manufacturing  systems  with  variable  demands. 

The  performance  of  such  a  gain-scheduled 
adaptive  controller  is  illustrated  in  this  paper  for  a 
particular  unreliable  manufacturing  system  subjected 
to  a  particular  piecewise-constant  demand.  The 
optimal  short-run  inventoiy  levels  embodied  in  die 
gain-scheduled  adaptive  controller  for  this  sy  stem  are 
determined  using  die  genetic  stochastic  optimisation 
procedure  described  in  diis  paper.  Such  genetically 
determined  optimal  short-run  inventory  levels  are 
compared  with  the  theoretically  determined  optimal 
long-run  inventoiy  levels  of  Bielecki  and  Kumar  [4], 
It  is  demonstrated  that  die  performance  of  the  gain- 
scheduled  adaptive  controller  incorporating  the  'look¬ 
up'  table  for  the  optimal  short-run  inventory7  levels  is 
superior  to  that  of  a  genetically  optimised  non- 
adaptive  controller  in  this  case. 
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2.  Genetic  Optimisation  Procedure 

The  single-machine  manufacturing  systems  under 
investigation  are  governed  by  differential  equations 
of  the  form 

x(t)  =  u(t)-d  ,  (1) 

where  x(t)  is  the  inventory  level,  u(t)  is  the 
production  rate,  and  d  is  the  demand  rate.  Such 
systems  are  unreliable,  with  exponentially  distributed 
time-between-failures  and  times-for-repair  having 
respective  mean  values  Tf  and  Tf  .  The  maximum 

production  rate  is  um_v  ;  and  it  is  assumed, 

initially,  that  the  demand  rate  is  constant.  The 
inventory  level,  x(t),  is  controlled  when  the  machine 
is  'up'  by  generating  the  production  rate,  u(t),  in 
accordance  with  the  following  crisp  control  rules: 


1.  If  x(t)>  x*,  thenu(t)  =  0 

2.  If  x(t)  =  x* ,  then  u(t)  =  d 

3.  If  x(t)<x*,thenu(t)  =  umax  . 


(2) 


In  these  rules,  x*  is  the  set-point  of  the  crisp-logic 
controller.  It  is  desired  to  find  the  optimal  set-point, 

xopt  =  xopt  x(°))>  such  ^at  expected  value 
of  the  cost  function 


J(x*)==jJ{c+x+(t)+c  X  (t)}dt  (3) 
is  minimised,  where  T  is  tire  task  time, 

x  +  (0  =  max { x(t),  0 }  (4a) 

is  the  finished-parts  surplus, 


x  (t)  =max{-x(t),  0} 


(4b) 


optimisation  procedure  is  the  calculation  of  the 

expected  value  of  the  cost  function,  J(x*),  defined  in 
equation  (3)  for  specified  values  of  d  and  x(0).  This 
calculation  can  be  effected  by  introducing  tire  set, 
{aj(f),«2  (0>- of  scalar  machine-condition 

variables.  Each  such  variable,  a^(t)  (1<  j<n),  is 

equal  either  to  unity  (when  the  machine  is  ‘up')  or  to 
zero  (when  the  machine  is  'down').  In  each  machine- 
condition  variable,  the  times  for  these  transitions 
between  unity  and  zero  (and  between  zero  and  unity) 
are  instances  of  the  exponentially  distributed  times- 
to-failure  and  times-for-repair  with  respective  mean 
values  Tf  and  Tr .  Then,  for  a  fixed  value  of  the 
$ 

set-point,  x  ,  in  the  control  rules  (2),  the  cost 
function  in  equation  (3)  is  evaluated  for  each  of  the 
machine-condition  variables  in  the  entire  set 
{aj(0,a2  W,-’«n(0}thus  producing  a  set 

{ Jj ,  J2 , J n }  of  corresponding  cost  functions. 

The  expected  value  of  the  cost  function  over  the 
entire  set  of  n  machine-condition  variables  is  then 
given  by  the  equation 

*  I  u  * 

E[J(x  )]  =  -  I  J.(x  )  (5) 

nj  =  l  J 

The  Darwinian  fitness,  0(x*),  of  the  set-point,  x*  , 
can  then  be  conveniently  defined  by  the  equation 

<D(x*)  =  l/E  [J(x*)]  (6) 

Each  such  set-point  can  itself  be  conveniently 
represented  by  a  string  of  binary  digits  in  order  to 
facilitate  the  genetic  optimization  procedure. 


is  the  finished-parts  backlog,  and  c+  and  c_  are 
weighting  parameters.  In  solving  this  optimisation 
problem,  it  is  assumed  that  both  the  mean  times 
T^  and  Tf  are  fixed  and  also  that  the  demand  is 

achievable,  i.e.,  d  <umax  .  It  is  evident  that,  when 

the  machine  is  'down',  the  crisp  control  rules  are  no 
longer  effective  and  that  the  production  rate,  u(t),  is 
then  zero  regardless  of  the  value  of  the  inventory 
leVel,  x(t). 

It  is  difficult  to  determine  the  optimal  set-point, 
xopt  =xopt(d’x(0))’  using  conventional 

optimisation  techniques.  However,  such  optimal  set- 
points  can  be  readily  found  by  using  genetic 
algorithms.  The  crucial  step  in  this  genetic 


This  optimisation  procedure  begins  by  randomly 
generating  an  initial  population  of  binary  strings  in 

which  each  such  string  represents  a  set-point,  x  ,  in 
the  control  rules  (2).  Evolution  is  then  caused  to 
occur  in  this  population  of  binary  strings  in 
accordance  with  the  standard  genetic  operations  of 
crossover,  mutation,  and  selection.  In  this 
evolutionary  process,  the  Darwinian  fitness  of  each 
binary  string  is  evaluated  by  substituting  into 

equation  (6)  the  expected  value,  E[J(x*)],  of  the 
cost  function  over  the  entire  set  of  relevant  machine- 
condition  variables.  This  evolutionary  process  is 
allowed  to  continue  until  no  significant  further 
increase  is  obtained  in  the  fitness  of  the  fittest  binary 
string.  This  string  is  then  decoded,  and  thus  provides 
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the  optimal  value  of  the  set-point,  x*pt  ,  for  die 

crisp-logic  controller  for  the  specified  values  of  d  and 
x(0).  It  is  evident  that  this  genetic  optimisation 
procedure  can  be  repeated  for  any  desired  values  of  d 
and  x(0)  within  the  operational  envelope  of  the 
manufacturing  system.  In  this  way,  the  entire 
optimal  set-point  function, 

x*pt  =x*pt(d,x(°)),  required  for  gain-scheduled 

adaptive  control  can  be  readily  obtained. 

3.  Illustrative  Example 

This  procedure  for  the  design  of  gain-scheduled 
adaptive  controllers  for  unreliable  manufacturing 
systems  can  be  conveniently  illustrated  by 
considering  the  control  of  a  particular  single-machine 
manufacturing  system  governed  by  equation  (1).  In 
this  case,  x(0)  =  0,  =5s,Tf  =  1.5s, umax  =5, 

and  the  variable  demand  is  piecewise-constant  over 
intervals  of  duration  20s  as  shown  in  Figure  1;  in 

addition,  c+=land  c“=  2  in  the  cost  function 
defined  by  equation  (3).  The  first  stage  in  this  design 
process  is  to  determine  the  optimal  set-point, 

xopt  =xopt  W#x(0)),  of  the  crisp-logic  controller 

governed  by  equation  (1)  such  that,  for  initial  state 
x(0)  and  constant  demand  d,  the  expected  value  of 
die  cost  function 

J(x* )  =  1q°  { x  +  (t)  +  2x  ~  (t)}dt  (7) 

is  minimised. 


The  'look-up'  table  obtained  for  x  t  (d,  x(0))  in 

this  case,  using  the  genetic  optimisation  procedure 
described  in  Section  2,  is  displayed  in  Table  1.  In 
this  procedure,  die  expected  value  of  the  cost 
function  defined  in  equation  (7)  was  evaluated  over 
an  entire  set  of  n=100  machine-condition  variables 
using  equation  (5);  a  population  size  of  100  was  used 
in  die  genetic  algoritimi  over  100  generations;  and 
die  respective  probabilities  of  crossover  and  mutation 
were  0.6  and  0.003. 

It  is  evident  from  Table  1  diat,  as  die  demand 
increases,  die  dependence  of  die  optimal  short-run 
inventory  level  on  the  initial  state  of  the  system 
becomes  increasingly  strong.  The  corresponding 
values  of  die  optimal  long-run  inventory  levels  in  diis 
case,  as  determined  by  means  of  the  theoretical 
results  of  Bielecki  and  Kumar  [4],  are  shown  for 
comparison  in  Table  2.  These  long-run  levels  are 
clearly  different  from  the  short-run  levels  shown  in 
Table  1. 

Table  2 
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In  any  event,  the  long-run  levels  cannot  be  used 
for  gain-scheduled  adaptive  control  because  these 
long-run  levels  are  independent  of  the  initial  state  of 
the  unreliable  manufacturing  system.  However,  the 
data  contained  in  Table  1  for  the  optimal  set-point 

function,  x*pt  =  x*(d,x(0))  ,  can  be  readily 

implemented  in  the  gain-scheduled  adaptive 
controller.  Thus,  during  each  piecewise-constant 
interval  of  demand,  the  relevant  optimal  set-point  for 
the  crisp-logic  controller  governed  by  equation  (2)  is 
obtained  from  the  'look-up'  table  by  using  as  x(0)  the 
set-point  for  the  previous  piecewise-constant  interval 
of  demand.  This  procedure  yields,  in  this  case,  the 
piecewise-constant  set-point  profile  shown  in  Figure 
2.  The  performance  of  the  resulting  gain-scheduled 
adaptive  controller  can  be  conveniently  assessed  by 
evaluating  the  expected  value,  E[K],  of  the  cost 
function 

K  =  !o40{x  +  (t)  +  2x-(t)}dt  (8) 

over  an  entire  set  of  100  machine-condition  variables 
for  the  adaptively  controlled  unreliable 
manufacturing  system. 

This  expected  value  for  the  adaptive  controller  is 
E[K]  =  836.27  ,  which  is  smaller  than  the 
corresponding  value  of  E[K]  =  856.10  for  the  best 
obtainable  non-adaptive  controller.  This  non- 
adaptive  controller  was  itself  optimised  by  using 
genetic  algorithms  to  determine  the  value  of  the 
constant  inventory  set-point  (x*=2.09)  for  die  crisp- 
logic  controller  that  minimises  the  expected  value  of 
the  cost  function  given  by  equation  (8)  for  the 
variable  demand  shown  in  Figure  1.  These  values  of 
E[K]  clearly  indicate  that,  in  this  case,  the 
performance  of  the  gain-scheduled  adaptive 
controller  incorporating  the  'look-up'  table  for  the 
optimal  short-run  inventory  levels  is  superior  to  that 
of  the  best  non-adaptive  controller.  However,  it  is 
important  to  note  that  -  in  any  event  -  this  non- 
adaptive  controller  is  not  practically  realisable  since 
its  optimisation  requires  that  the  entire  demand 
profile  be  known  in  advance.  It  is  therefore 
remarkable  that  the  practically  realisable  adaptive 
controller  performs  so  well.  Indeed,  this  would  still 
be  true  even  if  -  for  some  particular  demand  profile  - 
die  performance  of  the  adaptive  controller  were 
inferior  to  that  of  the  best  obtainable  non-adaptive 
controller. 

The  time-domain  performances  of  the  adaptive 
and  non-adaptive  controllers  for  a  typical  machine- 
condition  variable  are  shown  in  Figure  3  and  Figure 


4,  respectively,  whilst  the  corresponding  trajectories 
for  the  cost  function,  K,  are  show  in  Figure  5.  It  is 
evident  from  Figure  5  diat,  in  diis  case,  the 
accumulated  cost  for  the  adaptive  controller  is 
significandy  lower  tiian  tiiat  for  the  non-adaptive 
controller. 

4.  Conclusion 

It  lias  been  shown  that  genetic  algorithms  can  be 
employed  to  determine  the  optimal  short-run 
inventory  levels  in  unreliable  manufacturing  systems. 
Furthermore,  it  lias  been  indicated  tiiat  such 
inventory  levels  can  be  used  as  die  set-points  of  gain- 
scheduled  adaptive  controllers  for  such  systems.  The 
performance  of  such  a  system  lias  been  illustrated  for 
a  particular  unreliable  system  subjected  to  a 
particular  piecewise-constant  demand.  It  has  been 
demonstrated  that  die  performance  of  this  adaptive 
controller  is  superior  to  that  of  a  genetically 
optimised  non-adaptive  controller  in  tiiis  case. 
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Figure  3:  Time-domain  performance: 
adaptive  controller 


Figure  4:  Time-domain  performance: 
non-adaptive  controller 


Figure  5:  Cost  function  comparison 
(full  line  =  adaptive  controller;  dashed  line  =  non-adaptive  controller) 
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Abstract 

The  development  of  Internet-based  E-Commerce  offers 
the  most  exciting  business  opportunities  in  the 
marketplace .  Intelligent  manufacturers  must  re-evaluate 
every  aspect  of  their  strategy  and  operations  from 
customer  service  to  marketing ,  product  development  to 
retailing,  logistics  to  distribution .  That's  why  the 
Internet-based  E-Commerce  is  becoming  essential  for 
companies  entering  the  cyber-marketplace .  This  study 
analyses  the  use  of  Internet-based  e-commerce  as  a 
strategic  tool  and  its  possible  implication  on  intelligent 
manufacturing  systems. 


utilization  of  EDI  (Electronic  Data  Interchange)  as  a 
strategic  tool  in  supply  chain  management  to  a  much 
more  comprehensive  Internet-based  Electronic 
Commerce  strategy. 

The  value  of  goods  and  services  traded  between 
companies  over  the  Internet  is  expected  to  skyrocket  from 
$8  billion  in  1997  to  $327  billion  in  2002  in  the  US, 
according  to  a  report  from  Forrester  Research. 
Businesses  are  adopting  inter-company  trade  over  the 
Internet  because  they  want  to  cut  costs,  reduce  order- 
processing  time,  and  improve  information  flow. 


1.  Introduction 

Organisations  are  facing  a  rapidly  changing  dynamic 
business  environment  and  very  fast  advances  in 
information  technology  (IT)  while  we  are  approaching  to 
the  third  millennium.  These  changes  have  occurred  due 
to  increased  global  competition,  significant  advances  in 
international  business,  and  changes  in  political  and 
economical  environments  around  the  world. 


♦  Business 
Segment 
($billfon) 
— ra —  Consumer 
Segment 
($  billion) 


Increased  internationalization  in  the  business  arena  has 
had  a  major  impact  on  business  organizations. 
Accordingly,  business  operations  all  around  the  world 
have  began  moving  from  the  traditional  methods  of 
communication  to  the  Internet  commerce  (Gide  & 
Soliman,  1998).  Meanwhile,  the  business  world  is 
witness  with  the  tremendous  growth  of  the  Internet  and 
Internet-based  Electronic  Commerce  from  manufacturing 
to  retailing,  from  insurance  to  health,  from  banking  to 
travelling,  from  food  to  hospitality  industry  especially  for 
the  last  two  years  (Soliman  &  Gide,  1997).  Bottoms 
(1995)  predicts  that  many  executives  may  soon  find  their 
companies  in  the  midst  of  a  major  transition  from  the 


Figure  1.  Worldwide  Internet  commerce  revenues 
(business  and  consumer  segments  1996-2002. 
source:  IDC  1998). 

According  to  a  report  issued  by  online  researcher 
eMarketer  (July  1998),  currently  there  are  approximately 
60  million  “ active  Internet  users ”  worldwide,  with 
Americans  accounting  for  37  million  users.  If  U.S. 
figures  are  added  for  a  total  worldwide  picture,  the 
growth  rate  will  more  than  quintuple,  from  44  million  in 
1997  to  228  million  by  year  2002.  The  worldwide 
Internet  Commerce  revenues  are  shown  in  Figure  1. 
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2.  The  Business  Process  of  Internet-Based  E- 
Commerce  in  Manufacturing 

There  are  no  exact  definitions  of  the  Internet-based  E- 
Commerce  since  it  is  still  immature.  However,  one 
definition  made  by  Kalakota  (1996),  Professor  of 
Management  at  the  University  of  Rochester  and 
electronic  commerce  author,  as  “the  process  of 
converting  digital  inputs  into  value-added  outputs 
Basically,  the  Internet  commerce  business  process 
involves  taking  information  as  raw  material  and 
producing  value  added  information-based  products  or 
services  out  of  the  original  raw  information  as  shown  in 
the  following  figure  (Figure  2).  So,  electronic  commerce 
refers  to  an  on-line  production  process  owned  by 
intermediaries. 
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Figure  2, .  Dlustration  of  the  process  of  adding  value  to 
information  in  a  manufacturing  setting 
using  the  e-commerce. 

The  figure  (Figure  2)  shows  that  the  two  models 
( Physical  and  Conceptual  Models)  in  a  manufacturing 
setting  are  similar.  In  the  Physical  Model  raw  material 
enters  the  system  and  leaves  as  finished  goods.  In 
manufacturing  the  raw  material  and  converting  into 
finished  products,  the  Internet-Based  E-Commerce  is 
used  through  two  platforms  ( User  and  Security 
Platforms).  In  the  Conceptual  Model  raw  information  is 
entered  in  the  system  and  leaves  as  processed  information 
(Soliman  &  Gide.  1997).  Producers  of  information 
interact  with  sendees  and  other  processed  information, 
such  as  orders,  payments  or  instructions.  In  reality, 
Internet  commerce  is  about  businesses  and  consumers 


adopting  a  new  process  or  methodology  in  dealing  with 
each  other.  These  processes  are  supported  by  electronic 
interactions  that  replace  close  physical  presence 
requirements  and  traditional  means. 

3.  The  Internet  Commerce  as  a  Strategic 
Advantage 

For  gaining  strategic  advantage  through  Information 
Technology  (i.e.  Electronic  Commerce)  manufacturing 
managers  must  understand  not  just  the  technology  but 
also  the  “value  chain”  in  which  their  company  operates. 
Information  technology  will  have  an  impact  on  each 
activity  along  the  value  chain.  In  fact,  information 
technology  is  transforming  the  way  value  activities  are 
performed  and  the  nature  of  the  linkages  among  them. 
These  basic  effects  explain  why  information  technology 
(Internet)  has  acquired  strategic  significance  and  is 
different  from  the  many  other  technological  business  use. 
Intelligent  manufacturing  management  via  E-Commerce 
to  gain  competitive  advantage  is  shown  in  Figure  3. 


Figure  3.  Intelligent  manufacturing  systems  via 
e-commerce  to  gain  competitive  advantage. 

The  Internet  can  make  a  significant  contribution  to 
each  components  of  a  company’s  value  chain  (Cronin, 
1994).  To  uncover  and  evaluate  new  avenues  for 
competitive  advantage  through  use  of  the  Internet, 
companies  need  to  analyze  their  relationships  with 
suppliers  and  vendors,  the  existing  role  of  information  in 
the  organization  of  the  company,  internal  production 
mechanisms,  and  the  points  of  contact  with  customers. 
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3.1.  Internet  Commerce  Manufacturing:  Suppliers 
Level 

The  Internet  provides  very  fast,  reliable  connections  to 
suppliers  around  the  world.  Companies  can 
communicates  with  vendors  in  any  location,  without 
incurring  additional  communication  costs.  It  is  difficult 
to  convey  complex  bids  or  cost  estimates  accurately  over 
the  telephone,  and  if  the  vendor  is  in  a  different  time 
zone  even  scheduling  telephone  calls  can  be  problematic. 
Large  amounts  of  data  are  also  cumbersome  to  transmit 
via  telefacsimile.  Even  overnight  delivery  of  information 
may  be  too  slow  when  critical  decisions  are  waiting  to  be 
made.  Many  vendors  offer  electronic  pricing  and 
ordering  information  to  overcome  these  limitations. 
However,  adopting  one  vendor’s  proprietary  on-line 
system  may  limit  a  company’s  flexibility  to  change 
vendors. 

In  manufacturing,  traditionally  Design  Engineering, 
Procurement  and  Production  Departments  communicate 
with  each  other  using  paper  based  methods.  However  the 
introduction  of  Internet-Based  E-Commerce  and  its 
superiority  of  oxer  traditional  EDI  is  adding  new 
dimension  to  reducing  the  cost  of  manufacturing.  In  a 
typical  manufacturing  setting  Design  Engineering 
Department  supply  design  drawings  and  specification  to 
Procurement  Department  to  procure  material,  commence 
production,  and  ultimately  delix  er  goods  to  customers  as 
per  orders.  There  are  three  types  of  flows  in  a  general 
manufacturing  setting.  These  are: 

1.  Material  flow  (examples  are  raw  material  from 
supplier,  Work-In-Process  and  Finished  Goods); 

2.  Clerical  flow  (examples  are  Drawings,  Specifications 
and  Bills  Of  Materials);  and 

3.  Information  flow  (examples  are  information  on 
parts,  suppliers,  customers  and  the  industry). 

Improvement  in  the  movement  of  raw  material,  Work- 
In-Process  and  Finished  Goods  is  likely  to  occur  as  a 
result  of  using  the  Internet-Based  E-Commerce.  The 
main  benefit  to  manufacturing  lies  in  using  the  Internet 
for  the  second  and  third  types  of  flow. 

The  number  of  parts  used  in  production  could  be  in  the 
order  of  thousands  of  items.  These  parts  are  usually 
purchased  from  suppliers  on  the  basis  of  price,  quality, 
and  delivery  on  time  and  suppliers  financial  position  and 
reputation  in  the  industry. 

Accordingly  Material  Procurement  professionals  must 
be  equipped  with  timely  and  valuable  information  on 


parts  and  their  suppliers.  The  Internet-Based  E- 
Commerce  provides  them  with  a  fast  and  efficient  way  of 
obtaining  comprehensive  information  of  the  market, 
feedback  from  the  industry  and  the  performance  of 
suppliers  (Soliman  &  Gide,  1997). 

The  following  figure  (Figure  4)  illustrates  how  clerical 
and  production  information  can  be  efficiently 
communicated  throughout  the  supply  chain  using  the 
Internet-Based  E-Commerce. 


Figure  4.  Supply  chain  communication  in 

manufacturing  using  Internet-based  e- 

commerce. 

On  the  other  hand,  electronic  distribution  of  software, 
publications,  and  other  items  provides  immediate  access 
to  these  products.  On-line  tracking  of  orders  and 
inventory  ensures  that  companies  are  aware  of  delivery 
dates,  and  reduces  delays  in  the  distribution  process. 
Many  companies  have  found  that  product  support  over 
the  Internet  significantly  reduces  the  time  lost  due  to 
system  performance  problems  (Cronin  1994).  For  some 
companies,  the  efficiencies  and  cost  savings  generated  by 
dealing  directly  with  suppliers  over  the  Internet  have 
more  than  justified  their  investment  in  the  network. 

3.2.  Internet  Commerce  Manufacturing: 

Operations  Level 

The  global  connectivity  of  the  Internet  offers 
companies  immediate  savings  in  long-distance 
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telecommunications.  A  dedicated  Internet  connection 
allows  unlimited  exchange  of  data  and  e-mail  with 
locations  around  the  world.  Even  a  low-cost,  shared  dial¬ 
up  connection  with  an  hourly  use  charge  is  more 
economical  than  long-distance  telephone  charges.  In  the 
longer  term,  the  ability  to  exchange  information  quickly 
and  easily  facilitates  the  relationships  with  business 
partners  and  customers,  encouraging  more  joint  ventures. 
For  employees,  connecting  to  an  international 
information  source  promotes  global  awareness.  It  allows 
companies  to  monitor  economic  and  political 
developments  in  countries  targeted  for  market  expansion. 

Furthermore,  information  resources  and  discussion 
groups  on  the  Internet  provide  employees  with  direct 
access  to  virtually  unlimited  advice  and  information.  By 
helping  them  to  answer  questions  and  retrieve  relevant 
materials,  the  network  connection  increases  their 
productivity.  The  Internet  also  facilitates  more  effective 
deployment  of  human  resources.  Network  links  support 
telecommuting  and  allow  small,  remote  offices  to 
participate  more  actively  in  companywide  programs  and 
contribute  to  joint  projects.  When  project  teams  are  being 
formed,  managers  can  select  members  based  on  their 
expertise,  without  regard  to  geographic  location. 

3.3.  Internet  Commerce  Manufacturing: 

Customer  Level 

The  Internet  is  a  powerful  tool  for  market  research,  for 
establishing  new  markets,  and  for  testing  customer 
interest  in  emerging  products.  Thousands  of  discussion 
groups  and  bulletin  boards  are  available  for  keeping  in 
touch  with  new  developments  through  environmental 
scanning,  as  well  as  for  direct  contact  with  customers. 
Manufacturers  and  producers  of  electronic  information  or 
software  can  deliver  these  items  instantly  to  customers 
and  collect  payment  on  the  Internet.  Electronic 
catalogues  offer  products  and  services  to  millions  of  users 
browsing  through  the  network.  Vendors  can  offer  on-line 
help  services  and  product  support  without  additional 
expense  of  dedicated  connections  to  customers.  The 
Internet  allows  direct  interactions  with  customers  to  be 
spread  through  many  divisions  of  a  company;  technical 
and  development  staff,  documentation  providers, 
production  workers,  and  researchers  find  out  first  hand 
how  customers  respond  to  company  products.  They  can 
address  problems  and  provide  customer  support  as  a 
team. 

In  addition  to  that,  strategic  use  of  the  Internet  based 
on  an  analysis  of  the  value  chain  encourages  companies 
to  focus  on  areas  where  they  can  measurably  improve 


performance.  If  a  company  decides  to  distinguish  itself 
through  the  quality  of  its  customer  service  organization, 
the  network  can  be  a  decisive  asset  in  achieving  this  goal. 
If  the  emphasize  is  on  developing  and  marketing 
innovative  products,  the  Internet  connection  will 
contribute  at  a  different  point  in  the  value  chain.  The 
benefits  of  the  Internet  will  vaiy  from  business  to 
business.  One  thing  is  certain  that  for  companies  seeking 
competitive  advantage,  the  global  network  is  essential 
management  resource. 

4.  The  Strategic  Benefits  of  the  Internet 
Commerce  for  Intelligent  Manufacturing 

In  this  rapidly  changing  environment,  businesses  are 
taking  a  look  at  their  own  organizations,  structures,  and 
processes  in  an  effort  to  become  more  competitive.  The 
Internet  is  a  very  useful  tool  for  engaging  in  business 
activities.  Many  companies  are  using  e-mail  and  group 
conferencing  to  engage  in  business  process  re¬ 
engineering  projects.  Maintaining  good  communication 
to  exchange  data  and  documents  is  critical  in  the  re¬ 
engineering  of  business  processes.  The  competitive 
benefits  of  the  Internet  Commerce  for  intelligent 
manufacturing  are  illustrated  below  in  Figure  5. 
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Figure  5.  The  competitive  benefits  of  the  Internet 
commerce  for  intelligent  manufacturing 

In  addition,  the  ability  to  have  the  latest  information 
about  our  marketplace  and  awareness  of  the  state-of-the- 
art  in  our  industry  allows  us  to  keep  our  competitive 
edge.  Learning  what  other  companies  are  doing,  knowing 
the  kinds  of  information  available,  and  discovering  new 
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markets  can  assist  a  company  in  maintaining  a 
competitive  advantage. 

More  companies  use  the  Internet  in  the  search  for  “best 
practices.”  As  businesses  tiy  to  become  more  competitive, 
many  want  to  find  existing  practices  that  can  help  them 
improve  their  activities.  Businesses  can  also  use  the 
communications  abilities  of  the  Internet  to  engage  in  a 
Total  Quality  Management  (TQM)  plan.  Some 
companies  use  the  Internet  to  maintain  corporate  process 
control  across  all  company  locations  including  even 
continents.  Many  companies  use  the  Internet  commerce 
tools  to  search  for  successful  practices  of  corporate  and 
product  improvement. 

4.1.  Just-In-Time  Communications 

The  Internet  offers  a  business  the  opportunity  for  just  in 
time  (JIT)  communications  with  people  and 
organizations  across  the  globe,  enlarging  the  visibility  of 
a  business  a  thousand  fold.  Being  on  the  Internet  allows  a 
company  to  truly  have  a  world  market.  Good 
communications  enable  more  global  corporate 
management  control,  aiding  in  consistency  of  results. 
Companies  can  be  in  touch  with  suppliers,  branches,  and 
subsidiaries  in  an  effort  to  exert  more  control  over 
variables.  Companies  can  establish,  negotiate,  and 
maintain  standards  online.  As  an  Internet  tool,  E-mail  is 
a  low  cost  method  for  maintaining  local,  regional, 
national  and  international  communication.  Messages  can 
be  exchanged  in  minutes  as  opposed  to  days  or  even 
months  using  regular  mail. 

4.2.  Corporate  Logistics 

Logistical  concerns  can  dominate  production  planning 
and  customer  sendee  issues  for  corporate.  Since  the 
Internet  is  the  anvwhere-anytime-network,  employees, 
suppliers,  customers,  and  others  can  keep  in  touch  more 
efficiently.  The  use  of  e-mail  and  teleconferencing 
facilitates  communication  between  markets.  In  addition, 
real  -  time  communication  is  also  possible.  Using  the 
Internet  for  communication  removes  distance  and  time 
barriers.  This  method  reduces  the  need  to  be  so  aware  of 
time  zone  differences  and  variations  in  the  phone  and 
mail  systems  of  various  countries.  Using  the  Internet 
lessens  logistical  concerns  because  employees  do  not 
need  to  be  in  the  same  room  or  city  for  meetings. 
Listserver  or  group  computer-conferencing  software  is 
another  tool  that  can  improve  internal  and  external 
communications  by  helping  to  overcome  logistical 
concerns. 


4.3.  Collaboration  and  Development 

The  development  team  and  project  participants  often 
use  the  Internet  to  keep  in  touch,  and  to  exchange  data, 
programs,  and  working  papers  from  any  locations.  The 
Internet  also  allows  several  small  businesses  to  band 
together  much  more  easily  for  product  development.  In 
another  words,  formation  of  partnerships  among 
companies  is  increasingly  common,  and  the  Internet 
facilitates  this  collaboration  for  product  design,  vendor 
channels,  research,  and  development.  Collaborative 
approaches  have  been  greatly  enhanced  by  the  Internet 
with  its  wealth  of  information,  its  capacity  of  supporting 
telecommuting  and  time-shifted  communication. 

4.4.  Information  Resources  and  Intelligence 

With  more  than  20  million  (1998)  machines  connected 
to  the  Internet,  the  system  has  a  multitude  of  databases, 
Web  sites,  Usenet,  Gopher  sites,  FTP  (File  Transfer 
Protocol)  sites,  Listserver  discussion  lists,  and 
conferences,  the  amount  information  available  is 
staggering.  Scientific  and  research  data  is  available  in 
large  quantities.  Furthermore,  some  manufacturers  find 
that  the  Internet  is  useful  in  helping  employees  learn  new 
tasks  and  processes.  There  are  many  simulations, 
manuals,  training  aids,  and  tools  available  for  software 
running  on  a  variety  of  platforms,  from  UNIX  tutorials  to 
Windows  tips  and  hints.  There  are  also  large  quantities 
of  instructional  materials  available  on-line  regarding  the 
use  of  the  Internet.  Furthermore,  having  the  most  up-to- 
date  information  about  markets  and  the  state-of-the-art  in 
industiy  allows  companies  to  keep  or  increase  their 
competitive  edge.  Competitive  intelligence  can  be  gained 
due  to  access  to  information  on  products,  materials,  new 
ideas  and  even  the  status  quo  in  a  given  industiy.  What 
are  other  businesses  doing?  What  kinds  of  information 
are  available?  Who  are  the  main  competitors  in  a  specific 
business? 

4.5.  Customer  Support  and  Satisfaction 

One  of  the  prime  business  uses  of  the  Internet 
Commerce  is  in  the  area  of  customer  support.  In  this 
highly  competitive  and  dynamic  global  marketplace,  the 
company  that  can  reach  and  satisfy  customers  will  have 
an  advantage-  and  the  Internet  Commerce  can  help  in 
maintaining  relationships  with  customers.  With  its 
global  reach,  the  Internet  can  assist  business  in  locating 
new  suppliers  and  keeping  in  better  touch  with  them  to 
aid,  for  example,  in  zero  inventory  planning.  Customers 
can  reach  a  company  on  their  own  schedules,  24  hours  a 
day,  365  days  a  year  and  be  able  to  obtain  information 
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regardless  where  they  are.  Many  companies  maintain 
World  Wide  Web  sites,  Gophers,  and  FTP  sites  for 
customers  use  during  working  and  non-working  hours. 

4.6.  Vendor  Support  and  Networking 

The  Internet  provides  a  fast  method  for  networking 
with  vendors  and  suppliers,  increasing  speed  and  variety 
in  our  procurement  process.  With  its  global  tentacles,  the 
Internet  can  help  businesses  locate  new  suppliers  and 
keeping  better  touch  with  them.  In  addition,  small 
suppliers  can  network  with  and  compete  with  larger, 
better-known  suppliers.  Furthermore,  the  Internet  assists 
companies  in  maintaining  low  inventory  levels  because 
of  speed  of  communications.  Relationships  with  vendors 
and  outlets  can  also  be  maintained  via  the  Internet. 
Companies  can  arrange  product  delivery  through  the 
Internet,  where  companies  can  establish  and  support 
actual  distribution  channels. 

5.  Conclusions 

It  is  a  fact  that  that  Internet-Based  E-Commerce  is  still 
in  its  early  stages  but  as  we  discussed  early,  it  is 
becoming  one  of  the  most  important  strategic  tools  for 
manufacturing  management.  Businesses  of  all  types  and 
I  sizes  can  also  find  that  the  Internet  serve  a  large  variety 
of  their  needs  as  a  strategic  tool  to  gain  competitive 
advantage  over  their  rivals,  including  marketing, 
customer  and  vendor  support,  the  exchange  of 
information,  and  joint  ventures  for  research  and 
development. 

With  the  aid  of  the  Internet-based  E-Commerce, 
companies  also  can  develop  new  products,  communicate 
in  real  time,  take  orders,  receive  electronic  publications 
and  documents,  and  retrieve  data  from  specialty 
databases  (Gide  &  Soliman,  1998).  Businesses  can  find 
technical  advice,  establish  and  maintain  business 
relationships,  obtain  market  intelligence,  provide 
electronic  products  directly.  The  Internet  will 
undoubtedly  be  one  of  the  most  important  ways  of  buying 
and  selling  goods  and  services  soon. 
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Abstract 

The  paper  presents  and  demonstrates  some  ideas  and 
results  related  to  the  application  of  ICT  in  machines  and 
machine  systems.  The  research  problems  presented  and 
the  early  research  results  reviewed  are  based  on  a 
technology  programme  " SMART "  which  combines  the 
mechatronics  research  efforts  of  several  universities, 
research  centers  and  enterprises  in  Finland.  The  target  of 
the  programme  is  to  develop  information  technology 
intensive  product  concepts  and  design  tools  to  implement 
them.  The  main  driving  force  of  the  programme  is  to  help 
SME-sized  but  globally  active  companies  to  achieve 
competitive  advantage  through  effective  incorporation  of 
ICT  in  their  future  product  concepts. 

1.  Introduction 

This  paper  presents  and  demonstrates  some  challenges 
which  emerge  when  Information  and  Communication 
Technology  (ICT)  is  applied  in  machines  and  machine 
systems.  Traditionally,  mechatronics  is  regarded 
synonymous  to  robotics.  Besides  robotics,  however,  there 
are  numerous  potential  applications  for  mechatronics 
aspects  in  industries.  In  the  future  it  will  no  more  be 
appropriate  to  speak  about  mechatronic  machines  and 
conventional  machines  because  even  simple  hand  tools 
can  feature  active  and  smart  materials,  sensors  and 
displays.  All  machines  and  devices  will  be 
Mmechatronized"  to  some  extent. 

The  Finnish  Government  has  a  special  strategic  tool  to 
encourage  technological  development  and  risk-taking 
especially  in  SME's.  So-called  technology  programmes 
are  driven  by  Tekes  (Technology  Development  Center  in 
Finland).  The  idea  is  to  bring  together  SME  type 
industries,  universities  and  research  centers  to  work  on  a 
relevant  research  area  like  mechatronics.  A  technology 
programme  typically  contains  several  generic  projects 


which  generate  public  research  results.  The  results  are 
mainly  methods  and  tools,  to  some  extent  also  new  design 
concepts.  Besides  generic  projects,  a  programme  includes 
numerous  confidential  enterprise  projects.  An  enterprise 
project  consists  of  in-house  research  and  normally  also 
subcontracting  from  a  university  laboratory.  The  results  are 
typically  confidential.  The  volume  of  a  programme  may 
vary  from  some  2  MECU  up  to  100  MECU.  Financing  the 
programmes  is  a  joint  effort  of  Tekes,  industry,  universities 
and  research  centers. 

The  discussion  in  this  paper  is  mainly  based  on  a  four- 
year  technology  programme  "Smart  Machines  and  Systems 
2010"  ("SMART")  which  is  going  on  during  1997-2000 
with  a  budget  of  8  MECU.  The  aim  is  to  develop 
information  technology  intensive  product  concepts  and 
design  tools  to  implement  them.  Some  15  participating 
industrial  enterprises  develop  their  product  concepts  in 
confidential  company  projects.  The  goal  is  that  the  new 
product  concepts  provide  a  firm  base  for  company  R&D 
well  into  the  21st  century.  Simultaneously  public  research 
projects,  driven  by  some  10  university  and  research  center 
units,  will  provide  raw  material  for  the  confidential 
projects.  Some  research  topics  and  primary  results  from  the 
public  projects  are  reviewed  in  this  paper.  The  author  is 
responsible  for  the  coordination  of  this  programme. 

2.  Mechatronics:  Fusion  of  Technologies 

Three  major  technological  leaps  have  occurred  in  history: 
the  inventions  of  steam  power,  electric  power,  and 
information  power.  The  steam  engine  liberated  man  from 
the  limitations  of  muscle  power,  resulting  in  huge  progress 
in  manufacturing  industry  and  transportation.  The  invention 
of  electric  power  enabled  distributed  but  effective 
manufacturing  of  products  on  a  smaller  scale.  Information 
power  substituted  more  flexible  automation  for  the  former 
fixed  automation  in  processes  and  products.  Machines  can 
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take  over  routine  or  calculation-intensive  intellectual 
operations  to  some  extent. 

Conventional  industrial  products  like  process  valves, 
waste  water  pumps,  elevators,  paper  machine  systems, 
electric  generators  and  forest  harvesters  typically  feature 
fairly  mature  mechanical  and  electro-mechanical  design 
due  to  the  long  development  history  of  these  domains. 
However,  it  is  difficult  to  achieve  clear  competitive  edge 
with  traditional  technologies.  Exploitation  of  the  potential 
of  information  and  communication  technologies  will  give 
new  possibilities  to  enhance  traditional  products.  During 
the  last  ten  years  a  lot  of  effort  has  been  taken  worldwide 
to  enhance  product  concepts  on  the  basis  of  mechatronics 
principles.  Advanced  sensors,  more  effective  control 
algorithms,  increased  computing  power  and  increased  use 
of  telecommunication  technologies  have  led  to  better 
performance,  ease  of  maintenance  and  more  sophisticated 
user  interfaces  of  new  machine  generations.  Also  more 
effective  computer  aided  design  methods,  e.g.  well- 
trimmed  3D  design  software  and  rapid  prototyping 
technologies,  have  been  launched  to  facilitate  and  cut  the 
development  process  of  multitechnological  machines  and 
systems.  Despite  massive  research  efforts  there  is  still  big 
demand  in  industry  for  well-focused  basic  and  applied 
research  mechatronics  programmes. 

There  are  many  reasons  why  the  development  and 
design  of  mechatronic  machines  is  relatively  difficult 
compared  to  conventional  industrial  products: 

^  added-value  is  made  with  ICT,  and  the  necessary 
resources  are  under  a  very  tough  competition 
worldwide 

S  the  development  culture  is  multitechnological  and 
nowadays  even  multiscientific:  mechanical 

engineering,  electronics  hardware,  software,  system 
and  control  theory,  electrical  engineering,  hydraulics, 
and  human  sciences.  Traditionally  the  design  and 
manufacturing  cultures  have  been  quite  different  ip 
various  domains. 

^  the  development  of  the  core  technologies  in 
mechatronics  is  rapid 

^  the  integration  of  multitechnological  rapidly 

developing  technologies  to  systems  is  very 
challenging 

3.  Make  Dust  or  Eat  Dust 

Foreseeing  and  forecasting  the  future  is  not  very 
systematic  in  conventional  machine  workshops  and 
SME  s.  Companies  are  used  to  reacting  either  rapidly  or 
slowly  to  changes  in  their  environment.  Thinking  by 
alternative  scenarios,  control  navigation  signs  and  several 


operation  alternatives  is  the  proactive  way  to  face  the  future 
and  may  lead  to  a  more  agile  approach  to  new  opportunities 
and  threats.  Some  trends  in  ICT  are  fairly  easy  to  foresee 
and  do  not  require  any  special  attention.  Some  changes  like 
the  rapid  rise  of  Internet  and  its  novel  industrial  application, 
the  high  growth  rate  of  advanced  digital  wireless 
telecommunication,  or  the  challenge  of  Linux  to  Microsoft 
have  been  more  difficult  to  foresee.  The  incubation  time  of 
new  ICT's  is  normally  fairly  long:  one  may  think  that 
practically  all  the  important  ICT  inventions  which  will  be 
available  in  2010,  already  exist  in  some  university  or 
research  center  lab!  Alternative  scenarios  are  needed  to 
discover  these  seeds  and  to  generate  enough  agility  "to  be 
the  early  bird  to  catch  the  worm". 

A  generic  project  concentrating  on  the  methods  and  tools 
of  future  research  is  established  in  the  "SMART" 
programme.  The  goal  is  to  develop  relevant  alternative 
future  sceanarios  for  the  participating  industrial  companies 
to  give  background  for  their  product  concepts  and  process 
development.  Especially  the  application  potential  of 
information  and  communication  technologies  in  the  next 
15-20  years  is  the  key  aspect  in  the  scenarios.  It  is  very 
common  that  SME's  have  a  fairly  realistic  picture  of  the 
trends  in  their  core  businesses.  Outside  the  key  technologies 
the  enterprises’  understanding  of  the  prevailing  or  future 
technology  trends  may  be  poor.  Working  with  alternative 
scenarios  normally  opens  the  eyes  of  the  company 
management  and  the  R&D  people. 

4.  Modelling  of  Multitechnological  Systems 

Modelling,  simulation  and  virtual  prototyping  have  been 
hot  topics  in  mechatronics  research  from  its  beginning.  The 
reasons  are  easy  to  see:  there  is  an  increasing  need  to 
compress  the  lead  time  in  product  development.  With  more 
detailed  modelling  and  associated  simulation  of  systems  it 
is  possible  to  decrease  the  use  of  mock-ups,  physical 
models  and  prototypes.  Especially  physical  prototyping  is 
very  time  and  money  consuming.  The  vision  is  that  with 
sophisticated  computer  prototyping,  "virtual  prototyping", 
one  can  drop  out  some  iteration  stages  with  normal 
prototypes.  It  is  not  yet  realistic  to  expect  that  physical 
prototypes  are  totally  unnecessary.  However,  with  realistic 
computer  screen  prototypes  they  may  be  postponed  to  the 
final  meters  of  the  product  development  process.  In  the 
future  physical  prototypes  will  be  used  as  the  last  check  of 
the  validity  of  design  results,  compared  to  the  past  R&D 
culture  where  successive  physical  models,  mock-ups  and 
prototypes  were  needed  to  validate  even  the  early  stages  of 
design. 
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In  mechatronics  the  buildup  of  models  and  virtual 
prototypes  is  extremely  challenging.  Mechatronics  is  the 
art  of  integration,  and  in  modelling  this  means  the 
development  of  a  common  language,  design  interfaces  and 
design  tool  compatibility.  Non-compatible  design 
languages,  methods,  tools  and  software  prevail  in  the  main 
domains  like  mechanics,  electronics,  hydraulics,  controls 
etc.  Staff  for  the  various  domains  is  recruited  from 
different  technological  cultures,  because  strict  faculty 
borders  in  the  universities  and  other  education  institutes 
traditionally  separate  mechanical  engineering,  electrical 
engineering,  chemistry  and  so  on.  Accordingly,  the 
requirements  for  design  inputs  and  outputs  vary  depending 
on  the  design  domain.  Truly  concurrent  engineering  is  not 
possible  if  you  have  to  transfer  data  manually  from  one 
software  to  another,  or  modify  the  outputs  of  one  software 
in  order  to  prepare  it  for  the  input  of  another.  The  vision 
should  be  that  you  have  a  common  digital  product  model 
and  all  the  design  domains  have  access  to  the  model. 
During  the  design  process  all  the  domains  contribute  to 
the  model  in  real-time.  In  simple  one-technology  products 
this  procedure  is  already  industrial  routine  but  in  most 
mechatronics  machines  and  machine  systems  just  a 
remote  vision. 

In  the  SMART  programme  the  generic  problem  of 
submodel  integration  to  the  system  model  is  approached 
through  simplified  multitechnological  pilot  cases.  For 
example  combined  electromagnetic  and  mechanical 
modelling  and  simulation  are  demonstrated  with  an 
electric  generator  set  in  the  "MAGEN"  project.  Integrated 
electro-hydraulic,  control  systems  and  mechanical 
modelling  and  simulation  are  tested  with  a  paper  machine 
variable  crown  roll  in  the  "MATEL"  project. 
Mechatronics  modelling  of  an  elevator  landing  door  for 
condition  monitoring  purposes  is  studied  in  the  "MAHIS" 
project.  Especially  the  interaction  of  mechanics,  sensors 
and  DSP  is  in  the  focus  of  this  project.  The  basic  research 
idea  is  to  learn  a  lot  from  these  pilot  cases  and  then 
formulate  better  generic  design  methods,  rules,  tools  and 
data  exchange  formats  for  industrial  R&D  applications  in 
order  to  approach  “the  common  design  language"  of  all 
design  domains.  A  brief  description  of  the  projects 
follows.  There  are  also  other  subprojects  which 
concentrate  for  example  on  vibro-acoustic  modelling  and 
on  modelling  of  wear  of  some  critical  mechanical 
components. 

The  major  challenge  and  risk  for  this  project  group  is 
the  item  of  generalization.  It  is  not  an  easy  task  to  draw 
generic  conclusions  of  the  very  specific  case  studies  under 
different  technology  domains.  However,  the  first  step  to 
the  correct  direction  is  always  the  most  important  one 
because  there  is  a  lot  of  effort  and  learning  behind  it. 


4.1.  Case  MAGEN 

Interaction  of  electro-magnetic  and  mechanical  structure 
of  a  generator  is  modelled  using  the  basic  parameters  of  a 
commercial  generator  set.  I-DEAS  is  used  as  a  mechanical 
design  software  and  an  application-specific  university 
software  (developed  at  Helsinki  University  of  Technology, 
Laboratory  of  Eleotromechanics)  is  used  for  electro¬ 
magnetic  field  modelling.  General  understanding  of  the 
prevailing  phenomena  is  already  achieved  and  an  interface 
for  relevant  softwares  will  be  built.  This  interface  will  be 
one  building  block  in  the  long  line  where  modelling  blocks 
for  a  mechatronics  system  are  developed  in  the  long  run. 
Some  general  conclusions  will  be  drawn  about  the 
integration  of  two  important  technology  domains  in  electro¬ 
mechanical  design. 

4.2.  Case  MATEL 

The  interaction  between  a  hydraulic  system  and  a 
complicated  mechanical  system  with  hybrid  sliding  bearing 
shoes  is  studied.  The  industrial  pilot  is  a  paper  machine 
variable  crown  roll  which  actually  is  a  massive  sliding 
bearing  system  with  remarkable  oil  flow  and  associated 
massive  static  and  dynamic  forces.  The  oil  flow  in  the 
bearing  and  the  positions  of  the  bearing  shoes  are  defined 
by  complicated  hydro-mechanical  phenomena. 
Understanding  of  these  phenomena  and  the  bearing  film 
formation  is  increased  and  the  interaction  of  hydraulic  and 
mechanical  forces  can  be  modelled  with  some  precision. 
The  results  of  this  case  study  will  have  certain  generic 
implications  to  other  hydro-mechanical  modelling  and 
simulation,  too. 

4.3.  Case  MAHIS 

The  industrial  case  is  a  normal  commericial  elevator  and 
especially  its  landing  door.  Modelling  of  the  door  is  an 
iterative  process  from  a  simple  kinematic  model  to  a 
multitechnological  complicated  model  including  flexible 
parts,  nonlinearities,  actuators  and  the  control  system. 
Modelling  is  performed  with  commercial  software  (I- 
DEAS  and  ADAMS)  and  verified  by  laboratory 
measurements  with  a  test  door.  The  final  goal  is  to  develop 
a  model  which  can  be  used  for  diagnostics  and  maintenance 
purposes.  The  model  and  the  associated  simulation  results 
are  used  as  a  reference  for  normal  operation  activities.  For 
example,  opening  the  landing  door  will  cause  remarkable 
vibration  in  some  parts  of  the  door.  A  signal  from  a  sensor 
fixed  on  the  vibrating  area  can  be  analyzed  and  processed. 
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When  a  fault  situation  appears,  a  difference  between  the 
sensor  signal  and  the  modelled  signal  can  be  seen. 
Successful  exploitation  of  this  idea  requires  a  huge 
amount  of  historical  data  from  the  field  to  determine  the 
tolerances 'and  normal  noise  of  the  relevant  signals. 

The  special  challenges  in  this  application  are  e.g.  sheet 
steel  structures  with  large  tolerances  and  flexible 
elements.  Also  the  environment  of  an  elevator  is  non¬ 
typical  compared  to  most  machines  because  two  identical 
environments  do  not  exist  in  practice. 

5.  Advanced  user  interfaces 

User  interfaces  (UI's)  are  one  of  the  most  potential 
items  to  apply  ICT  on  machines  and  systems.  Limitations 
in  current  ULs  are  due  to  two  reasons.  First,  with 
traditional  technology  it  was  possible  just  to  design  user 
interfaces  based  on  buttons,  levers  and  gauges.  It  was  not 
possible  to  integrate  any  intelligence  from  the  user’s  point 
of  view.  Second,  engineering  people  did  not  have  the 
methods  and  tools  to  deal  with  the  user’s  values, 
preferences  and  limitations.  A  remote  controller  of  a  TV 
set  is  a  typical  example  of  unsatisfactory  UI  design:  all 
functions  are  packed  in  the  same  way  in  the  controller 
without  any  preferences  for  the  commonly  used 
operations,  the  signs  are  too  small  especially  for  a  dark 
room,  etc. 

With  ICT,  you  can  enhance  the  interfaces  and  take  the 
user’s  viewpoints  better  into  account.  Communication 
technologies  like  Internet  applications  can  effectively  be 
used  for  all  kinds  of  "remote"  interfaces.  Pilot  cases  like  a 
welding  generator  user  interface,  process  valve 
maintenance  interface,  elevator  user  interfaces,  a  gasoline 
station  delivery  pump  user  interface  etc.  have  been 
developed  simultaneously  with  a  generic  research  project. 
Other  applications  like  a  forest  harvester’s  training 
simulator  are  being  developed. 

There  are  two  challenging  research  problems  in  this 
field.  First,  how  to  translate  the  user’s  values  and 
expectations  into  the  technical  language.  Second,  what 
will  the  new  UI  technologies  be  and  how  are  they  applied 
in  machine  environments. 

6.  Dependability  of  Software  and  Electronics 
Hardware 

In  other  public  projects  dependability  and  reliability 
design  methods  for  machine  electronics  hardware  are 
developed.  Simple  but  systematic  methods  for  producing, 
testing  and  documenting  real-time  software  effectively  are 
also  studied.  Especially  new  electronics  design  and  testing 


methods  and  tools  are  demonstrated  in  industrial  pilot  cases 
like  forest  harvesters  and  stone  crushers.  In  these  cases  and 
applications  the  reliability  requirements  for  electronics  and 
also  for  software  are  much  tougher  than  those  typical  of 
most  automotive  applications.  To  create  product  concepts 
with  extensive  electronics  hardware  and  software,  still 
maintaining  good  reliability  and  functionality,  requires  a 
new  culture  in  machine  workshops.  The  "SMART"  projects 
help  industrial  enterprises  to  create  new  design  and 
manufacturing  environments. 

7.  Conclusion 

Extensive  use  of  ICT  is  a  potential  but  difficult  way  to 
enhance  the  competitive  edge  of  conventional  machines  and 
machine  systems.  Mastering  the  multitechnological 
development  culture  of  mechatronic  products  is  a  big 
challenge  especially  to  SME's.  Active  research  is  a 
powerful  tool  to  convert  new  ideas  and  concepts  into 
engineering  practice.  Coordinated  research  efforts  can  be 
encouraged  by  the  government  and  achieved  through 
technology  programmes.  A  lot  of  research  power  can  be 
concentrated  on  research  focuses  and  the  cost  can  be  shared 
between  several  companies  instead  of  doing  redundant 
research.  All  research  problems  cannot  be  solved  during  a 
4-year  programme  but  many  new  initiatives  can  be  taken. 

The  Internet- sites  of  the  programme  are: 

http://www.machina.hut.fi/smart/ 

Keywords:  mechatronics,  smart  machines,  machine 
intelligence 
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Abstract 

The  authors  present  an  ATE  system  designed  for 
teaching  practices  about  programmable  electronic 
instrumentation,  by  means  of  GPIB  and  VXI  instruments 
connected  to  a  local  area  network  (LAN).  The  aim  of  this 
system  intended  for  students  of  electronic  engineering,  is 
the  introduction  of  more  relevant  technologies  involved  in 
hardware  instrumentation  for  ATE  applications,  besides 
the  handling  of  their  software  environments  for 
developing  programs. 

1.-  Introduction. 

Teaching  electronic  instrumentation  for  engineering 
students,  has  a  great  impact  about  their  basic  training  on 
technologies  oriented  to  design  systems  for:  process 
control,  product  verification,  services  operation,  quality 
analysis,  etc.,  over  all  economic  sectors.  The  growing 
advance  in  microelectronics  given  its  continuos  reduction 
in  cost  and  increasing  processing  power  and  lower  size, 
with  higher  performance  of  software  packages,  have 
boosted  the  presence  of  powerful  systems  for  automated 
test  equipment  (ATE),  based  on  programmable 
instrumentation  [1]. 

From  1965  when  HP  Company  introduced  its  HPIB 
bus,  rack-and- stack  programmable  instruments  were  “de 
facto”  standardized,  until  1975  when  was  reached  their 
“de  jure”  standard  called  IEEE  488,  also  popularly  named 
GPIB  (General  Purpose  Interface  Bus).  The  limitation  of 
this  standard  to  bus  physical  connection  (hardware) 
generated  in  1987  an  extension  of  its  first  version,  now 
called  IEEE  488.1,  in  order  to  ease  instruments 
programming  (software)  by  means  the  standardization  of: 
message  interchange  protocols,  data  formats  and  syntax, 
state  reports  and  general  commands  for  diverse  kind  of 
instruments.  This  new  version  named  IEEE  488.2  was 
unable  for  designing  ATE  systems  entirely  compatible, 
and  for  this  reason  in  1990  the  SCPI  (Standard  Commands 
for  Programmable  Instruments)  standard  defined  a  single 
conceptual  model  for  designing  programmable 


instruments.  Simultaneously  with  advances  in  designing 
software  for  compatible  ATE  systems,  efforts  were  made 
for  increasing  the  processing  power  of  GPIB  instruments 
upgrading  the  byte  rate  of  bus  transfers  (less  than  1 
MBps),  until  1987  that  a  group  of  electronic 
instrumentation  manufacturers  introduced  the  VXI 
architecture,  which  has  modular  instruments  over 
electronic  cards  that  plug  in  a  chassis  (mainframe),  able  to 
reach  a  byte  rate  of  40  MBps  [2]. 

In  90 's  decade  programmable  instrumentation  based  on 
GPIB  and  VXI  buses  has  reached  a  great  spread,  allowing 
ATE  systems  design  with  multiple  computer  platforms 
and  several  operating  systems,  creating  the  “virtual 
instrument”  concept.  Currently  the  advance  impulse  for 
GPIB  performances  and  new  architectures  for 
programmable  instruments  is  sustained  from 
manufactures,  users  and  researchers.  For  example, 
National  Instruments  manufacturer  introduced 
respectively  the  HS488  specification  in  1993  oriented  to 
reach  a  maximum  byte  rate  of  8  MBps  in  GPIB.  and  the 
PXI  (PCI  extensions  for  Instrumentation)  architecture  in 
1997,  based  on  Compact  PCI  [3] [4],  for  modular 
instruments  over  electronic  cards. 

2.-  ATE  system  description. 

In  their  digital  communications  laboratory  the  authors 
have  designed  a  programmable  instrumentation  system 
(SIP)  based  on  IEEE  488  (GPIB)  and  VXI  (VME 
extensions  for  Instrumentation)  buses,  where  the  GPIB 
bus  connects  all  programmable  instruments  to  laboratory’s 
PCs  by  means  a  LAN  [5].  The  interface  between  LAN  and 
GPIB  bus  is  made  with  the  Gateway  LAN  HP -IB  HP 
E2050  [6].  The  VXI  bus  [2][7][8]  is  configured  in  a 
mainframe  for  instrument  modules  in  size  C  cards,  and 
connected  to  the  LAN  through  the  GPIB  bus,  being  its 
software  development  tool  the  environment  VEE  4.0  from 
HP  [9][10][1 1].  The  figure  1  shows  the  ATE  system 
configuration  (SIP). 
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Figure  1.  ATE  system  configuration  (SIP)  of 
programmable  instrumentation  laboratory. 


The  gateway  connects  lab’s  PC  network  to  GPIB  bus 
following  client/server  model  [2],  where  each  PC 
performs  as  a  client  and  the  gateway  as  server  [6][12][13]. 
In  this  way  applications  running  in  clients  can 
communicate  with  instruments  based  in  GPIB  bus  in  a 
transparent  mode  through  LAN.  This  allows  the  access  of 
several  students  to  programmable  instruments  and  sharing 
in  this  way  the  SIP  resources.  The  gateway  can  be 
installed  in  any  place  of  network,  depending  on 
instruments  location  and  GPIB's  cable  length.  Instrument 
identification  is  reduced  to  a  simple  logical  address. 

Table  1  summarizes  the  features  of  equipments  that 
conform  the  SIP:  two  GPIB  rack-and-stack  instruments 
(DC  power  supply  [14]  and  multimeter  [15]),  a  VXI 
mainframe  [3][7][16]  with  eight  connected  instruments 
([14]  [15]  and  [17]  to  [25]),  the  gateway,  and  a  PC  with 
WATCH__GPIB  software  developed  by  the  authors  that 
performs  the  bus  monitor  function  (paragraph  4). 

Figure  2  depicts  the  software/firmware  architecture  of 
SIP  where  each  client  computer,  under  operating  system 
Windows  95/NT  and  with  VEE  tool,  manages  the 
intended  programs  to  control  instruments  (SICL:  Standard 
Instrument  Control  Library)  and  LAN  access  (TCP/IP). 
The  gateway  has  the  network  server  software  and 
adequate  firmware  for  implementing  the  interface  between 
instruments  and  LAN.  The  rack-and-stack  GPIB 
instruments  include  the  firmware  with  bus  drivers. 
Command  module  (slot  0  of  chassis)  from  VXI  mainframe 
[17]  [22]  incorporates  the  needed  drivers  for  VXI/GPIB 
interface.  The  PC  Bus  Monitor  only  takes  the  signals  from 
GPIB  bus  lines  for  displaying  them  on  the  screen 
(paragraph  4). 

3.-  Development  of  training  program. 

The  growing  technological  development  in  the 
programmable  instrumentation  field,  has  allowed  to 


authors  the  implementation  of  applied  research  projects 
with  this  kind  of  instruments  [1],  and  applications  oriented 
to  teaching  electronics.  In  this  way  one  of  largely 
applications  of  VXI  bus  has  been  the  automation  of 
measurement  and  testing  over  electronic  circuits 
implemented  by  the  students.  Simultaneously  have  been 
designed  practices  oriented  to  evaluation  and  analysis  of 
GPIB  bus  activity.  The  students  have  access  to  SIP 
through  PC's  connected  to  LAN,  with  the  VEE 
programming  tool  and  SICL  libraries  (Figure  2),  that 
respectively  allow  implementing  the  measurement  and 
testing  application  and  mastering  the  instruments.  Also  the 
students  can  see  the  commands  and  data  behavior  through 
GPIB  bus  by  means  a  PC  devoted  to  this  task  (Figure  1: 
Bus  Monitor). 


Table  1.  Features  of  equipments  that  conform  the 
SIP. 


Instrument 

Connection 

Characteristics 

HP  6622 A  DC  Power 
Supply 

GPIB 

2  Outputs  of  80  W,  at  7V@10  A,20V@4  A  or 
50V@2  A,20V@4A 

HP  6478A  Multimeter 

GPIB 

2&4  Wire  Q;  Vdc/ac;  3  A@  250V 

HP  E1401A  High 
Power  Mainframe 

VXI/GPIB 

C  Size,  13  slots,  connector  Pl/2 

HP  E1406A 

Command  Module 

VXI 

GP-IB,  RS-232,  slot  0;  2MB  Ram.  Message-based 
Controller. 

HP  E1428A 
Oscilloscope 

VXI 

2  channels,  lGsa/s  250  MHz 

HP  E1410A 

Multimeter 

vxr 

2  &  4  Wire  £2,  Vdc/ac,  True  RMS  from  20Hz  to 
1MHz 

HP  E1420B  Universal 
Counter 

VXI 

200  MHz  /  2ns;  9  digit  resolution. 

HP  E1445A  Function 
Generator 

VXI 

13  bit  resolution,  40  Msa/s;  256  Ksa  waveform 
segment  Memory. 

HP  E1490C  Reg. 

Based  Breadboard 

VXI 

16  bit  register-based  interface,  P2  conector. 

HPE1465A  16x16 
Relay  Matrix 

VXI 

16x16  two-wire;  1  A,  200V  signal  switching 

HP  E2050A  Gateway 

GPIB /LAN 

GPIB  Instrumentation  via  LAN;  SICL/VISA 
support. 

GPIB  Bus  Monitor 

Parallel  I/O 

486  PC,  RAM  16  MB,  I/O  port. 

The  VEE  (Visual  Engineering  Environment)  tool  is  a 
graphic  programming  language  particularly  designed  for 
driving  programmable  instruments.  When  the  instruments 
are  connected  to  GPIB  bus,  the  student  send  then 
commands  and  data  following  the  training  program  by 
means  a  LAN’s  PC.  That  PC  will  receive  the  replay  and 
will  perform  data  analyzing,  displaying  graphic  results  or 
storing  data  for  a  late  processing. 

For  easing  the  control  of  programmable  instruments  in 
the  development  of  ATE  systems,  usually  are  selected 
high  level  programming  languages  such  as  C,  C++,  Visual 
Basic,  and  so  on.  In  this  application  is  used  the  VEE 
programming  language  that  allows  simplifying  the  task  of 
interface  design,  data  acquisition,  processing  and 
displaying  of  results. 

There  are  three  control  types  of  instruments  in  VEE 
[26][27][28]:  Direct  I/O,  Drivers,  and  PC  Plug-in  I/O, 
which  main  features  are  shown  in  table  2.  Laboratory’s 
practices  have  been  designed  in  order  to  introduce  the 
students  easily  about  the  three  control  types  and  adequate 
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them  to  application  needs  and  available  tools.  In  this  way 
is  justified  this  application  does  not  use  the  PC  Plug-in  I/O 
alternative  given  it  needs  a  particular  card 
(VXIplug&play)  for  implementing  the  interface,  on  the 
contrary  to  the  cases  of  Direct  I/O  and  Drivers.  Although 
Direct  I/O  requires  the  knowledge  of  SICL's  instructions, 
nevertheless  provide  a  high  communication  speed  and 
does  not  need  the  instrument  driver,  being  for  that  the 
control  type  more  frequently  used  in  advanced  practices. 
Otherwise  VEE  gives  a  new  object  called  Multidevice 
Direct  I/O  that  gets  the  control  of  several  instruments 
through  the  edition  of  only  one  object.  The  controllers  can 
be  the  control  panel  of  the  instrument  (Panel  Driver)  or  a 
part  of  controller  (Component  Driver).  In  this  application 
the  use  of  instrument  control  tools  is  oriented  to  teaching, 
and  handling  control  panels  is  very  important  given  that 
students  learn  to  operate  the  whole  choices  of  each 
instrument  easily.  When  there  is  no  enough  knowledge 
about  SICL's  instructions  or  choices  of  an  instrument,  the 
Component  Driver  is  the  best  solution  because  provides 
higher  speed  and  only  send  and  receive  the  needed  data 
for  implementing  a  particular  task.  Also  this  object  has  a 
list  of  instructions  and  information  for  supporting  the 
learning  about  choices  of  each  instrument. 


HP  E2Q50  GPIB  Instruments  VXJ  Mainframe  GPEB  Bui  Monitor 


LAN 


Figure  2.  Software/firmware  architecture  of  SIP. 
4-  GPIB  bus  monitor. 


Table  2.  Comparison  between  control  types  of 
instruments. 


VEE 

Objects 

Access  to  Instruments 

Advantages 

Supporting 

Interfaces 

Direct  I/O 

Direct  communication  with  the 
instrument 

High  communication  speed.  Allows  to 
control  anyone  instrument 

GPIB.  Serial, 
VXI  and 

LAN. 

Plug-in  I/O 

Requires  a  controller  of  the 
instruments  that  fits  into 
VXIPlug&play  standard 

High  speed  of  communication. 
Controllers  are  reusable  for  multiple 
application  programs 

GPIB  and 

VXI. 

■  Panel  Driver 
-  Component 
Driver 

Requires  a  controller 

Easy  of  using. 

Higher  communication  speed  with 
component  driver. 

GPIB  and  VXI 

The  GPIB  Bus  Monitor  (Figure  1)  designed  for 
teaching  the  IEEE  488  instrumentation  bus,  allows  to 
display  in  real  time,  and  without  lost  of  information  the 
signal  lines  that  conform  the  GPIB  bus.  Given  that  a 
device  only  can  access  to  the  bus  when  is  addressed,  the 
GPIB  Bus  Monitor  must  not  be  a  bus  device  in  order  to  be 
able  for  reading  its  state  continuously,  and  for  this  reason 
must  not  have  any  address,  therefore  does  not  exist  from 
the  controller  and  the  whole  bus  devices  point  of  view. 


■  4  MS-DOS  -  WAT CH_GPJB 


I  jgf  jgrjjg  A 


©000005242 

<  ATN 

UNLAG > 

< 

REN 

SRQ 

> 

0000005243 

<ATN 

LAG21> 

< 

REN 

SRQ 

> 

0000005244 

<ATN 

TAG16> 

< 

REN 

SRQ 

> 

10000005245 

<  1 

SPE  > 

< 

REN 

SRQ 

> 

10000005246 

< 

P  > 

< 

REN 

> 

10000005247 

<  ! 

SPD  > 

< 

REN 

> 

0000005246 

<ATN 

UNTAG> 

< 

REN 

> 

000000S249 

<  A  T  N 

TAG21> 

< 

REN 

> 

©000005250 

<  A  T  N 

UNLAG> 

< 

REN 

> 

10000005251 

<  A  T  N 

LAG16> 

< 

REN 

> 

0000005252 

< 

D  > 

< 

REN 

> 

0000005253 

< 

1  > 

< 

REN 

> 

0000005254 

< 

0  > 

< 

REN 

> 

10000005255 

< 

1  > 

< 

REN 

> 

Figure  3.  Screen  of  WATCH_GPIB  program. 


The  programmable  instrumentation  GPIB  bus  is  a  parallel 
connection  bus  in  which  all  devices  share  the  line  signals 
[3] [5] [29],  and  requires  the  existence  of  some  equipment 
for  mastering  those  signals.  Each  connected  device  can 
listen,  talk,  control  or  make  combinations  of  these  three 
functions: 

■  Listener:  This  kind  of  device  takes  data  and  command 
from  bus  once  addressed  in  this  operation  mode  from 
the  controller. 

■  Talker:  Such  as  device  send  data  through  the  bus  to 
active  listeners  once  addressed  in  this  operation  mode 
by  the  controller.  At  a  given  time  only  one  device  like 
this  can  be  active  on  the  bus. 

■  Controller:  Is  devoted  to  manage  the  bus,  send 
commands,  request  the  state  of  devices  and  controlling 
the  data  flow.  For  requesting  an  action  to  a  device,  the 
controller  must  put  previously  its  logical  address  on  the 
bus. 


The  GPIB  Bus  Monitor  has  been  implemented  by 
means  of  a  PC  under  MS-DOS  (Table  1).  The  GPIB 
interface  cards  able  to  connect  a  PC  to  the  bus  do  not 
allow  the  access  to  it  in  a  different  time  to  the  card  has 
been  addressed  by  the  controller,  and  therefore  are  not 
useful  for  this  purpose.  For  this  reason  is  used  an  I/O  Card 
adapted  for  reading  the  eight  data  lines  and  the  eight 
control  lines  of  GPIB  bus.  The  Bus  Monitor  runs  the 
WATCH_GPIB  program  that  allows  gathering  and 
displaying  in  real-time  the  available  data  on  GPIB  bus,  or 
displaying  stored  data  in  a  file  (Figure  3).  This  program 
loads  in  the  computer  memory  an  interrupt  control  routine 
and  associates  it  to  parallel  port  (IRQ7).  The  line  SRQ 
(Service  Request)  from  GPIB  bus  [5] [8]  is  used  to  fire  the 
interrupt  routine  and  read  the  data  from  bus.  For  avoiding 
the  possible  loss  of  data  because  of  GPIB  bus  higher 
speed  compared  with  the  reading  speed  of  a  conventional 
parallel  port,  the  interrupt  routine  of  WATCH_GPIB 
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program  stops  temporarily  bus  activity  until  reading  the 
suitable  data. 

Using  the  GPIB  Bus  Monitor  the  students  can  visualize 
the  commands  and  data  that  is  sending  through  the  bus  a 
particular  instrument  from  SIP  (Figure  1). 


a)  Flowchart  for  implementing  practices. 


Figure  4.  Flowcharts  for  implementing  and 
designing  laboratory  practices. 

5.-  Implementation  of  practices. 

The  practices  are  designed  in  such  a  way  that  students 
can  handle  the  development  environment  and  visualize  the 
GPIB  bus.  The  two  first  practices  are  oriented  to  introduce 
them  the  VEE  tool  and  explaining  SIP  configuration.  Both 
are  shown  in  auto-learning  form  and  introduce  step-by- 
step  the  method  of  using  the  VEE  environment,  and  the 
control  of  instruments  for  making  the  practices,  according 
the  flowchart  of  fig  u  re  4a . 

Following  practices  allow  to  students  developing  their 
own  programs  using  existing  tools,  and  at  any  time  they 
can  access  directly  to  the  SIP  for  connecting  their 
electronic  circuits  under  test.  For  example,  the  practice 
three  is  oriented  to  design  a  VEE  program  using  the 
needed  instruments  for  implementing  Bode's  diagrams 
(module  and  phase)  of  a  RC  circuit  frequency  response. 
Also  show  to  students  the  choices  to  identify  and  compare 
the  three  control  types  of  instruments  (Table  2),  and  is 
explained  the  way  VEE  objects  are  connected,  following 
respectively  the  flow  of  execution  and  sequence.  Is 
emphasized  the  control  of  instruments  (Figure  4b)  with  the 
aim  of  students  can  identify  correctly  the  working  tools. 
The  results  of  this  practice  there  are  depicted  in  figure  5. 
This  interaction  with  development  environment  allows  to 
students  a  better  use  of  graphic  language  and  handling 
powerful  mathematical  functions  provided  by  VEE. 

Other  example  is  practice  nine,  devoted  to  the 
automated  testing  of  electronic  boards,  in  which  the 


students  have  a  board  designed  for  teaching  electronic 
devices,  with  the  aim  of  designing  a  graphical  interface 
able  to  display  the  results  obtained  from  testing  the  whole 
components  of  board.  Basically  the  control  of  instruments 
is  made  given  input  signals  to  the  board  and,  supported  by 
HP  E 1465 A  relay  matrix  switch  module  [20], 
implementing  the  right  connection  of  instruments  for 
making  measurements  of  resistence,  AC/DC  voltages  and 
currents,  period,  frequency  and  polarity  test  of  diodes.  The 
results  are  shown  in  figure  6. 


Figure  5.  Instance  of  a  practice  in  VEE 


In  each  practice  is  optional  the  use  of  GPIB  Bus 
Monitor,  that  shows  continuously  the  bus  information, 
sharing  this  task  with  bus  monitoring  function  provided  by 
VEE,  that  is  an  useful  tool  when  SICL  instructions  are 
applied. 

6. -  Background  requirements. 

The  set  of  computer  assisted  practices  designed  for 
teaching  programmable  instruments  to  students,  requires 
from  them  the  following  previous  background: 

■  Circuits  theory. 

■  Basic  electronic  foundations  and  electronic  components. 

■  Experience  using  manual  electronic  instrumentation 
(Oscilloscope,  multimeter,  function  generator,  etc.). 

■  Communication  protocols  for  understanding  the  SIP 
configuration  which  have  been  trained. 

■  Performances  of  HP- VEE  working  environment. 

7. -  Results  and  future  developments. 

From  the  point  of  view  of  students  the  results  are: 

■  Accessing  to  high  cost  and  modem  programmable 
instrumentation  equipment. 
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■  Training  in  the  use  of  programmable  instrumentation, 
currently  unavoidable  in  a  growing  number  of 
enterprises. 

■  Knowledge  acquisition  about  a  largely  used 
instrumentation  bus. 

■Use  of  instrumentation  buses  like  general 
communication  resources  if  it  were  needed  (i.e.:  file 
transfers  between  computers). 

■  Displaying  an  instrumentation  bus  control  signals, 
besides  implementing  electronic  circuits  testing. 

■  Generalization  of  SIP  from  general  purpose  practices  to 
more  specific  ones  such  as  power  electronics,  engines 
control,  network  analysis,  etc. 

Future  developments  about  new  advanced  practices  and 

the  SIP  configuration  are: 

■  Using  a  VXI  register-based  breadboard  module  [25]  for 
designing  new  compatible  instruments  (i.e.:  A/D  and 
D/A  converters,  sensors  and  effectors  conditioners, 
etc.). 

■  Implementing  automatic  evaluation  of  student 
throughput  during  their  training  time. 

■  Including  the  new  PXI  bus  for  programmable 
instrumentation  (paragraph  1). 

■  Migrating  compatible  instrument  controllers  to 
VXIplug&play  standard  currently  in  growing 
acceptance,  and  using  VISA  (Virtual  Instrument 
Software  Architecture)  libraries  [2]  [3]  [30]. 

■  Remote  training  through  Internet  over  the  developed 
SIP.  Experiments  with  Mixteca  Technological 
University  (Oaxaca,  Mexico)  are  started. 


Figure  6.  Interface  of  automated  testing  program 
for  electronic  boards 


8.-  Conclusions. 


The  use  of  communications  networks  for  implementing 
laboratory  practices  based  on  programmable 


instrumentation,  allow  to  introduce  the  electronic 
engineering  students  in  designing,  configuring,  and 
developing  distributed  ATE  systems.  This  work 
introduces  a  programmable  instrumentation  system  (SIP) 
development  with  GPIB  and  VXI  instruments  connected 
to  PC's  by  means  of  a  LAN.  The  training  program  is 
started  with  the  introduction  to  knowledge  of  VEE 
programming  tool,  and  follows  implementing  test  and 
measurements  over  electronic  circuits  designed  by 
students.  Also  are  added  practices  for  evaluating  and 
analyzing  GPIB's  bus  signals. 

Finally  basic  knowledge  required  from  students  are 
defined  in  order  to  obtain  the  best  progress  from  practices, 
and  are  analyzed  the  obtained  results  from  the  point  of 
view  of  students.  Also  are  introduced  the  future 
developments  that  authors  are  involved  for  improving 
both  programmable  instrumentation  practices  and  ATE 
system  that  runs  them. 
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Abstract 

The  paper  presents  a  method  for  deriving  and  integration 
of  the  dynamic  equations  of  multibody  systems.  The 
method  is  used  in  teaching  mechanisms  for  the 
mechatronic  courses  taught  by  the  authors  at  bachelor 
and  master  levels.  The  differential  equations  are 
developed  for  a  generic  situation ,  therefore  they  may  be 
applicable  for  a  large  range  of  mechanisms  used  in 
mechatronic  systems  (vehicles,  suspensions,  robots  etc). 
The  input  parameters  of  the  equations  are  the  bodies  and 
kinematic  joint  positions,  the  type  of  each  joint,  the 
initial  values  of  the  independent  generalized  coordinates 
and  the  forces  values.  The  system  of  differential 
algebraic  equations  is  linear  in  terms  of  accelerations 
and  joint  reactions,  which  facilitate  the  numerical 
integration.  The  method  is  implemented  in  didactical 
software  using  a  graphic  interface  conceived  in  a  very 
popular  geometric  modeling  software  -  AutoCAD. 


1.  Introduction 

The  last  developments  in  the  field  of  Multibody 
Systems  theory  led  to  a  large  number  of  commercial 
software  packages  available  both  for  research  & 
development  and  teaching  purposes.  Most  popular  of 
these  software  (ADAMS,  DADS,  DYMES,  etc)  [1,  2,  3, 
9]  are  currently  providing  a  very  large  range  of  facilities 
for  various  purposes  which  make  them  sometime 
unreachable  for  teaching  purposes  both  from  the  price 
viewpoint  and  complexity.  For  this  reason,  in  many 
cases,  teams  within  the  Universities  developed  their  own 
teaching  tools  better  suited  according  to  their  concrete 
theoretical  approach  as  well  as  the  course  curriculum  and 
syllabus  of  the  respective  course  [1,  5,  7]. 

In  this  paper  a  teaching  method  for  deriving  and 
integration  of  the  dynamic  equations  of  multibody 
systems  is  proposed  by  the  authors  as  well  as  a  software 
for  practical  application  sessions  with  students.  The 
teaching  is  starting  from  assumption  that  the  students 
have  the  basic  knowledge  in  the  mechanics  of  the  rigid 
body  in  the  tri-dimensional  space.  This  is  the  case  after 
the  second  study  year  for  most  Romanian  technical 
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faculties  providing  courses  in  mechanical  and 
mechatronics  fields.  In  this  paper  the  authors  present  the 
main  theoretical  aspects  as  well  as  the  software 
elaborated  to  support  the  teaching 

2.  Systems  of  independent  bodies 

The  systems  of  independent  bodies  are  defined  as  a 
collection  of  rigid  bodies  moving  in  the  3D  space  without 
having  any  link  between  each  other.  For  this  kind  of 
systems,  analyzing  the  motion  is  simply  analyzing  the 
motion  of  each  body  separately  with  the  classical  theory7 
of  the  single  rigid  body  motion. 

2.1.  Kinematics 

The  spatial  position  of  body  is  usually  given  by  a 
number  of  coordinates  usually  called  generalized 
coordinates  and  generically  symbolized  as  qi,  q2,  ...  etc. 
Even  various  possibilities  to  give  the  position  by  various 
number  of  coordinates  are  often  used  [3],  for  the  teaching 
purposes,  the  set  of  six  independent  coordinates  are 
preferred.  The  most  intuitive  one  includes  the  three 
coordinates  of  the  body  associated  reference  frame  (x,  y, 
z)  and  the  orientation  of  the  same  frame  (cpi,  (p2,  (p3>.  with 
respect  to  a  global  reference  frame  given  by  Euler  or 
Bryant  angles  [3].  In  this  case,  to  know  the  body  motion 


means  knowing  the  functions: 

qk(t)  =  0,  with  k  =  1...6.  (1) 

In  a  matrix  form,  this  can  be  written  as 
[q(t)]  -  0  (2) 

Given  the  motion  laws  (2)  one  can  obtain  the  velocity 
and  acceleration  laws  by  simply  differentiating  them  with 
respect  to  time  once  and  twice  respectively: 

[j][q]=[x(q,t>],  (3) 

[J][<i]=[H'(iq,  t)]  ,  (4) 


where  [J]  is  the  Jacobian  6*6  matrix  including  the 
coefficients  of  velocity  and  accelerations  obtained  in  the 
differentiation  process.  All  the  terms  not  including  the 
unknown  velocities  and  accelerations  are  grouped  in  two 
column  matrices  [x]  and  [\j /]  the  six  elements  of  which 
are  depending  only  on  previously  determined  parameters. 
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The  equations  (3)  and  (4)  are  linear  in  terms  of 
velocity  and  accelerations,  thus  easy  to  solve. 

For  the  case  of  the  independent  body  systems,  the 
equations  (2),  (3),  (4),  remain  formally  valid.  The  only 
difference  consists  in  the  form  of  the  matrices  which 
must  include  the  generalized  coordinates  of  all  bodies. 
Thus,  for  a  system  of  nb  independent  bodies,  k  =  l,2..nb 
in  the  equation  (1). 

2.2.  Dynamics 

In  the  dynamic  analysis,  the  number  of  the  given 
equations  (iu)  in  the  system  (1)  is  less  than  the  total 
number  of  unknowns  generalized  coordinates  (6nt,).  In 
this  case  we  say  that  the  system  have  the  degree  of 
freedom: 

L  =  6nb-nc  >  0.  (5) 

that  is  the  system  is  kinematically  undetermined  because 
some  of  the  generalized  coordinates  have  not  a  known 
motion  equation.  The  column  matrix  of  the  known 
coordinates  will  be  symbolized  as  [qd  as  the  independent 
coordinates  are  included  in  the  vector  [qj  with  L 
elements.  As  a  consequence,  the  motion  corresponding  to 
the  L  independent  generalized  coordinates  is  the  result  of 
the  external  and  internal  forces  action: 

Mfa.MQeJ  =>  [qI]=H'1[QOI],  (6) 

By  simply  integrating  these  accelerations  it  became 
easy  to  determine  the  velocities  and  positions  of  the 
bodies.  Together  with  the  initially  given  equations  of 
fqo],  the  situation  became  quite  similar  with  that 
presented  at  the  point  2.1. 

3.  Multibody  Systems  with  kinematic  joints 

A  more  complex  case  of  multibody  systems  is  that  in 
which  various  type  of  links  and  joints  relate  the  bodies  to 
each  other.  These  links  influence  the  degree  of  freedom 
of  the  system  and  the  kinematic  and  dynamic  equations 
as  well. 

3*1*  Kinematic  joints 

In  mechanical  and  mechatronic  systems  very  often  the 
multibody  systems  include  kinematic  joints  which 
introduce  relative  constraints  between  pairs  of  bodies.  In 
practice  a  large  number  of  joints  types  are  utilized  from 
die  construction  viewpoint.  A  systematization  of  the 
joints  can  be  made  taking  into  account  whether  they  are 
considered  in  plane  or  spatial  situations,  type  of  contact, 
number  of  relative  constraints  introduced  between  the 
two  adjacent  bodies  etc.  Most  of  the  joints  introduces 
geometric  relations  between  the  generalized  coordinates 
of  the  adjacent  bodies.  Some  type  of  joints  may  introduce 
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Figure  1.  Planar  kinematic  joints 

more  complex  restrictions  (holonomic  or  non-holonomic, 
rheonomic  or  scleronomic  etc)  but  these  categories  are 
beyond  the  scope  of  the  present  paper.  For  the  planar 
mechanisms  some  usual  joints  are  presented  in  fig.  1. 

From  the  mathematical  viewpoint  the  joints  can  be 
modeled  by  algebraic  equations  linking  the  generalized 
coordinates  of  the  adjacent  bodies.  These  equations  are 
expressing  the  geometric  restrictions  imposed  by  the  joint 
features.  We  are  going  to  establish  these  equations  for  the 
revolute  joint  case  (fig.2)  just  to  illustrate  the  matter. 

The  equations  are  derived  from  the  geometric  condition 
that  guaranties  the  coincidence  between  the  points  Pi  and 
Pj  belonging  to  the  two  bodies  respectively. 


Figure  2.  The  revolute  joint 
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that  guaranties  the  coincidence  between  the  points  Pj  and 
Pj  belonging  to  the  two  bodies  respectively. 

rp  ~  rpi  =  rpj  ? 

rPi=ri+r‘,  fpj=fj+r^, 

fj+r*  -fj-rji  =0.  (7) 

The  vector  equation  (7)  represents  the  geometric  model 
of  the  revolute  joint.  It  can  be  written  also  in  a  matrix 
form,  using  the  linear  algebra  theory: 

[d>ij]=[ri]+Ai[r‘]-[rj]-AJ[r^]=0;  (8) 

where  A  and  Aj  represents  the  transformation  matrices 
between  the  body  reference  frames  (i  and  respectively  j) 
and  the  global  reference  frame  Oxy. 

For  the  planar  case,  the  equation  (8)  include  two 
scalar  equations,  corresponding  to  the  two  relative 
translations  restricted  between  the  adjacent  bodies:  the 
translations  along  x  and  y  axis. 

In  the  general  case  of  a  spatial  multibody  system  with 
nb  bodies  interconnected  with  kinematic  joints  some  of 
the  initial  6n^  degrees  of  freedom  are  kinematically 
driven  like  in  the  case  of  the  independent  bodies  and 
other  are  restricted  by  the  kinematic  joints.  The 
remainder  degree  of  freedom  represents  the  degree  of 
freedom  of  the  system.  From  the  mathematical  viewpoint 
the  only  difference  between  the  driver  equations  and  the 
joint  equations  is  that  the  latter  are  generally  pure 
geometrical  (independent  with  respect  to  time)  and  have 
the  general  form: 

K(«ii.qj>]=0»  <9> 

as  shown  for  the  revolute  case  (8). 

Thus,  for  a  spatial  multibody  system  kinematically 
determined  L=0  and  the  number  of  the  driver  equations 
with  the  number  of  the  joint  equations  is  equal  to  6nb. 
With  the  only  formal  difference  given  by  (9)  which  came 
together  with  (2),  the  kinematic  equations  (2),  (3)  and 
(4)  remain  valid  and  are  numerically  solvable. 

3.2.  The  dynamic  equations 

Based  on  the  link  release  axiom  from  the  Newtonian 
mechanics,  one  can  replace  the  joints  with  corresponding 
forces  and  as  a  consequence  the  body  can  be  considered 
in  static  equilibrium.  Thus,  the  dynamic  equations  can  be 
written  based  on  the  d’Alambert  principle  stated  as 
following:  “The  direct  applied  forces,  the  reaction  forces 
and  the  inertial  forces  acting  on  a  body  are  in 
equilibrium”  [8],  that  is 

[Fexl  +  fRl  +  fF^O,  (10) 

where  for  the  planar  case  [Fex],  [R],  [Fi]  are  column 
matrices  with  3  elements,  including  the  torques. 

In  the  general  case,  these  forces  are  acting  in 
various  points  of  the  body  (fig.  3,a).  For  the  vector 


composition,  it  is  convenient  to  reduce  these  forces  with 
respect  to  the  global  reference  frame  axes  in  the  origin  of 
the  body  reference  frame  (fig.3,b).  The  reduced  forces  are 
called  generalized  forces  and  have  a  direct 
correspondence  with  the  generalised  coordinates,  as 
following: 


o}. 

Figure  3.  The  generalized  forces 
-  The  external  forces  reduced  in  the  point  Oi,  along 
the  directions  of  the  global  reference  frame  axes  are 
called  external  generalised  forces  and  are  symbolised  as 
[Qexl  • 


[Q«]= 


-  The  reaction  forces  from  the  kinematic  joints 
reduced  in  the  same  way  are  called  generalised  reactions 
and  are  symbolised  as  [XR]. 


[^RM-k  ]  “ 

i 

= 

[^R-k  ] 

2Z 

_AR-k  _ 

- 

.  AM-k  . 

-  In  the  case  the  origin  of  the  body  reference  frame 
is  not  coincident  with  the  mass  center,  the  inertia  forces 
have  also  to  be  reduced  in  the  same  way  and  are  called 
generalised  inertia  forces  (in  fig.  3,  for  simplicity  we 
considered  the  origin  coincident  with  the  mass  centre): 
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According  to  the  d'Alambert  principle,  the 
generalised  forces  and  reactions  are  in  equilibrium: 

[Fi-k]  +  [Qoi-k]+[\M-k]=0  (5) 

Making  the  necessary  substitutions  the  equation  (5) 
may  be  expressed  as 

[mk  Iflk  ]_[^-RM-k  ]=  [Qex-k  ]  ;  (6) 

unde 


mk 

0 

o' 

V 

0 

mk 

0 

and  [qk]  = 

yk 

0 

0 

h_ 

A. 

The  relation  (6)  represents  the  general  form  of  the 
motion  differential  equation  of  the  rigid  bod}'  k.  In  the 
case  of  multibody  systems  the  equation  (6)  is  written  for 
each  body,  thus  obtaining  the  system- 

link  Iqk  ]-  PWk  ] = [Qet-k  1  k  =  1 ..  Jtb.  (7) 

In  matrix  form  the  system  (7)  may  be  written  as: 

[mI<i]~  [Am  ]  =  [Qex  J  (8) 

where 


H= 


[m, 

0 

0 


0  0 

[m2]  0 

o  kj 


is  the  mass  matrix  of  the  system, 

[q]  -  the  column  matrix  of  the  generalised  accelerations, 
M  the  column  matrix  of  the  generalised  reactions  and 
[Qex]  is  the  vector  of  the  generalised  external  forces.  For 
the  spatial  case  the  vector  of  the  6nb  generalised  reactions 
can  be  expressed  function  on  only  c=6nb-L  unknown 
(corresponding  to  the  real  number  of  constraints  that  is 
reactions  introduced  by  the  kinematic  joints).  By  using  a 
convenient  matrix  A  (g  x  c). 
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wherefAi  ...  AJ  represents  c  unknown  coefficients,  which 
correct  the  arbitrary  chosen  values  from  the  matrix  [A], 
the  product  [A]  [A]  represents  the  generalised  reactions 
vector  (c  represents  the  number  of  the  geometric 
Constraints  introduced  by  the  joints). 

A  matrix  (g  x  c )  already  numerically  available  in 
this  stage  is  the  transposed  jacobian  matrix  [J]T.  In  this 
case,  the  relation  (9)  become 

M=[jF[4 

where  [X]  is  called  the  Lagrange  multipliers  vector* 

W=[44A] 

As  a  consequence  the  equation  (8)  lead  to 

[mjqj—  [jf  [a]  =  [Q^,  |  (10) 


and  represents  the  differential  equation  of  motion  of  the 
multibody  systems.  Together  with  the  kinematic 
equations  of  the  accelerations  it  forms  the  mixed  system 
of  differential-algebraic  equations  (DAE)  of  motion  for  a 
system  of  interconnected  bodies: 


\m=H 


(ID 


In  the  spatial  case  with  c=6nb-L  constraints,  the  system 
includes  as  unknowns:  3nb  generalised  accelerations  and 
c  Lagrange  multipliers. 

Observations 

1.  The  form  (11)  of  the  motion  equations  is  in 
general  valid  for  a  multibody  system  irespective  of  the 
bodies  and  joints  numbers,  forces  etc.  As  a  consequence 
for  automatically  formulate  the  equations  we  only  need 
algorithms  for  the  automatic  constitution  of  the  matrices 
M,  [J]>  [Qex]  and  [v|/],  first  dimensionally  and  then 
numerically,  starting  from  the  initial  data  of  the 
multibody  system. 

2.  Related  to  the  case  of  the  system  of  independent 
bodies,  characterised  by  the  equation 

[mlqHQexi 

the  system  (11)  include  furthermore  the  term  [J]T[A]  and 
the  differential  equations  of  the  joints.  It  become  clear 
that  by  replacing  the  joints  with  the  set  of  constraints 
equations  and  the  set  of  joint  reactions  a  kinematic  chain 
may  be  treated  in  the  same  way  as  system  of  independent 
bodies  (multibociy  system). 


3.3.  Numerical  integration  of  the  equations 


Even  in  some  simple  multibody  systems  analytical 
solutions  of  the  can  be  obtained  for  DAE,  in  most  cases 
numerical  procedures  have  to  be  implemented.  The 
solution  of  the  system  have  to  comply  with  the  initial 
conditions  which  form  the  initial  solution: 

[q(OHq0]  and 

The  initial  solution  have  to  comply  also  with  the 
constraints  equations: 

Mqi>q2,-..,qg,t)=0, 

yw-S 


If  the  jacobian  matrix  is  a  non-singular  matrix  then 
we  can  write  the  system  solution  as 


ran 

_[M  ['FT 

TkJ] 

M 

1W  oj 

LWJ 

from  which  the  scalar  solution  can  be  extracted 

qk  =fk(q1,q2-4»q1,q2-qg,tjlk=i....g.  (12) 
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In  this  way  the  mixed  DAE  (11)  was  reduced  to  a 
system  of  ordinary  second  order  differential  equations 
(12).  For  the  initial  position  (t  -  to),  the  left  side  of  the 
equation  (12)  depends  only  on  the  initial  data  of  the 
motion  (initial  position  and  initial  velocity)  given  as 
input  data  for  the  dynamic  analysis.  Thus 

fjco  =  (4l0>ft20  V— j^gO»QlO>Q2O***-?QgO?0”  — 

^KO  “ak  .  (13) 

Integrating  these  equations  for  with  respect  to  time 
and  with  time  intervals  sufficiently  small  one  can  obtain 
the  solutions: 

qk  =  akAt +  ck°iqk  =  -^-akAt2 +ckAt+dk  (14) 

where  ck  and  4  are  integration  constants  depending  on 
the  initial  conditions.  The  values  determined  with  (14) 
will  be  considered  initial  conditions  for  the  next  step  of 
integration  etc. 

This  method  of  integration  is  however  unstable 
because  the  constraint  equations  are  utilised  in  the 
differential  form  which  may  lose  geometric  information. 
Therefore  the  constraint  equations  have  to  be  introduced 
in  the  algebraic  form  in  the  system.  Since  the  correct 
solution  impose 

[<Kq)]=[<K4q)]=0  (15) 

the  two  terms  can  be  added  in  any  equation  without 
altering  the  equality.  Based  on  this  property  Baumgarte 
proposed  a  stabilisation  method  by  adding  the  terms  (15) 
in  the  DAE  system  with  the  weighting  coefficients  P  and 
y,  such  as: 
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This  form  of  the  equations  is  stable  and  may  be 
integrated  with  the  method  previously  indicated.  For  a 
good  convergence  the  user  have  to  make  a  proper  choice 
for  the  coefficients  p  and  y. 

4.  The  didactical  software 

Following  the  equations  derived  before  a  computer 
package  was  conceived  in  a  similar  manner  with  the 
commercial  packages  having  in  view  to  get  students 
familiar  with  this  kind  of  tools. 

The  software  called  AMEC  (Analysis  of  MECanisms) 
include  two  modules  for  the  three  major  stages  of  the 
multibody  systems  automatic  analysis:  a  pre&post- 
processing  module  and  a  processing  module. 

4.1.  The  pre&postprocessing  module 

This  module  is  dedicated  first  to  assist  the  user 
in  coding  the  input  data  into  a  format  and  syntax  that  can 


be  recognized  by  the  processing  module  and  second  to 
assist  the  user  in  interpreting  the  output  data  provided  by 
the  same  processing  module  also  in  recognizable  format. 
The  module  is  conceived  under  AutoCAD  software 
graphic  programming  language  AutoLISP,  in  order  to 
use  the  huge  popularity  of  the  AutoCAD  package  among 
students  in  Romania. 

To  this  aim,  a  simple  graphic  interface  was  added  to 
the  original  one  of  AutoCAD  (fig.  4)  in  order  to  include 
“user-friendly”  facilities  for  working  with  the  software 
entities  but  without  drastically  change  the  usual 
AutoCAD  interface.  In  this  way  the  students  feel  familiar 
with  the  software  from  the  very  beginning. 


Figure  4.  The  AMEC  graphic  interface  under 
AutoCAD 


Thus,  in  modelling  session,  the  multibody  system  is 
modeled  as  a  collection  of  solids  with  the  usual  facilities 
of  AutoCAD.  Then  the  multibody  system  data  are 
collected  both  automatically  from  the  solid  model  (the 
initial  position,  the  mass  properties  etc  -  fig.  5)  and 
manually  introduced  by  the  user  (the  forces,  joints, 
drivers  etc  -  fig.  6). 

In  the  next  stage  the  input  data  are 
automatically  coded  in  a  simple  ASCII  format  with  a 
syntax  which  easily  can  be  handled  afterwards  by  the 
students  for  debugging  the  model.  Thus  the  data  are 
structured  in  entities  called  body,  joint  etc.  For  example 
the  entity  body  i  include  all  information  regarding  that 
body,  joint  i  -  all  necessary  information  about  joint  i  etc. 
Then  the  data  are  saved  in  a  file  on  the  computer  disk 
with  the  correct  format  and  syntax  All  necessary 
commands  are  implemented  in  a  graphic  interface  with 
windows  easy  to  handle.  Furthermore  the  students 
receive  before  each  session  a  supporting  text  with  a 
tutorial  for  the  way  of  using  the  package. 
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4.2.  The  processing  module 

The  processing  module  is  conceved  in  C 
programming  language.  It  is  runned  separately  after  the 
pre-processing  opaeration.  The  operation  is  quite  simple, 
the  only  interaction  with  the  user  being  when  indicating 
the  name  of  the  input  data  file.  Then  it  automatically 
runs  and  save  result  files  which  may  be  converted  into 
graphics  or  animations  with  the  pre&post  processing 
module. 


Figure  5.  Pre-processing  body  data  window 
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Figure  6.  Pre-processing  joint  data  window 
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Abstract 

In  this  paper,  we  develop  a  parallel  inverted  pendulum 
system  that  has  the  characteristics  of  the  strongly  coupled 
dynamics  of  motion  by  an  elastic  spring,  the  time-variant 
system  parameters,  and  inherent  instability,  etc.  Hence,  it 
is  possible  to  approximate  some  kinds  of  a  physical  system 
into  this  representative  system  and  apply  the  various 
control  theories  in  order  to  verify  their  fidelity  and 
efficiency.  For  this  purpose,  an  experimental  system  of  the 
parallel  inverted  pendulum  is  implemented  in  practical, 
and  a  decoupling  control  scheme  using  the  eigenstructure 
assignment  is  presented  in  comparison  with  the  LQR 
optimal  control  method  for  hardware  validation.  Further 
more,  this  system  can  be  utilized  as  a  testbed  for  control 
theory  education  and  research  purpose  through  various 
setups. 


1.  Introduction 

In  general,  as  a  typical  mechanical  system,  an  inverted 
pendulum  system,  which  is  analogous  to  launching  control 
of  a  rocket  or  a  robot  system,  is  suitable  for  both  applying 
and  verifying  modem  multivariable  and  nonlinear  control 
theories.  Hence,  the  inverted  pendulum  is  commonly  used 
as  a  tutorial  problem  in  control  theory,  where  its  inherently 
unstable,  highly  nonlinear  dynamics  and  damping  effect  of 
mechanical  friction  provide  a  great  potential  for 
applications. 

So  far,  a  number  of  researchers  have  studied  the 
inverted  pendulum  for  the  application  of  the  various  control 
theories  to  the  various  kinds  of  configuration.  Control 
theories  such  as  LQR  method  [1],  neural  network  [2], 
variable-structure  control,  input-output  linearization  [3], 
i/x control  [4],  and  fuzzy  control  techniques  have  been 
applied  to  a  single  type,  double-series  type  [5],  and  triple¬ 
series  type  inverted  pendulum  [1]. 

However,  these  methods  are  applied  mostly  to  a  SISO 
inverted  pendulum  system  and  there  was  a  few  study  on 
dealing  with  MIMO  inverted  pendulum  systems  accounting 
for  dry  friction  of  actuators  and  robustness  with  respect  to 


the  perturbation  of  system  parameters  or  model 
uncertainties  as  well. 

In  this  paper,  we  present  a  parallel  inverted  pendulum 
system,  which  consists  of  two  identical  inverted 
pendulums,  connected  each  other  by  a  spring.  Hence,  this 
system  has  2  inputs  and  2  outputs  and  shows  a  dynamic 
behavior  of  strongly  coupled  motion,  which  is  commonly 
seen  in  MIMO  systems. 

Therefore,  this  system  allows  a  decoupling  control  that 
makes  each  inverted  pendulum  operate  independently  and  a 
decoupling  control  scheme  using  eigenstructure  assignment 
is  applied  in  hardware  experiments. 

It  is  a  distinguished  feature  of  this  system  that  the 
connecting  hinge  of  a  spring  between  both  pendulums  is 
designed  to  slide  along  the  rod  by  sliding  joints.  Hence,  this 
connecting  position  can  be  considered  as  a  time-variant 
parameter.  However,  the  time- variant  parameter,  which  is 
difficult  to  describe  explicitly,  can  be  considered  as  the 
parameter  variation  or  model  uncertainty.  Therefore,  this 
system  allows  robust  control  strategy,  so  that  the  effect  of 
the  parameter  variation  or  model  uncertainty  and  their 
robustness  can  be  carefully  examined  through  the 
experimental  system  hereafter. 

In  addition,  this  system  is  designed  that  the  mass  of  the 
pendulum  and  the  stiffness  of  a  spring  can  be  easily 
reconfigured  to  see  the  effects  of  system  parameters  on  the 
control  performance  for  educational  purposes. 

Finally,  we  develop  the  experimental  system  of  the 
parallel  inverted  pendulum  so  as  to  examine  several 
suitable  control  strategy  for  the  parallel  inverted  pendulum 
system,  and  then  the  experimental  apparatus  is  utilized  as  a 
testbed  for  applications  of  various  control  theories  and 
development  of  new  control  algorithms. 

2.  Configuration  and  modeling 

This  section  presents  the  configuration  of  the  parallel 
inverted  pendulum  system  and  its  mathematical  modeling. 
In  addition,  we  examine  the  several  important  features  of 
the  system. 

In  this  system,  two  identical  inverted  pendulums  directly 
mounted  on  the  motor  shafts  in  parallel  are  connected  by  an 
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elastic  spring.  Hence,  the  motion  of  each  inverted 
pendulum  is  strongly  coupled  with  each  other  and  its 
control  is  more  difficult  due  to  the  increased  instability  and 
control  inputs.  Therefore,  this  system  allows  for  a 
decoupling  control  that  has  received  considerable 
attentions  in  multivariable  control  in  recent  years. 

As  the  most  unique  and  important  feature  of  this  parallel 
inverted  pendulum  system,  we  let  the  connecting  position 
of  a  spring  is  not  fixed  but  varying  in  time  using  the  joint 
sliding  freely  along  the  rod  during  the  actual  operation. 

Since  this  time-variant  parameter  is  difficult  to 
mathematically  describe,  it  can  be  considered  as  the 
parameter  variation  or  model  uncertainty,  and  thus  robust 
control  schemes  are  allowed  in  this  system  as  well. 

Also,  we  design  that  the  tip  mass  of  each  pendulum  and 
the  stiffness  of  the  spring  can  be  easily  reconfigured,  so  that 
the  effects  of  the  system  parameter  on  the  performance  of 
the  control  system  can  be  readily  examined. 

A  picture  of  the  parallel  inverted  pendulum  system  is 
shown  in  Figure  1 . 


Figure  1.  Parallel  inverted  pendulum  system 


From  these  configurations,  we  derived  the  mathematical 
model  of  the  parallel  inverted  pendulum  system. 

First,  we  linearize  the  system  near  the  equilibrium  point 
assuming  a  small  angular  displacement.  Also,  rigid  body 
motion,  lumped  mass,  and  linear  spring  are  assumed  and 
the  dynamics  of  the  motor  and  the  mechanical  friction  in  the 
motor  and  the  spring  guider  are  neglected  for  simplicity. 

The  dynamics  of  the  parallel  inverted  pendulum  system 
are  described  by  the  following  equations  of  motion. 

m{l2e i  -  mxgl6x  -  ka2(0,-62)-  zx  (la) 

m2l2d2  =  m2gW2  -  ka2(e2-e])  -  r2  (lb) 

In  Eq.  (1),  mi ,  m2  and  0l9  02  are  the  mass  and  the 
angular  displacement  of  each  inverted  pendulum, 
respectively,  and  r1?  r2  represent  the  input  torque  of  each 
motor.  The  length  of  the  inverted  pendulum,  /,  the  spring 
constant,  k  ,  and  the  acceleration  of  gravity,  g ,  are  also 


appeared  in  Eq.  (1). 

Especially,  we  let  the  connecting  position  of  the  spring, 
a ,  varies  arbitrarily  in  time.  However,  it  is  difficult  to 
describe  the  time-varying  characteristic  of  a,  so  that  we 
assume  that  it  is  invariant  and  consider  as  a  model 
uncertainty  instead. 

Thus,  this  model  can  also  be  rewritten  in  the  state  space 
as  follows: 


x  =  [  ex  ex  e2  e2  Y 
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A  schematic  figure  of  the  parallel  inverted  pendulum 
system  is  shown  in  Figure  2. 


Figure  2.  Modeling  of  the  parallel  inverted 
pendulum  system 


3.  Decoupling  control  theory 

In  this  section,  we  briefly  mention  about  a  decoupling 
control  scheme  using  eigenstructure  assignment  (EA)  and 
the  command  generator  tracker  (CGT). 

The  system  control  designer  is  often  faced  with  the  need 
to  design  a  multivariable  control  law  in  which  the  objective 
is  to  decouple  the  various  outputs  from  each  other,  in  order 
to  decrease  the  effects  of  coupled  outputs  [6,7]. 

Eigenstructure  (eigenvalues  and  eigenvectors)  assignm¬ 
ent  is  very  useful  and  suitable  for  achieving  this  objective. 
Eigenstructure  assignment  algorithms  can  be  divided  into 
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two  groups;  namely,  the  right  eigenstructure  (eigenvalues 
and  right  eigenvectors)  assignment  [8]  and  the  left 
eigenstructure  (eigenvalues  and  left  eigenvectors) 
assignment  [9]. 

Their  roles  in  designing  a  control  system  are  distinctly 
different.  The  mode  or  disturbance  decouplability  is 
governed  by  the  right  eigenstructure  of  a  system.  On  the 
other  hand,  the  control  effectiveness  and  disturbance 
suppressibility  depends  mainly  on  the  left  eigenstructure  of 
the  system.  Therefore,  in  this  work,  we  use  the  right 
eigenstructure  assignment  scheme  for  a  decoupling  control 
of  the  parallel  inverted  pendulum  system.  In  addition,  the 
command  generator  tracker  [10]  is  used  as  a  servo  that 
makes  the  inverted  pendulum  follow  the  reference  input. 

Consider  a  linear  time  invariant  system  described  by 

x  =  Ax  +  B  u 

r  (3) 

y  =  Cx 

where  x  eRn ,  ueRm ,  yeRr .  As  previously  mentioned,  for 
solving  mode-decoupling  problems,  the  right  eigenstruc¬ 
ture  assignment  scheme  is  applied. 

To  present  the  right  eigenstructure  assignment  scheme, 
we  define 


In  general,  for  a  system  that  has  repeated  eigenvalues, 
the  result  of  Theorem  1  can  be  easily  extended. 

The  theorem  1  indicates  that  the  feedback  gain  K  is 
determined  by  the  closed  loop  eigenvalues  and 
eigenvectors.  The  feedback  gain  K  is  easily  obtained 
through  the  following  equations  by  using  the  right 
eigenstructure  algorithm  if  the  desired  right  eigenstructure 
(  Xi ,  v,.,)  is  already  determined. 

K=[Mxz ,  Mxj2  -  Mxz„  ][v,  v2  -v„r’ 

where  z/  is  a  linear  combination  coefficient  that  makes  v. 
with  NXj . 

The  feedforward  gain  F  for  guaranteeing  zero  steady- 
state  errors  is  obtained  by  the  command  generator  tracker 
(CGT),  which  is  one  of  the  explicit  models  following 
methods.  For  the  decoupling  control  problem  of  the  parallel 
inverted  pendulum  system,  the  one  pendulum  is  required  to 
track  the  command  inputs.  This  may  be  achieved  by 
choosing  an  identity  model.  The  output  of  this  model  is 
equal  to  its  input.  A  control  law  for  this  objective  is 
obtained  as  follows: 


a*.  «[*,/„-><  :*] 

and  a  compatibly  partitioned  matrix 


(4) 


where  [  Q  22  +  A'Q12  ]  is  the  feedforward  gain, 

_ i  ■ _ a  ts  ‘  -  -  r ii •  _ i 
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where  the  columns  of  the  matrix  Rx  form  a  basis  for  the 
null  space  of  Sx  .  For  the  cases  that  rank  B  =  m,  it  can  be 
shown  that  the  columns  of  Nx  are  linearly  independent. 


In  Eq.  (8),  T  is  the  output  matrix  C  and  D  is  zero 
matrix.  Note  that  the  feedforward  gain  F  is  subject  to  the 
feedback  gain  K  as  seen  in  Eq.  (7). 


The  following  theorem  gives  necessary  and  sufficient 
conditions  for  the  existence  of  K  which  yields  the 
prescribed  right  eigenstructure. 

Theorem  1  [11]:  Let  {  Vj ,  v2 ,  •■*,  v„  }  be  a  self¬ 
conjugate  set  of  distinct  complex  numbers .  There  exists  a 
real  (mxn)  matrix  K  such  that 

(  A  -  BK)  v,  =  A,vi9  i  =  l,2,--,n  (6) 

if  and  only  if  for  each  i 

1)  {  vi  i  v2 ,  *■*>  vn  I  ^  a  linearly  independent  set  in 

Cn ,  the  space  of  complex  n-vectors 

2)  vf  =  v  *  when  Xt  -  X*j 

3)  v,  e  span  {Nx}. 

Also,  if  K  exists  and  rank  B=m,  then  K  is  unique,  and 
is  computed  by  using  the  obtained  submatrices  N  x  and 


4.  Hardware  setup 

In  this  section,  the  hardware  setup  of  the  parallel 
inverted  pendulum  system  is  presented  and  its  validity  is 
proved  by  experiments.  Experimental  apparatus  for  the 
parallel  inverted  pendulum  system  is  constructed  as 
follows. 

Detailed  specifications  are  listed  in  Table.  1. 

The  pivot  of  each  inverted  pendulum,  which  is  allowed 
to  rotate  about  the  axis  of  the  motor  shaft,  is  directly 
mounted  on  the  motor  shaft  without  using  a  reduction  gear 
to  alleviate  the  mechanical  friction  of  linkage  as  possible. 

The  tip  mass  of  each  inverted  pendulum  is  0.4489  kg 
and  0.4496  kg ,  respectively,  and  the  mass  of  rod  is  small 
enough  to  neglect  with  respect  to  the  tip  mass.  The  hollow 
stainless  steel  rod  of  the  inverted  pendulum  is  0.32 5m  long. 

Two  inverted  pendulums  are  connected  by  an  elastic 
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coil  spring  which  has  the  constant  of  340.22  N/m  and  is 
guided  by  a  guider  to  prevent  a  buckling  in  contraction.  The 
total  mass  of  the  spring  including  the  guider  is  0.3875% 
and  would  not  be  negligible,  hence  the  equivalent  mass  at 
the  position  of  the  tip  mass  is  calculated  as  0.4868  kg.  Since 
the  guider  is  well  lubricated,  the  friction  with  the  spring  can 
be  neglected  for  simplicity.  However,  experimental 
identification  of  the  system  parameters  such  as  friction 
coefficient  is  important  in  accurate  uncertainty  description 
for  the  desired  performance  objective. 


Table  1.  Parameters  of  the  system  and  DC  servo  motor 


j  Inverted  pendulum 

DC  servo  motor  | 

Mass  1 

0.4489  Kg 

Peak  torque 

4.94  Nm 

Mass  2 

0.4496  Kg 

Peak  current 

20.5  A 

Spring  mass 

0.3875  Kg 

Mechanical 

Time  constant 

1  ms 

Spring  constant 

340.22  N/m 

Rod 

0.325  m 

Encoder 

1000  ppr 

The  actuators  selected  for  this  experiments  were 
brushless  DC  servo  motors  (SM233B)  manufactured  by 
Parker  Hannifin  Corporation  with  capability  of  applying 
4.94  N.m  of  peak  torque  at  20.5  A  of  peak  current  (rms), 
according  to  the  manufacturer’s  specifications.  The  motor 
electrical  (0.41  ms)  and  mechanical  time  constant  (7  ms) 
are  at  least  two  orders  of  magnitude  smaller  than  the  system 
time  constant. 

T  he  motors  have  the  built-in  incremental  photo  encoder 
of  1000  Pulse/Revolution  resolution  to  measure  the  angular 
position  of  the  pendulum  in  the  form  of  pulse  signals.  Also, 
12-bit  up/down  counter  is  used  to  count  the  pulse  signals 
from  the  encoder  and  this  digital  signal  is  monitored  by  the 
digital  computer  and  a  data  acquisition  system. 

DR-8330  board  of  DARIM  system  that  has  16  channel 
A/D  and  2  channel  D/A  converter,  as  well  as  1 6  channel 
DI/DO,  is  used  for  data  acquisition  and  conversion.  The 
real-time  digital  controller  is  implemented  by  C-language 
routines  on  a  personal  computer  of  233MHz  processor. 


The  operation  performed  in  the  DR-8330  board  and  in 
C-routines  is  to  sample  the  digital  signal  from  the  encoder 
by  reading  the  DI  channel  and  generates  the  analog  voltage 


to  the  motor  by  writing  to  the  D/A  channel. 

Digital  controller  closes  the  control  loop  and  feedback 
the  angular  position  and  velocity  of  the  pendulum  from  the 
encoder  then  generates  the  command  voltage  for  the  motor 
through  the  TQ-10  torque  servo  driver. 

This  setup  is  designed  to  capture  the  important 
characteristics  of  the  parallel  inverted  pendulum  system  to 
the  best,  and  it  gives  the  systematic  methods  for  identifying 
modeling  errors  for  control  from  experimental  data. 

5.  Hardware  experiment 

In  this  section,  the  performance  of  the  decoupling 
control  system  is  investigated  by  hardware  experiments. 
The  objective  of  this  experiment  is  to  demonstrate  the 
decoupling  controlled  system  and  verify  its  control  law, 
and  more  of  it  is  the  development  of  the  parallel  inverted 
pendulum  system  itself. 

For  the  decoupling  control,  one  of  the  inverted 
pendulums  is  need  to  be  oscillated  arbitrarily.  Hence,  a 
servo  controller  such  as  the  command  generator  tracker 
(CGT)  is  employed  to  make  the  pendulum  oscillate  to  a 
given  reference  input.  In  the  experiments,  we  give  the 
reference  input  as  a  sinusoidal  wave  of  magnitude  0.3  and 
frequency  of  0.25Hz. 

In  designing  a  decoupling  controller,  we  assign  the 
eigenvalues  of  a  closed-loop  system  as  follows: 

X=[  -24  -22  -14  -12  ]. 

Since  the  position  and  rate  responses  of  the  inverted 
pendulum  should  be  decoupled  and  has  to  be  faster  than 
another  one,  we  make  X 12  larger  than  X  3  4. 

Using  the  algorithm  described  in  the  section  3,  we  can 
design  desired  right  eigenvectors  of  the  closed  loop  system 


as  follows: 

"  0.0416 

-0.0454 

0 

0 

-0.9991 

0.9990 

0 

0 

0 

0 

0.0830 

-0.0712 

0 

0 

-0.9965 

0.9975 

In  V ,  each  element  having  zero  value  is  assigned  for 
achieving  the  desired  decoupling  objective. 

Then,  the  feedback  gain  that  satisfies  the  desired 
eigenvalues  and  eigenvectors  of  the  closed  loop  system  can 
be  obtained  by  an  appropriate  eigenstructure  assignment 
method. 

22.1357  2.4138  7.1531  0  1 

7.1531  0  3.2449  1.3643  J 

In  the  gain  K,  elements  of  Kn ,  A:i4and  K2],  K22 
represent  the  degree  of  correlation  of  two  inverted 
pendulums  and  affected  mainly  by  the  system  parameters  as 
the  spring  constant,  mass  and  length  of  the  rod.  By 
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assigning  the  desired  eigenvectors  for  the  decoupling 
control,  Kl4,K22  go  to  zero,  and  Kn,  K2X  are  the  same  as 
7.1531  that  represents  the  degree  of  the  correlation.  Neither 
eigenvalues  nor  eigenvectors  of  the  closed  system  do  affect 
these  elements  in  feedback  gain. 

By  feedback,  the  effect  of  this  correlation  can  be 
compensated,  and  the  required  decoupling  objective  can  be 
achieved. 

Moreover,  a  feedforward  gain  for  a  servo  controller  is 
obtained  by  the  algorithm  given  in  section  3  as  follows: 


F= 


27.7065 

0 


0 

8.8157/ 


The  sum  of  Frobenious  norm  of  the  feedback  and 
feedforward  gains  is  shown  to  compare  with  that  of  the 
LQR  control  gain. 

normEA  =  ||  K  ||fro  +  ||  F  ||Fro 

=  24.7091  +  29.0752  =  53.7844 


Next,  this  experiments  result  is  compared  with  the  LQR 
optimal  method  to  verify  the  control  performance  of  the 
decoupling  control. 

When  the  design  parameter  R  in  the  LQR  method  is 
diag  {0.0043,0.0043} ,  the  Frobenious  norm  of  the  gains  is 
almost  same  as  the  decoupling  control  scheme  using 
eigenstructure  assignment,  and  the  feedback  and  the 
feedforward  gains  are  given  as  follows. 


[12.0255  15.2912  4.8885  0.0168  ] 

4.8885  0.0168  12.0255  15.2912  J 

f  17.5963  -2.2646 

[-2.2646  17.5963  _ 

normLQR  =  ||  G  ||Fro  +  ||  H  ||fro 

=  28.3666  +  25.0902  =  53.4568 


The  results  of  experiments  in  time  domain  response  are 
shown  in  Fig.  4  and  Fig.  6  for  the  decoupling  control  using 
eigenstructure  assignment  and  the  CGT,  and  Fig.  5  and  Fig. 
7  for  the  LQR  method  using  LQ  servo. 

In  the  experiments,  we  let  the  spring  connecting  position 
be  fixed  since  we  show  the  efficiency  of  the  decouplability 
alone  in  this  paper.  However,  the  time-variant 
characteristics  of  the  freely  sliding  spring  connecting 
position  is  surely  considered  for  the  robust  control 
hereafter. 

It  is  clear  from  experiments  that  the  results  demonstrate 
a  good  agreement  with  the  theoretical  predictions  and  show 
a  good  performance  of  the  controller  to  decouple  the 
motions  of  the  parallel  inverted  pendulum. 

Experimental  data  show  that  pendulum  which  is  to  be 
standstill  slightly  oscillate  as  seen  in  Figure  4and  Figure  5. 
This  is  due  to  the  hardware  problem  such  that  the  time 
constant  of  the  system  is  not  fast  enough  to  respond 


adequately  to  the  decoupling  command.  Since  the  time 
constant  is  directly  related  to  the  inertia  of  the  system  and 
capability  of  the  actuator,  this  drawback  can  be  easily 
improved  by  the  hardware  tuning. 

The  nonlinear  effects  and  model  uncertainties  due  to 
neglecting  the  friction  of  the  motor  and  the  guider  did  not 
cause  significant  changes  in  the  results  of  hardware 
experiments.  The  brushless  DC  servo  motor  and  data 
acquisition  board  appear  to  be  effective  and  inexpensive 
for  controller  implementation. 

The  decoupling  control  using  eigenstructure  assignment 
with  the  CGT  is  compared  with  the  LQR  optimal  method 
using  LQ  servo,  which  shows  similar  decoupling  effect  in 
the  experiments.  However,  the  LQR  control  gives  more 
weight  on  stabilizing  the  pendulum  itself  and  the 
decoupling  control  gives  more  weight  on  compensating  the 
correlation.  Therefore,  the  LQR  control  needs  more  control 
input  than  the  decoupling  control  to  achieve  the  same 
performance.  From  the  performance  stand  point,  the 
decoupling  control  using  eigenstructure  assignment  shows 
the  better  decouplability. 

One  of  the  important  lessons  in  the  hardware 
experiments  is  the  need  for  the  accurate  system  modeling 
and  system  identification.  Also,  the  experiments  proved 
again  that  the  control  problem  is  often  dependent  on  system 
modeling,  actuator  and  sensor  capability,  and  controller 
implementation. 
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6.  Conclusions 
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In  this  paper,  a  parallel  inverted  pendulum  system  is 
implemented  and  a  decoupling  control  scheme  using 
eigenstructure  assignment  with  CGT  is  experimentally 
demonstrated  and  shows  a  good  performance. 

Despite  of  the  assumption  of  a  linear  system  and  the 
neglect  of  the  mechanical  friction,  the  hardware 
experiments  show  good  agreement  with  theoretical 
predictions. 

Hereafter,  this  experimental  system  will  be  utilized  as  a 
testbed  for  control  theory  education  and  evaluation  of  new 
control  algorithms. 
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Abstract 

The  World-Wide- Web  has  been  used  extensively  in 
the  past  few  years  of  its  existence  for  data  exchange , 
and  information  gathering.  Web  on-line  control ,  on 
the  other  hand,  is  a  new  emerging  field  which  has  not 
yet  been  fully  exploited  and  holds  in  it  a  great  im¬ 
pact  on  currently  available  control  systems.  This  paper 
discusses  an  application  of  on-line  INTERNET  con¬ 
trol  service  -  a  WWW  controlled  robotic  manipulator 
arm. 


1  Introduction 

The  WWW  application  is  an  actual  real-time,  on¬ 
line  application  in  which  the  user  interacts  with  a 
robot  controller  in  real  time  using  the  World- Wide- 
Web  as  the  medium.  The  project  itself  was  conceived 
with  the  following  ideas  in  mind: 

1.  A  simple  client  side  interface  that  will  use  only  a 
form  capable  web  browser. 

2.  User  friendly  interface. 

3.  Structural  configuration  that  will  allow  fast  up¬ 
dates  for  both  user  interface  and  hardware 
change. 

4.  Web  interface  that  will  allow  easy  implementation 
of  distant-learning  tutorials  and  exercises  using 
the  robot  interface. 

The  above  last  requirement  is  quite  unique  in  this 
application.  A  major  consideration  for  this  project 
was  to  be  targeted  towards  distant-learning  programs 
in  the  field  of  control  and  robotics,  although  such  an 
interface  is  readily  implementable  in  other  fields  as  a 
distant-learning  aid. 


2  General  System  Description 

As  mentioned  earlier,  the  client  side  interface  is 
comprised  solely  of  a  forms  capable  web-browser  such 
as  Netscape  Navigator  V.2  or  Internet  Explorer  V.3. 
The  lack  of  proprietary  software  or  hardware  on  the 
client  side  enables  every  user  with  web  access  to  fully 
control  the  hardware,  regardless  of  its  platform  (ex¬ 
cept  of  the  above  minimal  requirements).  In  order 
to  start  a  link  with  the  robot,  the  user  simply  points 
the  web  browser  to  the  robots  web  page,  from  there 
on,  the  user  is  guided  through  a  series  of  screens  that 
eventually  end  up  in  an  input  page  (such  as  a  form  or 
a  sensitive  map  page).  W7hen  the  user  satisfies  the  in¬ 
put  page  request  (by  either  inputting  numerical  data 
or  clicking  on  a  sensitive  map  image),  the  data  is  being 
sent  as  a  form  GET  method  to  the  server  side  inter¬ 
face.  At  the  server  side,  the  users  data  is  processed 
and  is  being  used  to  activate  the  robot  arm.  After  the 
arm  has  ceased  moving,  two  video  cameras  are  being 
used  to  grab  an  image  of  the  robot  and  feed  it  back  to 
the  user  as  an  indication  for  the  end  of  movement  and 
as  an  opening  argument  for  the  next  command.  The 
server  side  interface  is  comprised  out  of  three  main 
parts  that  communicate  with  each  other  to  accomplish 
the  bi-directional  interface  between  the  user  and  the 
robot  arm.  The  following  are  brief  descriptions  of  the 
server  side  parts: 

2.1  The  web  server  interface. 

The  web  server  interface  is  a  series  of  HTML  pages 
that  are  being  used  as  the  visual  interface  with  the 
user.  Depending  on  the  user  selection,  different  pages 
will  be  selected,  until,  eventually,  a  page  containing  an 
input  field  will  be  selected.  When  the  SEND  button 
in  the  page' will  be  pressed,  the  page  will  launch  a 
generic  CGI  program  that  will  receive  the  data  entered 
in  the  page,  and  according  to  the  page  that  launched 
it  will  decide  upon  the  next  action.  In  any  case,  the 
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CGI  program  will  open  an  INTERNET  stream  socket 
link[l]  to  the  ROBOT  SERVER  (see  next  part)  that 
will  be  used  as  a  data  path  for  the  flow  of  commands 
to  the  robot  and  visual  feedback  back  to  the  user.  No 
checks  are  being  done  on  the  commands  sent  to  the 
robot  server.  At  an  early  stage  it  was  decided  that 
most  processing  and  command  parsing  will  take  place 
on  the  robot  server  PC  (which  is  a  dedicated  machine) 
and  not  on  the  web  server  machine  (which  is  a  public, 
general  purpose  machine). 

2.2  The  robot  server. 

The  robot  server  is  a  Pentium  PC  that  hosts  the 
robot  server  program.  This  program  is  the  heart  of 
the  system  and  most  of  the  processing  is  done  here. 
The  PC  is  equipped  with  an  RS232  interface  that  is 
used  to  communicate  with  the  robot  controller,  an  eth- 
ernet  card  which  is  being  used  to  communicate  with 
the  web  server  interface  (see  above),  and  a  dual  input 
VGA  frame  grabber  attached  to  two  Black  &  White 
video  CCD  cameras  to  enable  visual  feedback.  The 
program  is  an  event  driven  loop  which  senses  mainly 
a  dedicated  INTERNET  socket  for  requests  (which  are 
coming  from  the  web  server  interface).  The  program 
idles  as  long  as  no  requests  are  sensed.  Whenever  a 
request  is  received  over  the  socket,  the  program  wakes 
up  and  parses  the  request.  The  current  version  of  the 
program  supports  two  types  of  requests,  a  visual  only, 
and  a  movement / visual  one.  A  visual  only  request  will 
initiate  an  image  grabbing  session  that  will  grab  the 
arms  image  using  either  of  the  cameras  (depending  on 
the  request)  do  some  processing  on  it  (also  dependent 
on  the  request  -  see  stereoscopic  interface) ,  convert  the 
image  to  a  JPEG  format  and  send  it  back  to  the  web 
server  interface.  A  movement /visual  request  will  first 
be  parsed  and  checked  to  be  a  legal  movement  com¬ 
mand  for  the  robots  controller,  It  will  then  be  passed 
to  the  inverse  kinematics  module  [2]  which  will  extract 
the  new  joint  angles  required  for  the  new  position  from 
the  3D  positional  parameters  given  to  it.  The  joint 
angles  will  be  checked  out  against  the  current  ones 
to  validate  that  no  limits  are  exceeded  and  finally  a 
movement  command  will  be  sent  to  the  controller  via 
the  RS232  interface.  The  main  program  loop  will  then 
wait  on  the  robot  to  complete  its  movement  and  will 
initiate  a  visual  request  (see  above). 

2.3  The  robot  unit 

The  robot  unit  is  comprised  out  of  an  SIR-1  semi¬ 
industrial  grade  articulated  manipulator  arm  with  five 
degrees  of  freedom  (out  of  which  only  three  are  being 


a  *  -  *"T  *  -C  S"  Wt . 

*  i  w*  \-:.Vv  •  '  vvr;*':»y:  *•>? 

We leone 

To  the  In  •  •/<•■}  -  s '  y  <,♦  I?: 

Web  Operated  Manipulate 

i *  ' 2  !  '  a 

jr  Ann 

Introduction . 

y  IIJVl  .*•••  1. 

Moving  the  Arm: 

mint  rtst.  i.UAi !>.'»  tti-vmtui  a>bi  r i i-  :n 

"unW  <■  A'. 

"ROfll  *  :i:  KilVJ  TKK  A?;M  •:  IT'.'  » 

no 

si. 

\  $ 

o  USX 

(Mf 

*  •* 

Figure  1:  The  coordinate  system  interface 

Figure  2:  The  stereoscopic  visual  interface  -  state  1 


used  now),  and  the  robots  controller.  The  controller 
is  a  CPU  controlled  unit  that  is  connected  to  the  arms 
motors  and  to  a  variety  of  position  and  speed  sensors 
mounted  on  the  arm.  The  controller  can  receive  com¬ 
mands  to  move  the  robot  from  either  a  control  panel  or 
using  a  special  command  language  through  an  RS232 
port. 


3  The  user  interface 

As  written  earlier,  the  modularity  of  the  system  en¬ 
ables  the  creation  of  different  user  interfaces  which  are 
using  the  same  link  to  operate  the  robot.  Currently, 
two  user  interfaces  are  available:  the  coordinate  sys¬ 
tem  interface  (see  figure  1),  and  the  stereoscopic  visual 
interface  (see  figure  2  and  figure  3).  The  coordinate 
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Figure  3:  The  stereoscopic  visual  interface  -  state  2 


system  interface  uses  a  page  that  requests  the  coordi¬ 
nates  of  a  3D  point  to  which  the  robot  will  be  moved. 
When  the  user  presses  the  MOVE  button,  a  move¬ 
ment/visual  request  (see  above)  is  being  sent  to  the 
robot  server.  After  the  robot  has  reached  its  final  po¬ 
sition,  the  new  position  image  is  being  sent  back  to  the 
user.  A  HOME  button  is  also  available  which  causes 
the  robot  to  move  to  its  cold-start  home  position.  The 
stereoscopic  interface,  on  the  other  hand,  starts  with 
a  visual  request  at  the  moment  the  page  is  requested. 
This  enables  the  user  to  get  the  latest,  most  updated 
picture  of  the  robot.  After  the  image  is  grabbed  from 
the  main  camera  (a  camera  positioned  in  front  of  the 
arm)  it  is  being  sent  as  a  sensitive  image  back  to  the 
user  (figure  2).  The  user  is  then  given  the  choice  to 
click  on  the  sensitive  image  to  determine  the  initial 
X-Y  coordinates  of  the  requested  position,  when  the 
user  clicks  on  the  image,  the  web  server  interface  saves 
the  coordinates  entered  and  sends  another  visual  re¬ 
quest  from  the  robot  server.  This  time  though,  the 
image  is  grabbed  from  a  second  camera  placed  on  the 
same  horizontal  plane  as  the  main  camera,  but  at  a 
90  degree  angle  from  it.  The  robot  server  uses  the  Y 
coordinate  information  from  the  above  user  input  to 
interpolate  a  white  line  across  the  image  to  mark  the 
requested  X-Y  2D  point  over  the  rotated  image.  The 
image  is  then  sent  to  the  web  server  interface  which 
raps  it  with  a  HTML  package.  It  also  puts  hidden 
HTML  fields  into  which  the  previously  selected  X-Y 
coordinates  are  placed,  so  they  will  be  passed  with 
the  final  user  request.  Eventually,  the  user  receives 
another  image  of  the  robot  which  is  the  90  degree  side 
view  with  a  white  line,  representing  his  previous  se¬ 
lection  (figure  3).  The  user  is  requested  to  click  again 
over  the  image,  thus,  setting  the  new  images  X  coor¬ 


dinate.  this  coordinate  is  basically  the  Z  coordinate 
(X  system,  rotated  90  to  be  perpendicular  to  the  X-Y 
plane).  The  request  received  contains  the  Z  coordi¬ 
nate,  together  with  the  X-Y  coordinates  (embedded 
as  hidden  HTML  fields),  and  is  being  sent  as  a  move¬ 
ment/visual  request  to  the  robot  server.  The  robot 
server  is  using  this  data  to  move  the  robot  to  the  new 
position  and  finally  send  the  resultant  picture  back  to 
the  user. 


4  Future  additions 

The  WWW  operated  arm  is  a  dynamic  on-going 
project.  Currently,  there  are  a  few  additions  that  are 
to  be  considered  to  be  added  to  the  system: 

•  Queuing  system:  During  the  writing  of  this  pa¬ 
per,  extensive  work  is  being  done  in  creating  a 
queuing  system  for  user  control  and  authentica¬ 
tion.  For  the  short  duration  of  its  life,  the  WWW 
robot  had  received  thousands  of  hits  and  an  ur¬ 
gent  need  for  a  queuing  system  was  noticed.  The 
queuing  system  we  are  designing  relies  solely  on 
a  state  machine  comprised  out  of  the  user’s  own 
web  pages.  Each  viewed  page  contains  a  special 
magic  cookie  (inside  a  hidden  field)  which  is  de¬ 
crypted  in  the  server  and  carries  information  on 
scheduling,  expiration,  priority  and  user  level. 

•  Java  based  interface:  A  Java  based  interface  is 
also  considered  as  a  future  addition.  The  interface 
would  help  to  move  some  of  the  complex  inverse 
kinematics  calculations  from  the  robot  server  to 
the  users  machine. 

•  Live  Video:  For  wideband  Internet  connections,  a 
live  video  frames  stream  of  the  robot’s  movement 
would  enhance  the  understanding  of  what  is  ac¬ 
tually  happening  on  the  robot  side.  We  are  still 
considering  either  a  server  push,  or  a  Java  video 
console.  With  both  of  the  considered  options,  our 
prime  directive  is  a  non  proprietary  client  side 
software. 

•  More  Distant  Learning:  Expending  the  project 
into  the  general  sciences.  For  example,  using  the 
robot  and  interface  in  a  distant  learning  chemistry 
lab  experiment  set  (we  are  actually  working  now 
on  this  one) 
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5  Related  Information 


Information  about  this  project  and  other 
robotics  related  projects  is  provided  on  the  Uni¬ 
versity  of  Bridgeport’s  Robotics  and  Intelligent 
Sensing  and  Control  laboratory  web  page  at: 
http:  /  /  www.bridgeport.edu  /  "rise. 

Fill  free  to  visit  us  at  the  above  page,  or 
go  directly  to  the  WWW  robot  arm  page  at: 
http :  /  /  www .  bridgep  ort .  edu  /  ~  mat  anya /robi  (please 
remember  that  as  this  project  is  in  its  early  stages  of 
development,  it  is  mostly  off-line  and  under  constant 
module  and  interface  change). 
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Abstract 

Measurement  of  optica!  lenses  during  its  prototyping  and 
failure  analysis  requires  its  wavefront  error  to  be  deter¬ 
mined  in  the  sub-micron  range.  Reconstruction  of  the 
optical  wavefront  through  the  analysis  of  lateral  shearing 
interferograms  is  done  with  radial  basis  function  neural 
networks  instead  of  the  analytical  inverse  matrix  ap¬ 
proach.  Noisy  interferograms,  which  pose  a  problem  for 
the  inverse  matrix  approach,  are  simulated  and  tested  in 
the  proposed  neural  networks  approach. 

1.  Introduction 

In  the  analysis  of  lateral  shearing  interferograms  resulting 
from  optical  wavefront  measurements,  limitations  of  the 
common  approach  utilising  inverse  matrices  include  range 
and  noise  issues.  The  range  problem  is  relevant  to  aberra¬ 
tions  in  prototype  lenses  or  out-of-specification  lenses 
where  it  may  be  very  wide,  while  noise  problems  are 
common  in  the  interferometry  system  optics  and  the  test 
piece.  Besides  these  considerations  the  analysis  must  be 
able  to  handle  complicated  interferograms  containing 
combinations  of  aberrations  found  in  the  test  piece. 

The  present  techniques  for  analysing  interferograms  in¬ 
clude  the  phase  stepping  analysis,  and  Zernike  polynomial 
fitting  through  inverse  matrices  approach  [1].  The  inverse 
approach  is  very  effective  for  very  small  range  of  near 
perfect  wavefronts,  however  it  encounters  non- 
deterministic  problems  when  the  wavefronts  are  compli¬ 
cated  by  mixture  of  aberrations  or  by  noise  e.g.  dirt  on  the 
lenses.  In  this  paper  the  radial  basis  function  (RBF)  net¬ 
works  are  used  to  analyze  the  interferogram  with  the  aim 
of  improving  the  analysis  in  the  presence  of  noise  in  the 
optical  system. 

The  analysis  is  in  two  parts.  The  first  part  of  the  analysis 
is  to  identify  and  classify  the  types  of  aberration  present 
represented  in  the  interferogram.  The  second  part  is  to 


quantify  these  aberrations.  In  this  paper  the  interferograms 
are  first  simulated.  This  are  then  used  for  training  of  the 
neural  networks.  Noise  is  also  added  to  the  simulated  in¬ 
terferogram  during  the  training  phase  to  simulated  actual 
interferograms  that  are  noisy. 

2.  Aberrations  and  Interferograms 

Two  main  forms  of  interferometry  are  used  for  meas¬ 
urement  of  aberrations  of  optical  systems/components  - 
the  lateral  shearing  and  Twyman-Green  setups,  see  figure 
1.  Lateral  shearing  interferometry  has  many  advantages 
over  the  Twyman-Green  setup,  such  as  immunity  from 
vibration  of  setup,  accuracy  and  range  of  measurement 
through  variable  shear  techniques.  However  the  resulting 
lateral  shearing  interferogram  also  poses  some  peculiari¬ 
ties.  For  instance,  the  interferogram  for  defocus  and 
astigmatism  are  similar,  therefore  the  interferogram  has  to 
be  analyzed  in  two  orthogonal  shears  in  order  to  properly 
classify  these  two  aberrations.  Also  the  lateral  shearing 
interferogram,  being  obtained  from  the  interference  of  one 
beam  shifted  in  parallel  with  another  part  of  the  same 
beam,  complicates  the  analysis  as  compared  to  the  more 
straightforward  Twyman-Green  interferograms.  Examples 
of  lateral  shearing  interferograms  are  shown  in  figure  2 
where  they  are  simulated. 

3.  Zernike  polynomials  representation  of  ab¬ 
errations 

The  circle  polynomials  of  Zernike  are  commonly  used 
to  represent  wavefront  aberrations  in  optical  systems.  The 
reasons  include  the  orthogonality  of  these  polynomials 
(over  interior  of  unit  circle)  and  their  invariance  proper¬ 
ties,  both  simplifying  the  wavefront  fitting  analysis  proc¬ 
ess  in  a  rotational ly  symmetric  optical  system  [2].  The 
polynomial  representation  of  the  wavefront  is  also  in  the 
form  of  balanced  aberration;  this  is  very  useful  in  the 
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tolerance  analysis  of  an  optical  system. 

Wavefront  OPD  error  W(p,  6)  can  be  represented  by  a 
Zernike  expansion  in  cylindrical  coordinates  (p,  0)  as 

»w)=£2>.z;w)  (i) 

ffsO  /«=() 

where 

&nm  are  the  Zernike  coefficients,  and 
Z;;  (p,  6>)  are  the  Zernike  polynomials 

«  and  w  are  the  order  and  degree  of  the  polynomials  re¬ 
spectively. 

From  equation  (1)  the  wavefront  variance  with  zero 
mean  (mean  is  arbitrary)  is  then, 

cr;  =-{  \W-pdpdd  (2) 

x  0  0 

where  crH  is  the  rms  of  the  wavefront  error. 


Interferogram 


Lateral 

Shearing 

Optics 


Beam  Source 


Figure  1.  Two  main  forms  of  interferometry  set-up.  The  top 
figure  shows  the  Twyman-Green  interferometry  set-up  and  the 
bottom  shows  the  lateral  shearing  interferometry  set-up. 


Figure  2.  Lateral  shearing  interferograms  for  Spherical  Ab¬ 
erration  3X  (top  left/  Coma  ()')  (top  right)  3 A..  Astigmatism  (0°) 
(botton  left)  3X.  and  Defocus  3X  (bottom  left). 


And  the  fit  variance,  describing  the  error  of  fitting  a 
measured  wavefront  W*  to  the  actual  W ,  is 

°)=-\\QV'-wfPdpde  (3) 

X  0  0 

Then  the  optimal  fit  is  obtained  from  partial  differential  of 
a),  with  respect  to  each  coefficient  anm.  Given  the  or¬ 
thogonality  of  Zernike  polynomials,  it  can  be  shown  that 
[1], 

2(n  +  U  1  ln 

a„m = - - \  \w  z;;  p  dp  do  (4) 

x  0  0 

The  intensity  of  a  measured  interferogram  in  a  Twyman- 
Green  interferometer  can  be  described  as, 

IIK(p,&)  =  I0  +  l0yCos(^W‘)  (5) 

A 

And  the  intensity  of  a  Lateral  Shearing  interferogram  is, 

2  77” 

4(p-  6)  =  4  +  hr  Cos  (—  A  W')  (6) 

A 

In  the  Twyman-Green  interferometer  llf:(p,0)  is  imaged 

by  a  CCD  camera  and  W'  is  obtained  equation  (5). 
Zernike  coefficients  are  then  derived  through  the  optimal 
fit  equation  (4).  This  is  fairly  simple  compared  to  the  lat¬ 
eral  shearing  interferometer  where  4(p.<9)  is  similarly 

imaged  but  the  derivation  of  Zernike  polynomials  for  W' 
is  complicated  by  the  lateral  shear. 

In  Lateral  Shear  (x-shear),  the  measured  APV'  is  related  to 
W  through  the  shear  ratio  as 

AV'X  (x,y,s)  =  y'(x  +  s,y)-V'(x-s,y) 

=  5>«A x:„(x,y,s)  (7) 

where 

V'  is  the  Cartesian  coordinate  equivalent  of  Wf , 
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s  is  the  shear  ratio;  —  distance  between  centers  of  inter¬ 
ference  beams, 
and 

^X'x.u (*' y'i s)  is  the  difference  Zernike  polynomial  term. 

The  resolution  of  the  interferometry  system  is  im¬ 
proved  by  an  order  with  phase  stepping ,  where  a  series  of 
interferogram  is  used  instead  of  only  one.  In  the  simplest 
method  the  interference  pattern  is  stepped  as  follows, 

l(p,0)  =  IQ  +  I0y  Cos(~  A  W'  +  (8) 

A,  N 

where  m  is  the  step  number  and  N  is  the  total  number  of 
steps. 

4.  Analytical  wavefront  reconstruction 

There  are  two  broad  wavefront  reconstruction  tech¬ 
niques.  The  first  technique  is  the  zonal  technique  where 
the  least  square  fit  of  a  wavefront  on  a  grid  of  measured 
difference-front  points  is  used.  This  method  is  more  suit¬ 
able  for  surface  errors,  which  do  not  assume  any  shape  in 
particular.  The  second  technique  is  the  modal  phase  re¬ 
construction  method  such  as  fitting  of  wavefront  error 
using  the  Zernike  polynomials  [1]  for  identifying  and 
quantifying  lens  aberrations.  The  latter  approach  of  wave- 
front  representation  is  applied  in  this  paper  with  the  re¬ 
sults  quantified  in  Zernike  polynomial  coefficients 

5.  Wavefront  reconstruction  with  neural 
networks 

In  this  paper  the  radial  basis  neural  network  is  used  for 
the  reconstruction  of  lateral  shearing  interferograms.  The 
most  common  form  of  RBF  neural  networks  is  originally 
proposed  by  Broomhead  and  Lowe  [3]  and  has  form  as 
shown  in  figure  4.  This  network  may  be  expressed  as  a 
linear  model  for  the  function  y(x)  of  the  form 

III 

/(*)  =  £  tfy/7  (x)  (9) 

./  =  ! 

The  RBF  network  is  linear  if  the  hidden  functions,  also 
called  the  basis  functions,  are  fixed  functions.  If  the  basis 
functions  are  allowed  to  move  or  change  in  size  (e.g.  the 
radial  basis  functions’  centers  are  not  fixed),  then  the  RBF 
network  is  non-linear.  In  this  paper  the  linear  RBF  net¬ 
works  will  be  used. 

A  special  class  of  function,  known  as  radial  functions, 
is  used  for  the  hidden  functions.  The  main  characteristic 
of  radial  functions  is  that  it  is  monotonically  either  de-. 
creasing  or  increasing  from  a  center  point.  One  common 
form  of  radial  basis  functions  is  the  Gaussian  function, 
which  in  scalar  form,  is  expressed  as 


, ,  ,  f  (x-c)2) 

h(x)  =  exp  (10) 

\  r  ) 

where  its  parameters  are  c  =  center 
r  -  radius 

Gaussian  RBF  is  commonly  chosen  for  RBF  neural  net¬ 
works  as  it  gives  local  response  and  does  not  tends  to  in¬ 
finity.  This  is  unlike  multiquadric-type  RBFs  which  gives 
global  response  and  tends  towards  infinity  if  it  is  too  far 
off  the  centre.  Multiquadrics  are  of  the  form 


Figure  3.  I  he  top  row  shows  the  lateral  shearing  interfero¬ 
grams  of  coma  (X)  in  various  degrees  of  severity  (0.6X,  1.2X, 
1.8?l,  2AX  from  left  to  right).  The  bottom  row'  shows  the  phase 
stepping  (5  equal  steps)  images  of  the  0.6X  coma  (X)  interfero¬ 
gram. 

The  approach  in  this  paper  is  to  use  a  group  of  sub- 
RBF  networks  for  the  analysis  of  the  lateral  shearing  inter¬ 
ferogram.  Each  of  the  sub-RBF  networks  is  trained  to 
identify  and  quantify  one  form  of  wavefront  error  i.e.  op- 
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tical  aberrations.  The  analysis  is  in  two  parts.  The  first 
part  of  the  analysis  is  to  identify  and  classify  the  types  of 
aberration  from  the  interferogram.  The  second  part  is  to 
quantify  these  aberrations.  In  this  paper  the  interferograms 
are  first  simulated  using  equation  (6).  These  are  then  used 
for  training  of  the  RBF  networks.  Noises  are  added  to  the 
simulated  interferograms  during  the  training  and  testing  of 
the  networks  to  simulated  actual  interferograms. 


Figure  4.  Radial  Basis  Function  Network,  w  f  represents  the 
weights  of  the  network  and  hj  are  the  hidden  functions. 


£f_ 

3W  : 


(xi)  =  hJ(xi) 


(15) 


Substituting  this  into  the  derivative  of  the  sum-squared- 
error,  we  thus  arrive  at  the  following  set  of  equations 


Z/(x,)/j,(x,)=XM(x-)  (16) 

/=!  /=l 

Using  equations  (11)  and  (15)  the  set  of  weights  can  be 
solved  for  the  network,  and  a  unique  solution  can  be 
found  unless  there  is  redundancy.  In  the  presence  of  noise 
in  the  training  or  test  set,  redundancy  will  arise,  in  which 
case  regularization  and  subset  selection  will  be  required. 

It  is  important  that  the  interferogram  images  first  be 
preprocessed  before  they  are  used  for  reconstruction.  Here 
the  interferogram  set  for  training  are  prepared  so  that  it 
represent  a  uniform  distribution  of  the  interferograms  to 
be  reconstructed.  The  images  are  also  normalized  and  no 
redundant  images  are  used  in  the  training  set.  These  steps 
help  to  ensure  better  results. 


5.1.  Reconstruction  of  interferogram  without 
noise 

In  this  section  interferograms  without  noise  are  recon¬ 
structed  with  neural  networks.  This  is  an  ideal  case  study 
because  there  is  always  noise  in  practice,  however  it  forms 
the  basis  for  the  use  of  neural  networks  in  the  reconstruc¬ 
tion  process.  To  reconstruct  the  noiseless  interferogram 
the  least  squares  method  is  used  during  the  training  phase 
of  each  sub-RBF  network,  see  figure  5. 

In  Linear  RBF  network,  supervised  learning  can  be 
done  using  least  squares  method  in  which  the  best  weights 
are  solved  from  the  training  set  through  deriving  and 
solving  a  set  of  equations  (assuming  that  there  is  no  re¬ 
dundancy  in  this  set  of  equations).  In  this  analysis  it  is 
assumed  that  noise  is  negligible  in  the  training  data  set. 
Noise  will  be  considered  in  the  regularization  networks  to 
be  shown  in  the  next  section. 

Consider  the  linear  network  model 

HI 

/(x)  =  £>,/;,  (x)  (12) 

7=1 

with  the  training  set  {(x/5 £,)}£, .  The  weights  of  the 
model  are  obtained  through  the  least  squares  method  by 
minimizing  the  sum-squared-error  with  respect  to  the 
weights.  Thus  the  sum-squared-error  is 

s  =  t(f(*,)-y,Y-  (13) 

/-I 

and  to  minimize  it  (for  the  weight  wj ),  set 

SS  l>  d  f 

-T—  =  2Z(/(x,)-j>,)— Mx,)  =  0  (14) 

aw  j  /=i  aw  i 

where 


Figure  5.  Neural  networks  structure  for  interferogram  recon¬ 
struction.  Each  SabNN  module  is  a  radial  basis  function  neural 
network  similar  to  figure  4. 

5.1.1.  Results. 

In  the  first  experiment,  the  nodes  are  chosen  with  the 
same  number  of  nodes  as  the  interferogram  image  inputs. 
The  wavefront  error  of  the  reconstructed  interferogram 
shows  errors  of  less  than  5%  for  aberrations  ranging  from 
2  to  5  Each  interferogram  is  represented  by  1600 
pixels  and  240  hidden  functions  are  used  per  sub-RBF 
network.  Please  refer  to  table  1  for  the  results.  The  inter¬ 
ferograms  are  for  spherical  aberration  with  0.1  shear  ratio. 

5.2.  Reconstruction  of  interferogram  with  noise 

The  contrast  between  a  clean  and  noisy  interferogram 
used  in  the  simulation  is  shown  in  figure  6.  The  noisy  in¬ 
terferogram  from  simulation  are  close  to  the  actual  inter¬ 
ferograms  obtained  from  the  interferometer,  as  shown  in 
figure  7.  The  advantage  of  using  of  neural  networks  for 
reconstruction  of  interferogram  lies  in  its  ability  to  handle 
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noisy  images.  To  reconstruct  the  wavefront  from  noisy 
interferograms  the  supervised  learning  with  cross- 
validation  and  regularization  methods  are  employed  to 
train  the  RBF  networks. 

Network  learning  is  essentially  a  problem  of  surface 
reconstruction  or  approximation  in  a  multi-dimensional 
space  given  a  training  data  set.  This  is  a  form  of  inverse 
problems  and  are  either  well-posed  or  ill-posed.  Ill-posed 
problems  are  those  where  no  unique  solution  exists  due  to 
insufficient  information/training  data  to  constrain  one.  Ill- 
posed  inverse  problems  are  due  to  non-uniqueness  and/or 
non-continuity ,  arising  for  instance  from  the  presence  of 
noise  in  the  input  training  data 

To  make  the  problem  well-posed  so  that  generalization 
to  new  data  is  feasible,  the  regularization  method  ridge 
regression  [4,  5]  is  used.  Ridge  regression,  or  weight  de¬ 
cay,  essentially  involves  adding  a  penalty  term  to  the  sum- 
squared-error  and  helps  to  lessen  unimportant  network 
connections.  This  is  done  by  adding  a  term  to  penalise 
large  weights  in  the  sum-squared-error,  resulting  in  the 
following  cost  function 

C  =  ZG\  -/(X;))'  (17) 

/=!  7=1 

where  T>0  is  the  reguiarisation  parameter.  Therefore  a 
network  with  small  weights  is  preferred  to  that  with  large 
weights,  lessening  the  network’s  sensitivity  to  noise. 

The  number  of  hidden  nodes,  m,  of  the  network  must 
be  selected  with  care  to  ensure  good  results.  If  m  is  too 
large  then  noise  will  be  fitted  into  the  model.  This  pa¬ 
rameter  is  chosen  through  a  model  selection  criterion 
based  on  the  cross-validation  method  [4], 

5.2.1.  Results.  Simulation  results  with  noisy  interfero¬ 
grams  with  spherical  aberrations  ranging  from  2  to  5  \.x 
show  that  the  reconstructed  wavefront  error  is  below  21%. 
The  simulated  random  noise  level  is  1.5%  of  peak  image 
intensity.  480  hidden  functions  are  used  per  sub-RBF  net¬ 
work  and  each  interferogram  is  represented  by  1600  pix¬ 
els.  Three  forms  of  aberrations  were  tested,  spherical  ab¬ 
erration,  astigmatism  and  coma  (both  X  and  Y  shear). 


6.  Conclusions 

The  RBF  network  is  found  to  be  a  suitable  method  for 
reconstruction  of  wavefront  from  lateral  shearing  inter¬ 
ferogram.  This  method  has  been  shown  to  provide  accept¬ 
able  results  for  noisy  interferogram  images  found  in  actual 
measurements  for  aberrations  in  the  range  of  2k  to  5k]>y.  It 
is  expected  that  aberrations  in  the  diffraction  limited  range 
will  present  similar  results  when  the  shear  ratio  is  in¬ 
creased.  This  and  further  experiments  on  measured  inter¬ 
ferograms  will  form  future  works  in  the  area. 


Figure  6.  Coma  (y)  without  noise  (top  left)  and  with  noise  (top 
right).  I  he  graphs  below  them  are  plots  of  the  intensity  profile 
across  the  centre  of  the  interferograms. 


Figure  7.  Interferograms  obtained  from  a  lateral  shearing  in¬ 
terferometer.  see  figure  8.  The  left  has  a  defocus  and  the  right  is 
a  near  diffraction  limited  interferogram. 

7.  References 

[1]  G.  Harbers,  P.J.  Kunst,  G.W.R.  Leibbrandt  :  Analysis 
of  lateral  shearing  interferograms  by  use  of  Zernike  poly¬ 
nomials,  Applied  Optics,  Vol.35,  No. 3 1 ,  1 996. 

[2]  M.  Born,  E.  Wolf,  Principles  of  Optics.  Cambridge 
University  Press,  1997. 

[3]  Broomhead,  D.S.,  Lowe,  D.,  Multivariable  functional 
interpolation  and  adaptive  networks”.  Complex  Systems  7 
pp. 321-355,  1988. 

[4]  Orr,  Mark  J.L.,  ‘‘Reguiarisation  in  the  selection  of  ra¬ 
dial  basis  function  centres”.  Neural  Computation ,  vol.7, 
no. 3,  pp. 606-623,  1995. 


311 


[5]  Hoerl,  A.E.,  Kennard,  R.W.,  “Ridge  regression:  Bi¬ 
ased  estimation  for  nonorthogonal  problems”,  Tech¬ 
nometrics ,  vol.  12,  no.  3,  pp. 55-67,  1970. 


Figure  8.  Later  shearing  interferometer  used  for  aberration 
measurements. 


%  Reconstruction  Error 

Training 

Step 

Basis  func¬ 
tion  radius, 
r 

MSE 

2X 

3X 

41 

51 

SA 

1 

0.01 

0.04 

13.40 

0 

-6.70 

-10.72 

1 

0.1 

0.04 

13.40 

0 

-6.70 

-10.72 

1 

1 

0.04 

13.40 

0 

-6.70 

-10.72 

1 

10 

0.04 

13.40 

0 

-6.70 

-10.72 

0.5 

0.01 

0.01 

9.40 

0 

-4.70 

-7.52 

0.5 

0.1 

0.01 

9.40 

0 

-4.70 

-7.52 

0.5 

1 

0.01 

9.40 

0 

-4.70 

-7.52 

0.5 

10 

0.01 

9.40 

0 

-4.70 

-7.52 

0.1 

1 

7.8E-04 

5.50 

0 

-2.70 

-4.40 

0.05 

1 

3.2E-04 

4.92 

0 

-2.46 

-3.93 

Table  1.  Results  of  reconstruction  of  clean  interferogram  with  neural  networks. 


%  Reconstruction  Error  1 

Training 

Step 

Basis  func¬ 
tion  radius, 
r 

MSE 

2X 

3X 

4X 

5X 

SA 

0.05 

1 

0.0027 

20.4 

6.9 

-0.5 

-4.7 

AST 

0.05 

1 

0.002 

11.9 

0.0 

-5.9 

-10.3 

CAx,y 

0.05 

1 

0.0019 

11.5 

0.0 

-6.3 

-9.6 

Table  2.  Results  of  reconstruction  of  interferograms  with  noise. 
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Abstract 

This  paper  presents  a  lane  recognition  technique  that 
utilizes  a  neural  network .  This  technique  combines  the 
conventional  image  processing  and  the  principle  of  neural 
network.  The  technique  is  applied  to  images  obtained 
from  highways  and  unpaved  roads.  The  images  are 
filtered  to  reduce  the  noise  and  then  fed  to  a  multi-layer 
feed-fonvard  network  that  is  trained  with  the  road  images 
in  advance.  The  neural  network  produces  its  output 
indicating  the  direction  of  road  ahead  of  the  vehicle. 
This  technique  can  be  used  for  a  driver  warning  system 
that  can  alert  the  driver  when  the  car  is  moving  out  of  its 
lane  without  turning  on  the  turn  signal. 


1.  Introduction 

Among  the  effort  to  develop  more  intelligent  and  safer 
vehicle,  a  key  area  is  the  lane.  If  a  car  can  detect  the  lane 
that  it  has  to  follow,  it  is  possible  to  warn  the  driver  when 
he  or  she  steers  out  of  the  lane  without  turning  on  the  turn 
signal.  Also,  the  lane  recognition  ability  can  be  a  crucial 
factor  in  realizing  an  autonomous  vehicle  [1-3].  For  this 
purpose,  a  vision  system  is  a  popular  choice  because  it 
can  provide  detailed  information  about  the  road  ahead [2]. 

This  paper  presents  the  development  of  a  lane 
recognition  technique  that  combines  traditional  image 
processing  and  neural  network[4].  In  this  research,  images 
from  highways  and  unpaved  roads  have  been  processed  to 
be  suitable  for  neural  network  input.  These  processed 
images  have  been  used  by  a  multi-layer  feed-forward 
network  to  determine  in  which  direction  the  lane  is 
turning. 

2.  Lane  Recognition  Algorithm 

The  process  begins  with  filtering  a  captured  image  of 
the  road  by  using  a  band-pass  filter  whose  thresholds  are 


adjusted  to  capture  only  the  lane  markings  painted  on  the 
road  for  highway  images.  For  unpaved  road,  color 
information  of  images  is  used  for  the  band-pass  filter  to 
eliminate  portions  other  than  the  road.  In  order  to 
reduce  scattered  noise  on  the  image,  expansion  and 
contraction  method  was  used  where  a  black  pixel 
surrounded  by  white  pixels  is  converted  to  white  and  vice 
versa. 

The  highway  image  after  these  filters  has  white  lane 
markings  that  covers  a  small  portion  of  the  image  and 
may  not  provide  sufficient  information  for  neural  network. 
Therefore,  the  gap  between  lane  markings  is  filled  with 
white  pixels.  This  filling  process  starts  from  the  bottom 
center  of  the  image  and  transverses  both  left  and  right 
until  a  white  pixel  is  reached.  The  midpoint  of  the  white 
pixels  becomes  the  starting  point  of  the  filling  process  for 
the  row  just  one  pixel  above  the  current  one.  For  the  rows 
where  the  lane  marking  is  not  present,  the  lane  marking  is 
linearly  extrapolated,  as  shown  in  Figure  1,  based  on  the 
lane  markers  identified  already.  For  the  images  from 
unpaved  roads,  this  kind  of  filling  process  is  not  required 
because  the  filtered  image  has  a  substantial  area  with 
white  pixels. 
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Figure  2.  Neural  network  input  windows 


The  next  step  is  to  select  a  subwindow  for  neural 
network  input.  The  input  windows  are  shown  in  Figure 
2  where  the  input  window  for  unpaved  road  is  a  little 
lower  in  the  image  plane  and  larger  than  that  for  highway 
because  the  vehicles  are  driven  slower  on  unpaved  road 
and  the  unpaved  road  tends  to  have  smaller  radius  of 
curvature.  The  input  window  is  further  divided  into 
thirty-six  cells  that  are  arranged  in  a  three-by-twelve  array. 
Each  cell  is  given  a  value  that  is  the  ratio  of  the  number  of 
white  pixels  to  the  total  number  of  pixels  in  the  cell. 

These  values  are  fed  into  a  feed-forward  neural 
network  shown  in  Figure  3.  The  neural  network  has 
thirty-six  input  nodes,  forty  nodes  in  the  hidden  layer,  and 
fifteen  output  nodes.  The  output  nodes  are  grouped  into 
five  different  directions,  i.e.,  sharp  left  turn,  left  turn, 
straight,  right  turn,  and  sharp  right  turn.  When  the  peak 
value  appears  from  output  nodes  1  to  3,  it  is  interpreted  as 
a  sharp  left  turn  while  peak  values  appearing  from  nodes 
4  to  6,  7  to  9,  10  to  11,  and  12  to  15  are  left  turn,  straight, 
right  turn,  sharp  right  turn,  respectively.  The  reason  to 
use  the  node  number  producing  a  peak  value  rather  than 
the  individual  output  values  is  to  make  the  result  more 
robust  to  various  noises.  The  neural  network  is  trained 
with  well-known  back-propagation  learning  algorithm 
using  a  set  of  five  typical  road  images  and  corresponding 
human  interpretation.  The  expected  output  is  organized 
in  such  a  way  that  a  node  generates  the  peak  value  of  1 
and  the  output  value  is  decreased  by  0.25  for  the  nodes 
away  from  the  peak  node.  For  example,  the  node  5  is 
trained  to  generate  the  value  of  1  for  a  left-turning  road 


while  the  value  of  0.75  should  be  obtained  from  nodes  4 
and  6,  0.5  from  nodes  3  and  7,  0.25  from  nodes  2  and  8,  0 
from  all  others. 


Input  Layer(36)  Hidden  Layer{40)  Output  Layer(15) 


Figure  3.  Multi-layer  feed-forward  network 


The  steps  involved  in  recognizing  the  road  direction 
are  summarized  in  Figures  4  for  both  highway  and 
unpaved  road. 
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Figure  4.  Lane  recognition  flowchart  for  highway(left)  unpaved  road(right) 
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Figure  5.  Straight  road  for  neural  network  training 
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3.  Experimental  Results 
3.1.  Highway  Images 

After  training  the  neural  network,  the  images  used  for 
training  were  fed  into  the  network  in  order  to  verify  the 
ability  to  recall  what  has  been  learned.  Figure  4(a) 
shows  one  of  the  training  images  while  Figures  4(b)  to 
4(d)  show  the  transformed  images  after  filtering,  gap 
filling,  and  windowing,  respectively.  In  Figure  4(e),  the 
white  bars  are  the  training  data  for  the  image  and  the 
black  ones  are  the  outputs  of  the  neural  network  that 
closely  follows  the  training  data. 

In  order  to  check  the  performance  for  the  untrained 
image,  the  image  of  a  straight  section  of  a  highway  on  a 
rainy  day  has  been  used.  The  original  image  is  shown  in 
Figure  5  along  with  the  neural  network  input  image  and 
outputs.  The  peak  value  of  the  outputs  appears  from 
node  8  indicating  that  it  is  a  straight  section 
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Figure  6.  Straight  highway 


For  a  curved  section  of  a  highway,  such  images  shown 
in  Figure  6  are  used  to  test  the  performance.  Figure  6 
shows  the  original  image,  input  image  to  the  network,  and 
the  outputs.  The  outputs  are  not  exactly  matching  the 
ideal  outputs,  but  the  peak  value  is  appearing  from  the 
node  5  as  expected. 
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Figure  7.  Left-turning  highway 


3,2.  Unpaved  Road  Images 

In  Figure  7,  a  section  of  unpaved  road  is  shown  along 
with  the  image  for  the  neural  network  that  has  gone 
through  the  filtering  and  windowing.  The  peak  output  is 
from  node  5  indicating  that  the  road  curves  to  the  left. 

Similar  tests  have  been  carried  out  with  one  hundred 
untrained  images  each  for  highway  and  unpaved  road. 
For  highway,  the  network  was  able  to  determine  the  road 
direction  correctly  for  82  images  while  the  correct  outputs 
were  obtained  for  95  unpaved  road  images. 
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Figure  8.  Unpaved  road 


All  the  incorrect  outputs  were  caused  by  severe  noise 
in  the  image  that  includes  other  vehicles,  letters  and 
arrows  painted  on  the  road,  and  shadows  of  objects 
standing  at  the  roadside.  Figure  8  shows  such  an  image 
where  an  arrow  is  painted  on  the  road.  Due  to  this  extra 
marking  on  the  pavement,  the  filling  process  between  lane 
markings  was  stopped  prematurely,  which  resulted  in  an 
incorrect  output.  In  order  to  circumvent  this  problem,  an 
algorithm  may  be  included  in  the  filling  process  to  ignore 
white  pixels  within  the  identified  lane  markings. 

The  time  required  to  determine  the  road  direction  is 
within  200msec  that  looks  promising  for  the 
implementation  as  a  part  of  a  driver  warning  system  or  a 
autonomous  vehicle. 
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Figure  9.  Arrow  painted  on  the  road 

4.  Conclusions 

This  research  focuses  on  detection  of  road  direction 
based  on  images  captured  on  highways  and  unpaved  roads. 
For  this  goal,  traditional  image  processing  techniques 
such  as  band-pass  filter  and  expansion/contraction  filter 
are  combined  with  the  principle  of  neural  network. 
These  filters  are  used  to  capture  the  necessary  portion  of 
the  image  and  to  reduce  the  noise  present  in  the  image. 
After  the  processed  image  within  a  subwindow  is 
subdivided  into  thirty-six  cells,  cell  values  are  computed 
and  fed  into  a  feed-forward  neural  network  with  fifteen 
output  nodes.  The  neural  network  is  trained  to  recognize 
five  different  shapes  of  the  road.  The  neural  network  is 
found  to  be  able  to  recall  the  trained  image  very  well  and 
to  interpolate  for  the  untrained  images.  However,  the 
network  has  difficulties  with  the  untrained  images  that 
contain  severe  noise  such  as  shadows  and  letters  on  the 
road  and  further  research  is  needed  to  overcome  this 
shortcoming. 
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Abstract 

The  application  of  Artificial  Neural  Network  (ANN) 
control  of  a  Fieldbus  Pilot  Plant  System  is  considered  in 
this  paper.  The  pilot  plant  fitted  with  Foundation ™ 
Fieldbus  devices  is  modelled  using  ANN.  A  novel 
feedback  linearisation  technique  using  neural  networks  is 
proposed  to  control  the  neural  network  model  of  the  pilot 
plant.  Simulation  results  are  presented  to  demonstrate 
the  effectiveness  of  the  technique  used. 


1*  Introduction 

The  use  of  process  instrumentation  with  digital 
communication  facilities  enables  the  plant  operator  a 
number  of  advantages.  The  possibility  to  control, 
configure  and  monitor  the  performance  of  the 
instrumentation  from  a  central  control  room  reduces  the 
need  for  local  inspections,  thus  economising  and  saving 
time.  By  connecting  together  instrumentation  via  a  bus, 
the  plant  can  be  operated  as  a  decentralised,  modular 
units. 

The  main  advantage  of  using  Foundation™  Fieldbus  is 
that  it  utilises  only  a  single  pair  of  wire  for  connection 
from  the  Fieldbus  devices  to  the  controller  as  illustrated 
in  Figure  1.  This  in  turn  drastically  reduces  the  cost  of 
wiring,  control  hardware,  I/O  cards  for  PLCs  and  space  in 
control  cabinets,  expansions,  maintenance  etc.  as 
compared  with  the  existing  technologies  viz.,  pneumatic, 
analog  and  distributed  control.  Other  added  features  of 
Fieldbus  technology  are  simpler  man-machine  interface, 
easy  configuration  of  the  system,  enhanced  system 
performance,  intrinsic  safe  and  interoperability. 

Neural  networks  can  be  used  favourably  in  modelling, 
identification  and  control  of  unknown  non-linear  plants 
[1-5].  To  prevent  modelling  difficulties  for  complex 
physical  systems,  the  ANN  method  of  learning  and 
control  provides  a  basic  framework  for  the  design  of 
controllers  for  unknown  nonlinear  systems.  ANN  can  be 
viewed  as  the  nonlinear  dynamic  mapping  of  control 
inputs  onto  observation  outputs.  The  capability  of  the 
neural  networks  to  approximate  large  classes  of  non¬ 
linear  functions  sufficiently  accurately  makes  them  prime 
candidates  for  use  in  dynamic  models  for  the 
representation  of  non-linear  process  plants. 


□ 


Control  System 
Network 


Fieldbus 


row 


row 


Fieldbus 


Figure  1:  Fieldbus  System 

In  most  of  the  work,  arbitrary  nonlinear  plants  have 
been  approximated  using  ANN.  But  it  is  well  known  that 
one  cannot  generalise  the  nonlinear  problem  because  the 
nonlinearity  is  unique  in  different  applications.  Process 
plants  have  been  studied  extensively  and  their 
nonlinearity  has  been  characterised  but  this  knowledge 
has  not  been  utilised  in  the  application  of  ANN  to  process 
control.  Our  approach  is  unique  and  has  an  added 
advantage  of  applying  neural  networks  for  modelling  and 
control  of  a  Fieldbus  system,  since  abundant 
information  is  available  from  Fieldbus  instruments  than 
available  from  conventional  type  of  instrumentation 
system.  This  information  can  be  effectively  used  to 
characterise  the  structure  of  ANN  model  and  utilise  the 
full  capability  of  ANN  to  approximate  what  is  unknown 
[6].  The  information  allows  one  to  model  a  system  faster, 
more  accurately  and  also  to  control  it  effectively. 

Controlling  non-linear  systems  by  using  a  linear 
controller  via  feedback  linearisation  is  centred  around 
geometric  theory  [7].  However,  applicability  of  this 
method  is  quite  limited  since  the  theory  depends  on  exact 
knowledge  of  non-linearities  of  the  system  and  realisation 
of  non-linear  transformation  tends  to  become  difficult 
when  the  complexity  of  the  system  increases.  This 
limitation  in  turn  has  motivated  the  useage  of  feedback 
linearisation  using  ANN  [8-9],  A  novel  ANN  structure 
for  feedback  linearisation  of  the  ANN  plant  model  is 
proposed  in  this  work.  A  fixed  linear  controller  is  then 
used  to  control  the  linearised  pilot  plant  model. 
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The  paper  is  structured  in  the  following  manner.  The 
next  section  discusses  the  system  modelling  of  the 
Fieldbus  pilot  plant.  In  Section  3,  input-output 
linearisation  is  discussed.  In  Section  4,  ANN 
implementation  for  modelling  and  feedback  linearisation 
is  demonstrated  with  the  aid  of  simulations.  Application 
of  linear  controller  is  dealt  in  Section  5.  Finally  we 
conclude  our  work  in  the  conclusion  section. 

2.  Modelling  of  the  Pilot  Plant 

In  our  work,  neural  networks  are  used  to  model  the 
pilot  plant  equipped  with  Foundation™  Fieldbus  devices 
as  shown  in  Figure  2.  The  pilot  plant  is  divided  into  two 
major  subsystems  a  heat  exchanger  system  and  a  batch 
process  system.  The  pilot  plant  is  installed  in  the 
laboratory. 


Figure  2:  Pilot  plant  equipped  with  Fieldbus 
devices 


In  this  paper  we  model  and  control  only  the  heat 
exchanger  system  using  neural  networks.  The  schematic 
diagram  of  the  shell  and  tube  heat  exchanger  is  given  in 
Figure  3. 


Figure  3:  Schematic  of  a  Heat  Exchanger 


The  heat  exchanger  consists  of  a  shell  tube  and  an 
inner  tube.  The  heat  transfer  process  takes  place  from  the 
shell  tube  to  the  inner  tube.  The  inlet  and  outlet 
temperatures  of  the  shell  tube  are  Ty  and  Tl0  The  load 
disturbances  are  feed  flow  of  the  inner  tube  F2i  and  inlet 
temperature  of  the  inner  tube  T*.  The  shell  tube  inlet 
flow  is  Fjj  and  the  outlet  temperature  of  the  inner  tube  is 
T2o.  In  the  heat  exchanger,  the  liquid  flows  through  the 
inner  tube  and  it  is  heated  by  another  liquid  that  flows 
around  the  tube  as  shown  in  Figure  3. 

The  unknown  nonlinear  system  (heat  exchanger)  to  be 
modelled  is  expressed  by 

y(t  +  \)  =  f[y(t),y(t-\),---,y(t-ri), 

where  y(t)  is  the  scalar  output  of  the  system,  u(t)  is  the 
vector  input  to  the  system,  /[.]  is  the  unknown  nonlinear 
function  to  be  estimated  by  the  neural  network  and  n  and 
m  are  the  known  structure  orders  of  the  system.  General 
nonlinear  modelling  techniques  have  been  studied,  such 
as  NARMAX  [10],  for  the  identification  of  this  nonlinear 
function.  Based  on  the  theoretical  results  of  Cybenko 
[11]  and  Funahashi  [12],  it  has  been  suggested  that 
feedforward  networks,  such  as  the  multilayer  perceptrons, 
can  be  trained  to  form  any  realisable  vector  function /[.] 
and  hence  any  nonlinear  model. 

2.1  Neural  Modelling 

A  three  layer  neural  network  (input,  hidden  and 
output)  is  used  for  learning  purpose  and  the  standard 
algorithm  [13]  and  [14]  is  employed  to  train  the  weights. 
The  activation  functions  are  hyperbolic  tangent  for  the 
second  layer  and  linear  for  the  output  layer.  Let  the  input 
to  the  neural  network  be  denoted  by 

q  =  [y(.t)>y(t  -i),--, y(t  -n), 

m  (2) 

u(t ),  u(t  - 1),  •  *  • ,  u(t  -  m)\ 

and  expressing  the  neural  network  model  for  the 
unknown  system  (1)  as 

y(t  + 1)  =  f[y(t),  y(t  - 1),  ■■■,>(?  -  «), 
u(t),u(t  -  -  m)\ 

where  y(t  +  I)  is  the  output  of  the  neural  network  and  f 
is  the  estimate  of  f  The  backpropagation  algorithm 
guarantees  that 

^(Z  +  l)- j)(/  +  l)  =  8,5  « 1  (4) 

Let  the  cost  function  J  be  defined  as  follows: 
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J  =  -e2(t  +  \) 
2 

where 


(5) 


u(t  + 1)  =  u(t)  + 


r\e(t  + 1  )w2  [sec  h  2  q  +  bx )] 


dq_  (13) 
du 


e{t  + 1)  =  y(t  + 1)  -  y(t  + 1)  (6) 

The  control  signal  u(t)  should  be  selected  to  minimise 
the  cost  function  J.  Using  the  neural  network  model,  (3) 
can  be  rewritten  to  give 

y(t  + 1)  =  w2[tanh(wxq  +  bx )]  +  b2  (7) 

where  wh  w2,  b}  and  b2  are  the  weights  and  bias  matrices 
of  the  neural  network  respectively  [13].  To  minimise  J , 
the  input  u(t)  is  calculated  recursively  using  the  gradient 
descent  rule 


u(t + 1) = w(0  -  n~~  (8) 


where  rj  >0  is  the  learning  rate.  It  can  be  observed  that 
the  controller  relies  on  the  approximation  made  by  the 
neural  network.  Therefore  it  is  necessary  that  the  neural 
network  output  y(t  + 1)  approaches  the  real  system 
output  (heat  exchanger)  y(t  + 1)  asymptotically.  This  can 
be  realised  by  keeping  the  neural  network  training  online. 
Differentiating  (5)  with  respect  to  u(t ),  it  can  be  shown 
that 


du(t )  du(t ) 


(9) 


where  dy(t +  \)!du{t)  is  known  as  the  gradient  of  the 
neural  network  model  with  respect  to  u(t).  Substituting 
(9)  into  (8),  we  have 


u{t  + 1)  =  u{t)  +  r/e(t  + 1) 


Qp('  +  1) 

dll{t) 


(10) 


Eq.  (13)  can  now  be  used  in  the  software  program  for 
real-time  control. 

3.  Input-Output  Linearisation 

In  this  section,  we  discuss  feedback  lienarisation  of  the 
neural  network  model  via  suitable  transformation  based 
on  the  input-output  data  which  are  accessible  for 
measurements. 

Many  advanced  controllers  make  use  of  rigorous 
models  to  overcome  the  difficulties  of  controlling 
nonlinear  processes.  However,  it  is  usually  a  very 
difficult  and  time  consuming  task  to  rigorously  model  the 
process  over  the  whole  operating  range.  Because  of  this 
fact,  many  linear  controllers  are  used  for  controlling 
nonlinear  processes  in  the  industries.  The  local  dynamics 
of  the  nonlinear  system  varies  significantly  depending  on 
the  operating  range.  When  the  linear  controllers  are 
applied  to  a  nonlinear  system,  the  linear  controllers  are 
conservatively  tuned  to  ensure  the  stability  in  the  whole 
operation  range.  Around  the  operation  range  with  slower 
dynamics,  the  control  performance  can  deteriorate 
significantly.  Hence,  these  linear  techniques  have 
inherent  limitations  in  controlling  nonlinear  systems. 

It  is  expected  that  if  the  process  to  be  controlled  by  the 
linear  controller  can  be  linearised,  the  closed-loop 
performance  will  improve.  The  desired  transformation 
for  linearisation  is  realised  with  a  multilayer  neural 
network  which  is  trained  using  the  input-output  data. 

We  consider  the  nonlinear  neural  network  plant  model 
[NNP]  and  the  linearising  transformation  [NNT]  as  shown 
in  Figure  4  to  feedback  linearise  the  neural  network 
model.  The  control  input  u  is  the  flow  rate.  We  train  the 
network  NNT  based  on  the  error  between  the  output 
y(t+l)  which  is  in  the  controllable  canonical  form  [15] 
and  the  output  of  the  neural  network  model  y(t  + 1) . 


The  gradient  is  analytically  evaluated  using  the  known 
neural  network  structure  (7)  as  follows: 

+  =  W2 [sec/;2 (w, q  +  b,  )]w,  (11) 

du(t)  du 

where 

^  =  [0,0, -0,1,0,-, Of  (12) 

du 

is  the  derivative  of  the  input  vector  q  with  respect  to  u(t). 
Finally  (10)  can  be  written  as 


e(t+ 1) 


Figure  4:  Input-output  Linearisation  Structure  1 
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The  linearisation  problem  is  now  to  train  the  neural 
network  model  input  u(f)  based  on  the  error  e(H-l).  The 
network  NNT  cannot  be  directly  trained  based  on  its 
output  error  using  Figure  4,  a  novel  neural  network 
structure  given  in  Figure  5  is  used.  A  Theorem  in  [16]  is 
derived  to  basically  show  the  equivalence  of  Figure  4  and 
Figure  5.. 


Figure  5:  Input-output  Linearisation  Structure  2 
4.  Neural  Network  Implementation 

Now  we  configure  the  neural  network  structure  for 
modelling  the  heat  exchanger  plant  which  is  of  the  form 

y(* + 0  =  - 1),  «i  (0,  «i  0  - i), 

«2(0,«2  ('-!)] 

where  [u}(t),u2(t),y(t)\  represents  the  input-output  terms 
of  the  multi-input  single  output  (MISO)  heater  exchanger 
plant  at  time  t .  The  function  / :  R6  R  .  In  this  neural 
network  model,  the  output  of  the  plant  at  time  t+ 1 
depends  both  on  its  past  values  of  output  and  input. 

For  identification  of  the  heat  exchanger,  a  neural 
network  model  N3  68j  is  constructed,  where  the 
superscript  3  represents  the  number  of  layers,  the 
subscripts  6,8,1  represent  the  number  of  input  nodes  in 
the  input  layer,  number  of  processing  elements  in  the 
hidden  layer  and  number  of  units  in  the  output  layer 
respectively.  The  output  is  the  outlet  temperature  of  the 
heat  exchanger  at  time  t+L  A  one  step  delayed  output 
and  input  signals  are  used  to  determine  the  future  output. 
On-line  data  of  the  heat  exchanger  is  used  for  neural 
network  modelling. 

Figure  6  shows  the  output  of  the  process  plant  and  the 
model  during  training  phase,  where  the  identification 
procedure  was  carried  out  for  20,000  iterations  using 
random  steps.  The  input  and  output  data  is  normalised. 
A  step  size  of  0.1,  momentum  factor  of  0.8,  learning  rate 


of  0.1,  0.01  and  0.001  were  used  for  5,000,  10,000  and 
20,000  iterations  respectively. 


Figure  7  shows  the  output  of  the  plant  and  the  model 
during  the  testing  phase.  The  data  used  for  testing  the 
neural  network  model  was  different  from  that  used  during 
the  training  phase. 


I 

I 


Figure  7:  Testing  Result 
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4.1.  Linearisation 

The  performance  of  the  input-output  linearisation 
scheme  on  the  heat  exchanger  model  is  presented  here. 
To  accomplish  input-output  linearisation  of  the  neural 
model  NNP  the  network  NNT  is  trained  using  the  strucutre 
shown  in  Figure  5.  For  the  network  NNT  we  have  used 
N34,2,2  three  layered  neural  network  with  four  input  nodes 
in  the  input  layer,  2  nodes  in  the  hidden  layer  and  2 
output  nodes  in  the  output  layer.  The  input  vector  v(t)  to 
NNT  comprises  of  the  output  temperature  of  the  neural 
model  NNP  and  its  delayed  values.  The  output  that  has  to 
be  tracked  by  NNT  is  the  flow  rate  u(t)  and  its  delayed 
value.  A  random  flow  rate  u(t)  and  its  delayed  value  u{t- 
1)  uniformly  distributed  over  the  dynamic  input  range  for 
the  plant  emulator  NNP  was  used  to  drive  the  structure 
shown  in  Figure  5. 


322 


The  error  eu(t)  =  u(t)-u(t)  summed  over  different 
training  steps  were  used  as  a  performance  index  and 
adjustment  of  the  weights  of  NNT  was  performed  at  the 
end  of  each  sequence  to  decrease  the  error.  Different 
variable  step  sizes  were  used  to  adjust  the  weights  of  the 
network  NNT.  After  sufficiently  training  the  network 
NNt  to  the  desired  accuracy  using  Figure  5  then  the  over 
all  system  (Figure  4)  can  be  assumed  to  have  been  trained 
to  track  the  linear  system  which  is  in  the  controllable 
canonical  form. 

5.  Implementation  of  PI  Controller 

In  the  previous  section,  it  was  demonstrated  that  a 
nolinear  system  can  be  identified  and  feedback  linearised 
using  neural  networks  from  its  input-output  data.  Now 
that  the  characteristics  of  the  linearised  neural  network 
model  is  known,  we  proceed  to  control  the  linearised 
neural  network  model  using  a  PI  controller.  Figure  8 
Illustrates  the  controller  configuration  used  to  control  the 
linearised  system. 


Figure  8:  PI  Controller  &  the  Linearised  System 

The  PI  controller  tuning  parameters  was  properly  set 
based  on  the  conventional  control  theory,  the  IMC  based 
PID  tuning  rule.  The  set  point  was  set  to  52.32, 
proportional  gain  Kc  =  45,  and  the  integral  time  Tj  =0.4 
min.  Figure  9  illustrates  the  time  domain  response  of  the 
linearised  system  around  the  equilibrium  point  with  the 
PI  controller  in  action. 

6.  Conclusion 

This  paper  has  demonstrated  that  a  non-linear  pilot 
plant  fitted  with  Fieldbus  equipment  can  be  identified  and 
feedback  linearised  using  neural  networks  from  its  input- 
output  data.  Performance  evaluation  of  the  linearising 
network  from  the  closed  loop  control  point  of  view  is 
done.  The  proposed  linearising  scheme  attempts  to 
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Figure  9:  Response  of  Linearised  System  with  PI 
Control 

simplify,  in  effect,  the  control  of  a  non-linear  dynamic 
system  by  first  linearising  it  and  then  applying  a  fixed 
linear  controller,  such  as,  a  PI  controller  to  it.  The  effect 
of  linearising  network  indirectly  measured  by  the  ease  of 
control  achieved  through  an  order  of  magnitude 
improvement  in  the  proportional  and  integral  gain 
settings  achievable  is  shown  in  the  paper. 
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Abstract 

Recent  advances  in  computing  technology  along  with 
improvements  in  signal  processing  techniques  provide 
exciting  opportunities  for  the  development  of  highly 
functional  rehabilitative  aids  for  the  disabled.  This 
paper  reports  the  use  of  an  artificial  neural  network 
( Jordan  *s  Sequential  network)  for  use  in  a  myoelectric 
prosthetic  arm  to  facilitate  fault-tolerant  pattern 
recognition  and  simple  sequential  tasks.  Initial 
experiments  with  this  technique  have  yielded  promising 
results  and  indicate  that  the  technique  has  a  very  good 
potential  for  an  effective  rehabilitation  of  the 
functionally  disabled  in  terms  of  their  employability , 
independence,  and  self-esteem. 


1.  Introduction 

New  computing  paradigms  (such  as  neural  networks  & 
fuzzy  logic)  along  with  availability  of  low-cost  digital 
signal  processing  chips  provide  exciting  opportunities  for 
the  development  of  highly  functional  rehabilitative  aids 
for  the  disabled.  In  this  context,  this  paper  reports  the 
use  of  Jordan's  Sequential  Neural  Network  [1]  for  use  in 
a  myoelectric  prosthetic  arm  to  facilitate  fault-tolerant 
pattern  recognition  and  multifunctional  activations. 

Myoelectric  prostheses  utilise  the  myoelectric  signals 
(MES)  which  originates  within  the  remnant  muscles  of 
the  limb  stump  for  control  purposes.  The  MES  being  a 
very  low  magnitude,  low  frequency  stochastic  signal  (<  5 
mV)  requires  statistical  means  for  effective 
characterisation  and  processing  for  control  purposes. 
Typically  time  domain  parameters  (mean,  variance  etc.) 

[2]  and  frequency  domain  parameters  (mean  frequency, 
median  frequency  etc.)  have  been  employed  to  relate  the 
MES  to  the  movement  parameters  such  as  force  and 
velocity  of  contraction  of  the  respective  source  muscle(s) 

[3] .  The  control  of  a  powered  prosthetic  limb  using  a 
MES  would  therefore  have  to  rely  on  the  variation  of 
such  parameters  of  the  signal  with  the  contraction  level 
(force)  of  the  remnant  (source)  muscles. 
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Figure  1.  Illustration  of  the  MES  time  domain 
waveform  along  with  its  power  spectrum. 

Previous  control  techniques  utilised  these  parameters 
to  control  single  functions  (such  as  grasping,  turning  or 
releasing  etc.)  which  very  soon  exhausts  the  amputee  in 
the  execution  of  even  a  simple  manipulation  task.  With 
the  advancement  of  new  processing  paradigms  such  as 
artificial  neural  networks  and  fuzzy  logic  and  the 
availability  of  low-cost  signal  processing  chips,  it  is  now 
possible  to  develop  functionally  dexterous  and  light¬ 
weight  prostheses.  Specifically  it  is  reported  here  that  the 
ability  of  the  Jordan’s  network  to  learn  and  faithfully 
recall  sequences  of  signal  patterns  indicates  its 
usefulness  for  the  development  of  a  multifunctional 
prosthetic  limb  with  minimal  control  parameters. 

2.  Methodology 

In  the  myoelectric  control  scheme  amputation  is 
considered  as  removal  of  the  output  portion  of  the 
musculoskeletal  system,  and  of  associated  feedback 
pathways,  leaving  a  muscle  remnant(s)  which  is  still 
capable  of  contracting,  but  has  no  remaining  mechanical 
function.  For  the  control  purposes,  the  MES  is  obtained 
from  these  remnant  muscles,  using  suitable  surface 
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electrodes ,  in  the  portion  of  the  limb  superior  to  the 
point  of  amputation. 

For  the  case  of  a  ‘lower  arm  amputee’,  the  MES  could 
be  obtained  from  the  six  muscles  spanning  the  elbow 
joint.  However,  for  higher  level  amputees  the  number  of 
input  signals  available  would  reduce  dramatically.  This 
necessitates  the  use  of  a  new  technique  to  maximise  the 
movements  effected  for  any  given  activation  and  indeed 
the  current  technique  is  indicated  to  provide  such  a 
capability.  The  figure  below  illustrates  the  basic  aspects 
of  the  myoelectric  control  scheme: 


Figure  2.  A  basic  block  diagram  of  the  myoelectric 
control  scheme. 

The  basic  processing  methodology  is  illustrated  below 
(Figure  3).  The  MES  as  acquired  from  surface  electrodes 
has  a  very  low  amplitude,  typically,  below  5  mV.  This 
necessitates  the  use  of  high-gain  differential 
amplification  with  a  high  common  mode  rejection  ratio 
and  a  high  input  impedance.  The  MES  was  obtained  from 
four  healthy  subjects,  using  Ag/AgCl  electrodes,  from  the 
upper  arm  region  over  the  biceps  brachii.  Following 
acquisition  the  signals  were  amplified  with  a  bio¬ 
instrumentation  amplifier  with  a  gain  of  1000  and  a 
CMRR  of  86  dB. 


Figure  3.  Schematic  diagram  of  the  acquisition  of  the 
MES  and  its  processing. 

Following  the  amplification  as  described  above, 
suitable  characteristic  features  (feature  vectors)  of  the 
MES,  were  needed  to  be  extracted  so  as  to  facilitate 


distinguishing  between  different  contraction  levels.  In 
this  case,  different  feature  vectors  would  enable  effecting 
different  motions  or  motion  sequences  in  the  prosthetic 
limb.  Since  the  environment  would  be  noise-ridden,  the 
decision  algorithm  should  ensure  minimum  control  error 
or  maximum  information  transfer  between  the  MES 
feature  vectors  and  the  effectors  (typically  motors  at  the 
joints  of  the  limb).  Accordingly,  the  Jordan’s  sequential 
network  was  employed  as  the  appropriate  decision 
making  module.  Its  inputs  would  be  the  different  sets  of 
feature  vectors  to  be  classified  and  correctly  detected  and 
its  outputs  would  be  the  appropriate  sets  of  control 
signals  to  the  prosthetic  limb’s  actuator  motors. 

To  facilitate  the  feature  extraction  process,  the  MES 
was  formulated  as  a  auto-regressive  (AR)  model  as 
follows: 

m 

yk  =  Z  +Wlt  k  =0,1,2,... 

K  i  =  1 

where  the  az  are  the  m  AR  coefficients  and  the  residual 
Wfc  being  a  white  noise  sequence  (totally  uncorrelated). 

Though  in  theory  the  value  of  m  is  infinite,  it  has  been 
found  that  a  value  of  m-4  is  sufficiently  accurate  for  the 
purpose  of  control  and  spectral  analyses  [4]. 

In  the  case  of  upper-limb  prostheses,  there  are  several 
functions  possible  viz.  elbow  extension  and  bending, 
wrist  pronation  and  supination,  grasping  and  ungrasping. 
Since  we  were  interested  in  exploring  the  ability  of  the 
proposed  control  technique  to  correctly  execute  learned 
trajectories  despite  noisy  input  features,  only  the  actions 
of  elbow  bending,  extending  and  wrist  pronation  and 
supination  were  allowed  while  the  hand  was  held 
permanently  clasped..  In  the  actual  implementation,  these 
different  'limb  functions'  are  to  be  activated  by  the  user's 
generating  a  different  trained  muscle  activating  pattern  at 
the  remnant  muscles  in  the  vicinity  of  the  electrode  site. 
The  latter  muscle  activation  patterns,  yielding  different 
sets  of  a i  need  not  be  related  to  the  desired  limb  function. 

The  amputee  or  the  paralysed  user  is  however  trained  to 
remember  the  correspondence  between  the  individual 
muscle  activation  pattern  (at  the  electrode's  vicinity)  and 
the  appropriate  sequence  of  motions. 

3.  Results  and  discussion 

For  this  project  the  prosthetic  limb  was  fixed  to  a 
suitable  platform  such  that  the  hand  was  capable  of 
executing  a  trajectory  on  a  2  dimensional  plane  held 
orthogonal  to  the  limb’s  central  axis.  The  acquisition 
and  processing  of  the  MES  to  yield  the  feature  vectors 


326 


and  the  sequential  network’s  control  signals  were  all 
performed  in  a  personal  computer  suitably  interfaced  to 
the  myoelectric  arm. 


Figure  4.  Schematic  diagram  of  the  set-up  employed 
in  testing  the  proposed  control  technique. 

All  the  four  subjects  were  then  requested  to  perform  a 
elbow  flexion  and  pronation  and  were  further  required  to 
maintain  a  similar  state  of  contraction  as  much  as 
possible  over  several  trials.  The  AR  coefficients  for  each 
of  the  MES  samples,  obtained  from  each  of  them  over 
four  trials,  were  then  extracted  for  further  processing. 
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Figure  5.  Illustration  of  the  AR  coefficients  obtained 
from  the  MES  of  a  single  subject  upon  four  different 
trials  during  flexion. 
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Figure  6.  Illustration  of  the  AR  coefficients  obtained 
during  pronation. 

The  figures  above  are  an  example  of  the  AR 
coefficients  obtained  from  the  MES  of  a  single  subject 
upon  four  different  trials  and  for  the  two  different 


functions  associated  with  the  biceps  muscle  (flexion  and 
pronation). 

Following  their  acquisition,  the  AR  coefficients  (the 
feature  vectors)  were  suitably  preprocessed  for  input  to 
the  decision  making  module,  namely,  the  Jordan’s 
sequential  neural  network  (illustrated  below  in  Figure  7) 
which  employs  the  back-propagation  learning  algorithm. 
For  each  individual  subject  (user),  the  network  was  made 
to  learn  the  association  between  the  pre-processed 
feature  vectors  and  a  series  of  control  signals  which 
when  input  to  the  prosthetic  limb’s  motors  would  yield 
different  sequences  of  motion  in  three-dimensional 
space.  Thus  when  employing  the  identified  parameters  or 
coefficients  to  discriminate  between  limb  functions,  a 
decision,  relating  an  identified  coefficients  vector  a  = 
[a],  a2>  .  an]  to  a  given  limb  function,  was  to  be  made 

by  the  neural  network  based  on  the  similarity  of  the 
feature  vector  to  the  learned  ones. 


Figure  7.  Illustration  of  the  Jordan’s  Sequential 
Network.  ‘1’  refers  to  the  State  Units  which  are  input 
with  AR  coefficients,  ‘2’  refers  to  the  Input  Units 
which  have  the  different  states  of  the  trajectory,  ‘3’ 
refers  to  the  feedback  of  the  past  output  and  ‘4’ 
refers  to  the  two  layers  of  Hidden  Units. 

The  figure  below  illustrates  the  close  correspondence 
between  one  of  the  desired  trajectories  required  of  the 
prosthetic  arm  and  that  actually  executed  by  it. 
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Figure  8.  Illustration  of  one  of  the  desired  and  the 
corresponding  trajectory  executed  by  the  prosthetic 
limb. 

Furthermore,  the  Jordan’s  sequential  network  exhibits 
the  property  of  limit  cycle  behaviour  in  that,  given  any 
starting,  it  is  capable  of  reverting  to  the  learned  trajectory 
which  closely  matches  with  the  given  starting  position. 
This  is  also  exhibited  in  the  case  of  any  faults  which  may 
occur  in  the  midst  of  the  execution  of  any  of  the  learned 
trajectories.  Indeed  for  the  given  trajectory  above,  the 
illustration  below  shows  the  two  cases  of  the  novel 
starting  position  and  a  fault  occurrence  suitably  amended 
to  yield  the  learned  trajectory. 


Time  in  secs. 


Figure  9.  Illustration  of  the  limit  cycle  behaviour  of 
the  sequential  network  -  Curve  ‘Actuall’  has  a 
different  starting  position  while  curve  ‘Actual2’  also 
has  a  different  starting  position  and  a  fault  in  between 
its  execution. 

The  average  prosthetic  arm  would  allow  the  user  only 
one  specific  motion  (example,  grasping  or  releasing) 
leading  to  early  exhaustion  in  the  execution  of  even  a 
simple  manipulative  task  [5].  The  addition  of  the 
sequential  network  allows  the  user  the  option  of 
programming  specified  movements  into  the  neural 
network  which  may  be  activated  by  generating  a  specific 
pa^-m  of  myoelectric  signal.  For  example,  the  user  may 
perform  all  movements  associated  with  opening  a 


microwave  oven  simply  by  generating  the  MES  signal 
pattern  usually  associated  adducting  the  upper  arm  hard 
into  the  side  of  the  body.  Furthermore,  the  use  of  the 
sequential  neural  network  allows  the  input  of  a  pre¬ 
determined  sequence  of  control  signals  to  a  combination 
of  one  or  several  motors  in  the  myoelectric  limb.  This 
technique  could  be  extended  to  the  case  of  partly 
paralysed  persons  to  activate  a  powered-braces  system  or 
a  set  of  appropriately  positioned  electrical  stimulation 
electrodes. 

4.  Conclusions 

It  is  imperative  that  the  functionally  disabled  people  be 
provided  with  prospects  of  securing,  retaining  places  and 
advancing  in  educational  and  training  institutions, 
employment  and  recreation  otherwise  denied  to  them  as 
equal  members  of  the  society  because  of  their 
disabilities.  The  proposed  scheme  intends  to  meet  this 
need  in  a  small  but  hopefully  effective  manner  in 
restoring  the  self-dependency  and  self-esteem  of  the 
disabled  people.  It  is  also  in  tune  with  the  need  for 
introducing  state  of  art  technology  into  the  current 
prosthetic  and  orthotic  techniques.  It  specifically 
promises  to  offset  the  currently  available  bulky  and 
physically  exhausting  prostheses  by  incorporating  state 
of  the  art  processors  and  techniques.  Initial  experiments 
have  yielded  promising  results  warranting  further 
investigation  into  the  real-time  processing  and  testing 
with  amputee  subjects. 

The  lack  of  sensory  feedback  to  the  central  nervous 
system  is  a  significant  limitation  to  the  new  technique  as 
well  as  the  currently  popular  prostheses.  Solutions  can  be 
arrived  at  by  employing  the  new  and  forthcoming  sensor 
technologies  [6][7],  and  the  increasing  amount  of 
knowledge  in  the  field  of  neurophysiology  and  signal 
processing  [8]. 
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Abstract 

The  local  control  for  a  multi-chain  robotic  system  formed 
by  tentacle  manipulators  grasping  a  common  object  with 
hard  point  contacts  is  proposed  to  solve  using  a  robust 
control  system. 

The  stability  and  robustness  of  a  class  of  the  fuzzy  logic 
control  (FLC)  are  investigated  and  a  robust  FLC  is 
proposed  with  uncertainties  of  the  load. 

The  two-level  hierarchical  control  is  adopted.  The  upper 
level  coordinator  gathers  all  the  necessary  information  to 
resolve  the  distribution  force.  Then,  the  lower-level  local 
control  problem  is  treated  as  an  open-chain  redundant 
manipulator  control  problem. 

The  fuzzy  rules  are  established.  Simulation  results  are 
presented  and  discussed. 


1.  Introduction 

The  coordinated  operation  of  two  or  more  robots  is  a  field 
of  research  activity  which  opens  up  new  fields  of 
applications  in  assembly  automation  and  flexible 
manufacturing  systems. 

A  tentacle  manipulator  is  a  manipulator  with  a  great 
flexibility,  with  a  distributed  mass  and  torque  that  can 
take  any  arbitrary  shape.  Technologically,  such  systems 
can  be  obtained  by  using  a  cellular  structure  for  each 
element  of  the  arm. 

A  multiple  chain  tentacle  robotic  system  is  more 
complicated.  The  first  problem  is  the  global  coordination 
problem  that  involves  coordination  of  several  tentacles  in 
order  to  assure  a  desired  trajectory  of  a  load.  The  second 
problem  is  the  local  control  problem,  which  involves  the 
control  of  the  individual  elements  of  the  arm  to  achieve 
the  desired  position.  The  force  distribution  is  a 
subproblem  in  which  the  motion  is  completely  specified 
and  the  internal  forces/torques  to  effect  this  motion  are  to 
be  determined.  To  resolve  this  large-scale  control 
problem,  a  two-level  hierarchical  control  scheme[4]  is 


used.  The  upper-level  system  collects  all  the  necessary 
information  and  solves  the  interchain  coordination 
problem,  the  force  distribution  problem.  Then,  the 
problem  is  decoupled  into  j  lower-level  subsystems,  for 
every  arm.  The  local  fuzzy  controllers  are  assigned  to 
solve  the  local  control.  In  order  to  obtain  the  fuzzy  control 
an  approximate  model  of  the  tentacle  arm  is  used. 

The  stability  and  robustness  of  the  FLC  are  investigated 
and  a  robust  FLC  is  proposed  with  respect  to  the 
robustness  of  the  load  uncertainties.  The  control  strategy 
is  based  on  the  Direct  Sliding  Mode  Control  (DSMC) 
which  controls  the  trajectory  towards  the  switching  line 
and  then  the  motion  is  forced  directly  to  the  origin,  on  the 
switching  line.  A  fuzzy  controller  is  proposed  and  the 
fuzzy  rules  are  established  by  using  the  DSMC 
procedures.  Efficiently  considerations  of  the  method  are 
discussed.  Numerical  simulations  for  several  control 
problems  are  presented. 

2.  Cooperative  Robots  Model 

A  multiple-chain  tentacle  robotics  system  is  presented  in 
Figure  1. 


With  the  chains  of  the  system  forming  closed-kinematics 
loops,  the  responses  of  individual  chains  are  tightly 
coupled  with  one  another  through  the  reference  member 
(object  or  load).  The  complexity  of  the  problem  is 
considerable  increased  by  the  presence  of  the  tentacle 
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manipulators,  ( TMj ,  j  =  1. . .  &) ,  the  systems  with, 

theoretically,  a  great  mobility,  which  can  take  any 
position  and  orientation  in  space  [1,2].  In  the  Figure  Lb  is 
presented  a  plane  model,  a  simplified  structure  which  can 
take  any  shape  in  XOZ  plane.  The  dynamic  equations  for 
each  chain  of  the  system  are  [1]: 

TM^:pjA^  fjsin^'  -  +  J&'+ 

s 

+  pAg\co&qJ ds'+r^  =T\j  -  1,2  2.1 

LJ  .  .  ]j  y  .v  .LJ 

J  T^ds  =  FJ  J  [ -sin^lds+F^  |  cosq^ds,j  =  1,2  2.2 

0  •*  o v  y  *  0  * 

where  we  assume  that  each  manipulator  (TMJ)  has  a 
uniform  distributed  mass,  with  a  linear  density  and  a 
section  /f.  We  denote  by  s  the  spatial  variable  upon  the 

length  of  the  arm,  s  E  [0,Lj]  .  We  also  use  the  notations: 
cj  -  Lagrange  generalized  coordinate  for  TMJ  (  the 
absolute  angle),  c{=  cj  (s,t),  s  £  [0,Lj]  ,  t  £  [0}tj],  cj 
(s\t),  s'  E  [0,s]  ,  t  E  [0, tfj,  V  =  V  (s,t)  -  the  distributed 
torque  over  the  arm;  tj=tj  (s,t)-  the  distributed  moment 
to  give  the  desired  motion  specified  on  the  reference 
member.  All  these  sizes  are  expressed  in  the  coordinate 
frame  of  the  arm  TMP  .  The  k  integral  equations  are 
tightly  coupled  through  the  terms  rJ  ,  F^l  ,  where  all 

of  these  terms  determine  the  desired  motion.  We  propose 
a  two-level  hierarchical  control  scheme  [4,5]  for  this 
multiple-chain  robotic  system.  The  control  strategy  is  to 
decouple  the  system  into  k  lower-level  subsystems  that 
are  coordinated  at  the  upper  level.  The  function  of  the 
upper-level  coordinator  is  to  gather  all  the  necessary 
information  so  as  to  formulate  the  corresponding  force 
distribution  problem  and  then  to  solve  this  constrained, 
optimization  problem  such  that  optimal  solutions  for  the 

contact  forces  FJ  are  generated.  These  optimal  contact 
forces  are  then  the  set-points  for  the  lower-level 
subsystems.  We  consider  the  hard  point  contact  with 
friction  and  the  force  balance  equations  on  the  object  may 

be  written  as:  F°  =  ^  °DjFJ  2.3 

j 

where:  F®-  the  resultant  force  vector  applied  to  object 
expressed  in  the  inertial  coordinate  frame  (0), 

^Dj-the  partial  spatial  transform  from  the 

coordinate  frame  for  the  arm  TMJ  to  the  inertial 
coordinate  frame  (0). 

The  object  dynamic  equations  are  obtained  by  the  form 
M0r  =  GF°  2.4 

where  M0  is  inertial  matrix  of  the  object  and  r  defines  the 
object  coordinate  vector  r~(x,z,  cpf 

2.5 


and  r(t)  represents  the  desired  trajectory  of  the  motion. 

The  inequality  constraints  which  include  the  friction 
constraints  and  the  maximum  force  constraints  may 

be  associated  to  (2.3),  AjFJ  <B  2.6 

where  AJ  is  a  coefficient  matrix  of  inequality  constraints 
and  B  is  a  boundary-value  vector  of  inequality 

constraints.  The  problem  of  the  contact  forces  FJ  can  be 
treated  as  an  optimal  control  problem  if  we  associate  to 
the  relations  (2.3)  -  (2.6)  an  optimal  index 

'¥  =  Y,CJFi  2.7 

j 

This  problem  is  solved  in  several  papers  [4,5,6]  by  the 
general  methods  of  the  optimization  or  by  the  specific 
procedures  [7].  After  all  of  the  contact  forces  Fj  are 
determinate,  the  dynamics  of  each  arm  TM*  are 
decoupled.  Now,  the  equations  (2.1),  (2.2)  can  be 
interpreted  as  same  decoupled  equations  with  a  given 
tJ(s ),  s  e  [0,Lj]  acting  on  the  tip  of  the  arm. 

3.  Approximate  model 

A  discrete  and  simplified  model  of  (2.1),  (2.2)  can  be 


obtained  by  using  a  spatial  discretization. 

S1  >  S2  , ...  sN  ;  sj  -  sj.j  =  A  ;  ■  3.1 

Iqj  (Sj )  -  q)  (sk  )  |  <  e  ;  i,  k  =  1, 2, ...  n)  3.2 

where  e,A  are  constants  and  e  is  sufficiently  small. 

We  denote  sr=iA,  Lj=frA,  qj(Si)=qij, 

Tj(s)=TiJ  ,TJ(s)  =  Tj  3.3 

and  considering  the  arm  as  a  lightweight  arm,  from  (2.1), 
(2.2)  it  results  [2]: 

MJqJ  +CJqJ  +  DJ[qJ^FJ  =  TJ  3.4 

where  MJ,  CJ  are  (nJxnJ)  contact  diagonal  matrixes,  D  is 
(nJx2)  nonlinear  matrix  [2,13] 

FJ  =col(FxJ,F/)  3.5 

qJ  =col[q(...qJnJ)  3.6 

T  =  coI{txj...T’j )  3.7 

In  the  equation  (3.4),  FJ  assures  the  load  transfer  on  the 
trajectory.  The  uncertainty  of  the  load  m  defines  an 
uncertainty  of  the  force  FJ. 

We  assume  that  | Fw  —  FJ  |.  <  p /  =  1,2  3.8 


Where  FMJ  is  an  estimation  of  the  upper  bound  of  the 
force. 

4.  Fuzzy  control 

The  control  problem  asks  for  determining  the 
manipulatable  torques  (control  variables)  T1!  such  that  the 
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trajectoiy  of  the  overall  system  (object  and  manipulators) 
will  correspond  as  closely  as  possible  to  the  desired 
behavior.  In  order  to  obtain  the  control  law  for  a 
prescribed  motion,  we  shall  use  the  closed-loop  control 
system  from  Figure  2. 

Let  qdJ  the  desired  parameters  of  the  trajectory,  FdJ  the 
desired  force  applied  to  the  j  -  contact  point  of  the  object, 
and  qJ,  FJ  the  same  sizes  measured  on  the  real  system.  For 
a  bounded  smooth  trajectory,  a  tracking  error  is: 
e  =  q~qd  4.1 


Figure  2 

The  control  system  contains  two  parts:  the  first 
component  is  a  conventional  controller  which  implement 
a  classic  strategy  of  the  motion  control  based  on  the 
Lyapunov  stability  and  the  second  is  a  FLC. 

A  frizzy  control  is  proposed  by  using  the  control  law  in 
the  neighborhood  of  the  switching  line  (Table  1  and 
Figure  3) 

as  a  variable  structure  controller. 

The  physical  meaning  of  the  rules  is  as  follows:  the  output 
is  zero  near  the  switching  line  (s),  the  output  is  negative 
above  the  switching  line,  the  output  is  positive  below  the 
diagonal  line,  the  magnitude  of  the  output  tends  to 
increase  in  accordance  with  magnitude  of  the  distance 


between  the  switching  line  and  the  state  {e  J  ,e  J  ) . 

We  consider  that  all  input/output  fuzzy  sets  are  assumed 
to  be  designed  on  the  normalized  space: 


ei,N 9TiN  e[  l,l],z  —  1,2. . .nJ 


4.2 


eJ\eJ 

NBE 

NSE 

ZRE 

PSE 

PBE 

PBDE 

ZRC 

NBC 

NBC 

=NBC 

NBC 

PSDE 

PBC 

ZRC 

NSC 

NSC 

NBC 

ZRDE 

PBC 

PSC 

ZRC 

NSC 

NBC 

NSDE 

PBC 

PSC 

PSC 

ZRC 

NBC 

NBDE 

PBC 

PBC 

PBC 

|PBC 

ZRC 

Table  1 


and  the  input/output  gains  G f  =  and  Gc 

serve  as  scale  factors  between  the  normalized  space  and 
the  corresponding  real  space  of  the  process 

4.3 

Gief 


eN,i  “  &eie 


-N  j 


Tt,‘GaT,J 


4.4 

4.5 


If  we  consider  that  the  switching  line  s  in  the  space  of 
normalized  values  is  defined  by  the  diagonal: 

4,/  +  eN,i  =  0  4.6 


it  corresponds  to  the  following  diagonal  line  in  the  real 
error  phase  plane  [15], 


J  J  ,  J  J  . 
si  =ei  +aiei  =0> 

e  J  +  ue  J  -  0 


4.7 

4.8 


where  a=diag  (  ah  a* . . .  anj  ) 

The  memberships  of  the  input/output  variables  are 
represented  in  Figure  4,  where  NB,NS,ZR,PS,PB  define 
the  linguistic  variables:  NEGATIVE  BIG,  NEGATIVE 
SMALL,  ZERO,  POSIYIVE  SMALL,  POSITIVE  BIR, 
respectively. 


1STB 


NS 


-1 


ZR 


PS 


PB 


Figure  4 

Theorem  L 

The  closed-loop  system  of  Figure  2  is  stable  if  the  control 
is  defined  by 

TJ=-KJs  +  HJ(e,qd,FM)  +  uJF  4.9 

where  KJ  is  a  (nJxnJ)  symmetric  positive  definite  matrix, 
which  satisfies  condition 

KJ  —  MJ<jJ  +  CJ  positive  definite  matrix  4.10 

and  uF  is  the  output  vector  of  the  fuzzy  controller 

4  =-*£sgnj/  4.11 

kF,  H{e,qd,F)-HJ{e,qd,FM^  4.12 

Proof.  See  Appendix  1. 

The  Theorem  1  determines  the  conditions  which  assure 
the  motion  control  in  the  neighborhood  of  the  switching 
line.  In  order  to  accelerate  the  motion  on  the  switching 
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line,  we  can  use  the  DSMC  (Direct  Sliding  Mode 
Control).  The  DSMC  was  presented  in  [14]  and  it 
establishes  conditions  which  force  the  trajectory  along  the 
switching  line,  directly  toward  the  origin  (Figure  5). 


Figure  5 

In  the  first  part  of  the  motion  defined  by  Theorem  1 ,  we 

considered  that  the  switching  line  has  the  slope  -  The 

DSMC  can  require  a  new  switching  line, 

*  .J  J  J  n 
si  =ei  -Pi  ei  =° 

Proposition 

The  DSMC  control  is  assured  if  the  coefficient  kj  of  the 
controller  verifies  the  condition, 


Trajectory 


Figure  7 

These  two  manipulators  form  a  closed-chain  robotic 
system  by  a  common  object  which  is  manipulated.  An 
approximate  model  (3.4)  with  A=0.06  m  and  nJ=7  is  used 
(J=T,2). 

Also,  the  length  and  the  mass  of  the  object  are  0.2  m  and 
1  kg,  respectively. 

The  initial  position  of  the  arms  expressed  in  the  inertial 


TMJ 

JL^O) 

q2J(0) 

q3J(0) 

qV(0) 

q5J(0) 

q6J(0) 

q7(0) 

4.13 

TM' 

n/6 

7l/3 

7tc/12 

2tc/3 

7i/15 

15tc/8 

0 

TM5 

5n/6 

4tt/5 

4tt/5 

3n/4 

3ti/4 

2tc/3 

7T 

Table  3 

The  desired  trajectory  of  the  terminal  points  is  defined  by: 


(c,  +  kt )2  +  ct  +k^)  +  h)j  4.14 

Proof.  See  Appendix  2. 

The  DSMC  control  can  introduce  a  new  fuzzy  output 
variable,  the  coefficients  k;.  In  the  first  part  of  the  motion 
ki  verifies  only  the  condition  (4.10).  When  the  trajectory 
penetrates  the  switching  line  s*i5  the  k*  are  increased  in 
order  to  verify  (4.14) 


In  Figure  6  and  Table  2  are  presented  the  memberships 
functions  for  kj  and  fuzzy  rules,  where  we  defined  as  v1*, 
v2[  the  average  values  which  verify  the  condition  (4.10) 
and  (4. 1 4),  respectively.  _ _ _ 


eJ\eJ  NBE 

NSE 
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PSE 
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PBDE 
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S 

ZRDE 
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S 

NSDE 
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S 

S 

B 

S 

NBDE 

s 

s  i 

S 

S 

B 

Table  2 


5.  Numerical  Results 

The  purpose  of  this  section  is  to  demonstrate  the 
effectiveness  of  the  method.  This  is  illustrated  by  solving 
a  fuzzy  control  problem  for  a  two  tentacle  manipulator 
system  which  operates  in  XOZ  plane  (Figure  7) 


x  =  x0  +  a  sinoctf 


z  -  z0  +  b  cosat 


With  X(f=0.2  m,  Zo=0.1  m,  a=0.3m,  b=0.1  m,  co=  0.8  rad/s. 
The  trajectory  lies  the  work  envelopes  of  the  both  arm  and 
does  not  go  through  any  workspace  singularities.  The 
maximum  force  constraints  are  defined  by: 


Fx  -  Fmax  =  50yV 
and  the  optimal  index 


Fz  -  Fmax  =  50N 
/  \  /  \ 


min 


,min 


are  used. 

The  uncertainty  domain  of  the  mass  is  defined  as 

0.8%  <m<  1.4% 


The  solution  of  the  desired  trajectory  for  the  elements  of 
the  arms  is  given  by  solving  the  nonlinear  differentia! 
equation  [13] 

iiW-[^(«)  Jj  (<?)]"'  ^(M') 

Where  w=(x,z)T  and  J*  {$)  is  the  Jacobian  matrix  of 
the  arms  (J=T,2). 

A  conventional  controller  with  kjJ=0.5  ( i  =1,..7,  j=l,2)  is 
determined. 

A  FLC  is  used  with  the  scale  factors  selected  as 
G{  =G'l  =10,  :  i=l,..7,  J=l,2. 


The  conventional  and  DSMC  procedures  are  used  and 
new  switching  line  is  computed.  The  condition  (4.14)  is 
verified  and  the  new  switching  line  is  defined  for 
PiJ=1.03:i=l,..7,J=l,2. 
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control  (FLC)  are  investigated  and  a  robust  FLC  is 
proposed  in  order  to  cancel  the  uncertainties  of  the  load. 


Figure  8 

In  Figure  8  is  presented  the  evolution  of  k5J  for  a  DSMC 
procedure  and  the  evolution  of  the  position  error  e5!  and 

the  position  error  rate  e\  are  presented  in  Figure  9. 

1 


0. 


-0. 


-1. 


Figure  9 

Figure  10  represents  the  trajectory  in  the  plane 

for  conventional  procedure  and  Figure  11  the  same 
trajectory  for  a  DSMC  procedure  for  a  new  switching 
line.  Figure  12  presents  the  final  trajectory.  We  can 
remark  the  error  during  the  1th  cycle  and  the  convergence 
to  the  desired  trajectory  during  the  2nd  cycle. 

6.  Conclusion 

The  two  level  hierarchical  control  procedure  is 
constructed  in  this  paper  to  solve  the  large-scale  control 
problem  of  a  chain  robotic  system  formed  by  tentacle 
manipulators  grasping  a  common  object.  The  upper-level 
the  inter-chain  coordination  problem,  the  force 
distribution.  Then,  the  problem  is  decoupled  into  j  lower- 
level  subsystems,  for  every  arm.  The  local  fuzzy 
controllers  are  assigned  to  solve  the  local  control.  The 
stability  and  robustness  of  a  class  of  the  fuzzy  logic 


Figure  11 


Figure  12 

A  DSMC  procedure  is  used  and  the  fuzzy  rules  are 
established.  The  simulation  problem  for  two  closed-chain 
tentacle  robotic  systems  has  also  been  studied. 
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APPENDIX  1 

We  consider  the  dynamic  model  of  the  manipulator 
defined  by  (3.9).  The  parameter  sJ  from  (4.7)  represents 
an  error  measure  of  the  closed-loop  control. 
s  =  e  +  ae  Al.l 

Where  we  cancelled  the  upperscript  j,  for  the  simplicity. 
From  (3.9)  and  (4.1)  we  obtain: 

+  4^) +  +  ?</)  +  D^e  +  q^F  =  T  A1.2 

Ms  -  Mas  +  Ma2e  +  Cs-  Cae  +  L)[e  +  qd  )f  =  T 

we  separate  the  linear  part  of  s,  it  results, 

Ms-{Ma-C)s  +  H(e,qd,F)  =  T  A1.3 

where  H  is  a  (nJxl)  nonlinear  vector  defined  on  the 
trajectory  parameters  qd,e,F. 


In  order  to  prove  the  stability  of  the  closed-loop  system, 
we  use  a  Liapunov  function  by  the  form 
1  T 

V  =  ~s  Ms  A1.4 

Differentiating  (A  1.4), 

V  =  stMs  A1.5 

If  we  substitute  (A1 .3)  we  obtain 

V=sTK[kr\kb-0s-  Af'fe,qd,F)+ AT1 7]  A1 .6 
By  using  the  control  law  of  T  ,  (A  1.6)  becomes 

V  -  sT{k  -  Mb + 0s + sT j p{e, qd ,  FM )  -  F^e,  qd ,  F) j  +  sTuF  A 1 . ' 7 
And  using  (4.11),  [15], 

V<-/(k - ^+Qs+J]s^e>qd,FM)-^e,qd,^[ -*„] A1  8 

If  we  introduce  the  condition  (4.12)  and  we  denote  by  Xmin 
the  minimum  eigenvalue  of  (K-Ma+C),  it  results, 

A1.9 

(Q.E.D) 


APPENDIX  2 

We  consider  the  dynamic  model  (A1.3)  in  the  area  around 
switching  line  (up^O), 

M-(Mt -Cjs+Ae,qd,F)^-Ks+Ae,qd,FM)  A2.1 
From  (Al.l)  and  using  the  properties  of  the  matrices 
M,C.K,S  (diagonal  matrices)  we  obtain, 

m,e,  +  (c,  +  k)e,  +  (-  mp]  +  cpl  +  kp)e,  + 


+(H(e,qd,F)-H(e,qd,FM))'=  0  A2.2 

but 

(H{e,qd,F)-H{e,qd,FM))i  =(AH),  =ff  A2'3 

We  denote  this  term  as 

(A H)i=hl(qd,e)ei  A2.4 

From  (A2.2),  (A2.4)  we  obtain  the  switching  line, 
elr 

-  =  -Pi  = - -(c,  +k,)e,  +c,.  +k,)e, 

ei  miei  1  v  7 

~h>(<ld,e)e]  A2.5 

This  equation  determines  the  slope  pi  if  the  following 
condition  is  verified  [14], 

(c,  +  kf  >  4(a,(-  m,o,  +  c,  +  k,)  +  h,)m,  A2.6 
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Abstract 

A  fuzzy  compensator  is  designed  and 
implemented  to  linearize  the  nonlinear  load 
characteristics  of  a  single  phase,  half  controlled 
ac  to  dc  converter  under  discontinuous  current 
conduction.  The  fuzzy  compensator  uses  the 
converter-firing  angle  oc  and  the  average  load 
current  as  input  universes  of  discourse.  Its 
output  is  a  binary  coded  digital  signal 
representing  the  compensated  firing  angle  xc. 
The  experimental  results  show  considerable 
improvement  of  the  motor  load  characteristics 
over  all  the  discontinuous  conduction  regions. 


1.  Introduction 

Because  of  the  nonlinear  load  characteristics  of  the 
converter  encountered  under  discontinuous 
conduction  states  of  armature  current  as  depicted  in 
figure  (1),  it  has  been  difficult  to  properly  adjust  the 
gain  of  the  armature  current  controller  to  meet 
different  load  conditions.  In  other  words,  when  the 
gain  of  the  controller  is  adjusted  to  be  optimum  for 
continuous  current  conduction,  at  discontinuous 
conduction  the  closed  loop  response  is  observed  to 
become  sluggish  due  to  parameter  variations  of  the 
controlled  drive.  Conversely,  when  the  controller 
gain  is  adjusted  so  that  it  is  optimum  for 
discontinuous  mode  of  conduction  at  a  certain 
operating  point,  the  closed  loop  response  will  tend 
to  become  unstable  under  continuous  current 
conduction.  To  overcome  this  difficulty,  two 
possibilities  are  conceivable;  either  using  a  control 
system  with  adaptive  gains  or  designing  a  nonlinear 
compensation  circuit  in  an  attempt  to  linearize  the 
converter  load  characteristics. 

Konishi  et  al.  [1]  tried  to  solve  this  problem  by 


Figure  (1) .  Load  characteristics  of  the  converter 

fixing  the  firing  angle  and  linearizing  the  load 
characteristics  in  case  of  discontinuous  conduction 
using  function  generators  in  the  feedforward  path  of 
the  controller.  This  was  a  trial  and  error  technique 
which  is  tedious  and  sensitive  to  the  operating 
conditions.  Kannel  et  al.[2]  proposed  a  solution 
which  depends  on  pre-calculation  of  the  output 
behavior  of  the  system,  the  current  wave  form,  on 
the  basis  of  the  next  switching  state,  assuming  that 
the  converter  act  as  an  ideal  switch.  Then,  the 
actual  value  of  the  system  is  compared  with  the  pre¬ 
calculated  value.  When  they  are  equal  the  correct 
converter  firing  angle  is  selected  and  released 
.Hasegawa  et  al.[3]  proposed  a  feedback 
compensator,  which  becomes  active  when  a 
discontinuous  current  mode  is  detected.  This 
compensator  functions  as  a  variable  feed  back  gain 
of  the  current  PI  controller.  Naitoh  et  al.  [4] 
proposed  a  first  order  model  reference  adaptive 
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control  (MRAC).  The  proposed  algorithm  was  very 
complex  and  time  consuming  as  it  is  based  mainly 
on  Popov’s  hyperstability  theorem.  Ohmae  et  al.  [5] 
obtained  a  fast  response  current  controller  by 
employing  a  nonlinear  compensation  sub-loop.  The 
nonlinearity  during  discontinuous  conduction  was 
compensated  by  incrementing  the  firing  angle  oc  by 
Aoc  to  maintain  the  same  converter  gain  of 
continuous  conduction.  The  disadvantages  of  this 
method  are: 

1.  It  is  difficult  to  design  a  nonlinear 
compensation  sub-loop  that  works  at  all  motor 
operating  points. 

2.  The  motor  armature  current  stability  is  highly 
sensitive  to  deviations  in  sub-loop  gain. 

3.  The  conventional  look-up  table  method  to 
determine  Ax  needs  large  memory  requirements. 

4.  Interpolation,  which  is  time  consuming,  is 
always  needed  when  using  look-up  tables. 

5.  A  dual  mode  logic  was  needed  to  carry  out  the 
nonlinear  compensation  under  the  discontinuous 
conduction  state  only  when  there  is  a  large 
deviation  from  the  reference  to  avoid  instability  due 
to  over  compensation  under  continuous  conduction. 
To  overcome  the  disadvantages  of  this  method,  a 
simulation  study  was  performed  in  an  attempt  to 
make  use  of  the  fuzzy  set  theory  [6].  The  fuzzy 
compensator  relies  on  the  same  concepts  of  load 
characteristic  linearization  presented  in  [5]. 

In  this  paper,  the  practical  implementation  of  a 
fuzzy  compensator  is  applied  on  a  separately 
excited  dc  motor  driven  by  a  half-controlled,  single 
phase  converter  bridge.  This  particular  choice  was 
favored  by  the  wide  spread  of  its  armature  current 
discontinuity  region.  In  section  IL  an  explanation  of 
the  fuzzy  compensation  algorithm  is  shown, 
assuming  some  background  of  the  fuzzy  set  theory. 
Section  III,  gives  a  brief  explanation  of  the  practical 
firing  circuit.  The  experimental  setup  presented  in 
section  IV  is  used  to  verify  the  effect  of  the  fuzzy 
logic  compensation  in  linearizing  the  converter  load 
characteristics.  Finally,  the  conclusions  are  drawn 
in  section  V. 


2-  Fuzzy  Linearization  Algorithm 

Since  fuzzy  compensation  is  well-suited  for 
nonlinear  systems,  a  fuzzy  algorithm  is  used  to 
generate  a  suitable  increment  Ax  to  the  firing  angle 
x  to  linearize  the  load  characteristics  of  the 
converter  over  the  discontinuous  region.  The  value 
of  Ax  depends  on  the  firing  angle  x  and  the 
average  armature  current  la.  The  target  of  the 
linearization  is  to  keep  the  motor  armature  voltage 
constant  under  discontinuous  conduction  mode  of 
operation. 

The  universe  of  discourse  of  both  the  armature 
current  Ia  and  firing  angle  x  cover  the 


discontinuous  conduction  regioa  Figure  (2)  shows 
the  membership  functions  of  the  fuzzy  variables  Ia, 
x  and  Ax.  A  50%  overlap  has  been  provided  to 
insure  that  no  more  than  four  rules  are  fired  for  any 
input  data  pair  ( x  and  Ia ). 

The  rule  base  is  developed  using  heuristic 
reasoning.  The  set  of  rules  for  fuzzy  compensation 
is  given  by  table  (1),  where  all  the  symbols  are 
defined  in  the  usual  logic  using  context  free 
linguistic  terms. 


a)  Firing  angle  (x) 


b)  Armature  current  Ia 


c)  Ax 

Figure  (2)  -  Membership  functions  for  Ax 
compensation 
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Table  (1)-  Rule  base  for  the  fuzzy  compensator  3.The  Firing  Circuit 

The  implementation  of  the  firing  circuit  requires  the 
coordinated  operation  of  the  following  modules 

1 .  A  Zero  Voltage  Crossing  Detector  (ZVCD). 

2.  A  Delay  Circuit. 

3 .  A  Pulse  Conditioning  Circuit. 

4.  A  Pulse  Distribution  Circuit . 

The  required  logic  is  simple  and  can  be  summarized 
as  follows : 

At  the  beginning  of  each  halfcvcle  of  the  supply 
voltage,  the  ZVCD  circuit  will  initiate  a  pulse  used 
as  a  reference  point  for  counting  the  delay.  The 
computer  sends  the  required  delay  xc  binary  coded 
Due  to  the  fact  that  this  is  a  real  time  application,  as  a  byte.  Two  cascaded  synchronous  4-bit  counters 

the  rule  base  was  implemented  in  the  software  count  the  required  delay.  The  count  is  triggered  by 

algorithm  as  a  matrix  outside  the  program  loop  the  ZVCD  signal  and  a  delayed  pulse  is  released  at 

rather  than  in  the  traditional  set  of  the  IF . THEN  the  end  of  the  count.  This  pulse  is  widened, 

rules.  The  size  of  this  matrix  depends  on  the  synchronized  with  the  clock  pulses  and  sliced  by 

partitioning  of  the  input  universes  of  discourse.  the  pulse  conditioning  circuit.  The  resulting  pulses 

This  size  has  no  bearing  on  the  run  time  since  only  are  amplified  and  distributed  to  the  appropriate 

the  fired  rules  are  read  from  the  matrix.  thyristor  by  the  pulse  distribution  circuit. 

The  algorithm  for  fuzzy  linearization  can  be  Figure  (3),  gives  a  complete  overview  of  the  firing 

summarized  as  follows:  circuit  showing  the  relationship  between  its 

1-  Calculate  the  interval  indices  I  and  J  for  oc  and  la  different  modules, 

respectively  to  identify7  two  rules  out  the  four 
fired  rules. 

1  =  lnt.  ( (x  -  20)  /  20 ) 

J  =  Int  .  ( la  /  2 ) 

2-  Calculate  the  degree  of  membership  of  x  and  la 
for  the  fuzzy  subsets  identified  by  I  and  J  . 

(x)  j  =  (  20 1  +  40  -  x  )  /  20 
(la)  j  =  (2J  +  2-  Ia)/2 

Then  evaluate  the  degree  of  membership  for  the 
remaining  two  rules  in  subsets  (1+1)  and  (J+l)  by 
the  complement  relations 
(«0  i+i  =  1  -  P  («)i 
( la )  w  =  1  -  p  (Ia)j 

3-  Calculate  the  degree  of  membership  contributed 
by  each  rule  using  the  MIN  operator. 

R  (x,y)  =  Min  (  p  (x)Xj  p  (Ia)y) 
where, 

x  =  Iand(I+l) 
and 

y  =  J  and  (J+l) 

4-  Calculate  the  crisp  output  value  of  Ax  using  the 
weighted  height  defuzzification  method  |7J. 

Figure  (3)  -  A  Block  diagram  of  the  firing  circuit 


Z  M-  R  (x,y)  *  Aoc  (x,y) 

ae  —  _ 

Z  u  R  (x.y) 

where  Ax  (x^  =  height  of  the  output  membership 
function  of  the  rule  (x,y). 
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Current  feed  back 
signal 


4.  Real  Time  Implementation 


brake 


Figure  (4)  -  Single  line  diagram  of  the  fuzzy 
compensated  dc  drive  system. 


The  experimental  set  up  is  illustrated  in  figure  (4). 
The  system  consists  of  a  2k W,  11.7  Amp,  1500 
r.p.m.  separately  excited  dc  motor  driven  by  a  hall- 
controlled  converter.  The  motor  is  loaded  using  an 
electromagnetic  brake.  The  converter  is  fired 
through  a  digital  phase  control  firing  circuit.  A 
personal  computer  provided  with  a  data  acquisition 
card  converts  a  signal  proportional  to  the  average 
value  of  the  motor  armature  current  Ia  and  computes 
-  using  fuzzy  compensation  algorithm  -  the  firing 
angle  offset  Ax.  This  latter  is  added  to  the  firing 
angle  x  to  form  the  compensated  firing  angle  xc . 
Figures  (5)  and  (6)  give  the  experimental  results  of 
the  implementation  of  the  fuzzy  compensator  at 
different  firing  angles. 

The  results  show  considerable  improvement  of  the 
motor  load  characteristics  over  all  the  discontinuous 
conduction  regions,  as  the  armature  voltage 
approaches  the  constant  value  observed  under 
continuous  conduction. 

Figure  (6)  shows  the  pronounced  effect  of  the  fuzzy 
compensation  in  the  reduction  of  the  motor  speed 
regulation  under  discontinuous  conductioa 
It  is  important  to  mention  that,  only  one  set  of  fuzzy 
rules  can  linearize  the  converter  load  characteristics 
for  different  operating  points. 


With  compensation  »  » 

Figure  (5) .  Effect  of  the  fuzzy  compensator  on 
the  armature  voltage  (Vd). 
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Therefore,  a  major  problem  for  discontinuous 
operation  is  solved,  that  is,  the  adaptation  of  the 
controller  parameters  for  any  variation  of  the 
operating  point.  Furthermore,  no  look-up  tables  are 
needed  any  more.  Interpolation  is  inherently 
embedded  within  the  fuzzy  membership  functions. 

It  is  intended,  in  the  future  work,  to  use  this 
algorithm  as  a  part  of  a  closed  loop  computer 
controlled  drive,  its  execution  time  for  one  program 
loop  -  containing  the  A/D  conversion  of  two 
variables  (speed  and  current),  the  application  of  a 
digital  PI  algorithm  twice  in  addition  to  the  fuzzy 
compensation  algorithm  -must  be  minimized.  The 
storage  of  the  rule  base  as  a  matrix  reduced  the  run 
time;  using  a  pentium  processor  133  MHz;  down  to 
0.24  ms  as  shown  in  figure  (7). 


without  compensation 


with  compensation 


Figure  (6) .  Effect  of  fuzzy  compensator  on 
motor  speed  (N). 


hp  stopped 


Figure  (7) .  The  experimental  run  time 


5.  Conclusions 


A  fuzzy  compensator  is  used  to  linearize  the  load 
characteristics  of  a  half-controlled  converter  drive  a 
separately  excited  dc  motor.  The  experimental 
results  clearly  indicate  a  successful  linearization  of 
the  converter  characteristics  for  different  values  of 
both  the  firing  angle  and  the  armature  current.  That 
is,  when  using  such  a  compensator  only  one 
controller  will  perform  adequately  over  both 
continuous  and  discontinuous  modes  of  operation. 
Further  more,  storing  the  fuzzy  rule  as  a  matrix 
reduces  greatly  the  execution  run  time. 
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Abstract 

The  purpose  of  this  document  is  to  compare  several 
controllers  for  the  same  desired  control  surface 
implemented  in  the  popular  HC11  micro-controller  using 
various  fuzzy  and  neural  network  architectures .  Several 
neural  network  architectures  were  developed  and 
optimized  with  a  help  of  SNNS  -  Stuttgart  Neural  Network 
Simulator.  The  microprocessor  code  for  all  cases  was 
obtained  using  the  ICC11  C-compiler. 

It  was  proven  in  the  case  of  neural  controller 
implemented  on  a  microprocessor  the  code  is  simpler,  much 
shorter ;  the  processing  time  is  comparable,  and  the  control 
surfaces  obtained  with  neural  controllers  are  far  superior. 
Control  surfaces  obtained  from  neural  controllers  also  do 
not  exhibit  the  roughness  of  fuzzy  controllers  that  can  lead  to 
unstable  or  raw  control 

The  only  drawback  of  neural  controllers  is  that  the 
design  process  is  more  complicated  than  that  of  fuzzy 
controllers.  However,  this  difficulty  can  be  easily  overcome 
with  proper  design  tools. 

1.  Introduction 

In  recent  years,  a  significant  amount  of  research  has 
been  devoted  in  the  development  of  fuzzy  controllers 
[2]  [3]  [4]  [7]  [8]  [  14]  [  1 5]  [  1 6] .  Fuzzy  controllers  are 
especially  useful  for  nonlinear  systems,  which  are 
difficult  to  describe  by  mathematical  model.  Fuzzy 
controllers  are  easy  to  implement.  Membership  functions 
and  fuzzy  rules  are  chosen  arbitrarily  and  therefore  fuzzy 
controllers  are  often  good  but  not  optimal.  Fuzzy 
controllers  can  be  significantly  improved  when  they  are 
tuned  with  neural  network  [5]or  genetic  algorithm  [6]. 

2.  Implementation  of  Fuzzy  Controllers  Using 
Microprocessors. 

Microprocessors  use  primarily  trapezoidal  membership 
functions.  In  order  to  store  the  function  only  four  bytes  are 
required  xl,  x2,  x3,  and  x4  (see  Fig.  1).  The  triangular 
membership  function  is  a  special  case  of  trapezoidal  where 
x2=x3. 


Fig.  1.  Representation  of  the  membership  function  in 
microprocessor. 

Programming  the  fuzzy  controller  is  relatively  simple.  It 
only  requires  the  description  of  the  rule  table.  For  all 
combinations  of  input  membership  functions,  a  given  output 
membership  function  must  be  assigned.  The  block  diagram 
for  Zadeh  type  [1]  controllers  is  shown  in  Fig.  2,  while 
required  modifications  for  the  Sugano-Tagagi  controller  are 
shown  in  Fig  3. 


Fig.  2.  Block  diagram  for  Zadeh  type  controller 
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Fig.  3.  In  Tagagi-Sugeno  controller  normalization  and  weighted 
sum  routines  replace  the  denazification  routine. 

All  fuzzy  controllers  were  implemented  on  Motorola's 
68HC711E9  microcontroller.  This  is  a  low  cost,  8-bit 
microprocessor.  The  on-board  features  of  the  HC711  are 
512  bytes  of  RAM  and  EEPROM  and  12K  bytes  of  UV 
erasable  EPROM.  The  processor  was  used  with  an  8 
MHz  crystal,  allowing  an  internal  clock  frequency  of  2 
MHz.  A  serial  port  along  with  an  RS232  interface 
enables  code  to  be  down  loaded  from  a  computer.  ICC1 1 
for  Windows  V5  was  the  compiler  used  to  program  the 
HC71 1E9.  It  is  capable  of  converting  C  or  assembly  code 
into  the  *.S19  file  which  is  downloaded  to  the 
microprocessor.  ICC  11  also  has  a  terminal  window  for 
interfacing  with  the  HC71 1. 


Table  1.  Error  comparison  for  various  type  of  fuzzy 
controllers 


Approach  used 

error 

SSE 

error 

MSE 

1 

Zadeh  fuzzy  controller  with 
trapezoidal  membership 
function 

(7*7  input  and  7  output) 

908.4 

0.945 

2 

Zadeh  fuzzy  controller  with 
triangular  membership 
function 

(7*7  input  and  7  output) 

644.4 

0.671 

3 

Zadeh  fuzzy  controller  with 
Gaussian  membership 
function 

(7*7  input  and  7  output) 

562.0 

0.585 

4 

Tagagi-Sugeno  fuzzy 
controller  with  trapezoidal 
membership  function 
(7*7  input ) 

296.5 

0.309 

5 

Tagagi-Sugeno  fuzzy 
controller  with  triangular 
membership  function 
(7*7  input ) 

210.8 

0.219 

6 

Tagagi-Sugeno  fuzzy 
controller  with  Gaussian 
membership  function 
(7*7  input ) 

294.2 

0.306 

For  all  controllers  shown  in  Fig.  5  to  10  the  same 
rule  table  was  used  and  only  the  shape  of  membership 
functions  are  different.  Also,  two  different  defuzzification 
processes  were  used.  The  first  three  examples  used  the 
Zadeh  [1]  approach  and  for  the  following  throe  examples, 
the  Tagagi-Sugeno  [2]  approach  was  implemented.  All 
controllers  were  designed  to  emulate  the  control  surface 
shown  in  Fig.  4.  Three  different  membership  functions  were 
used:  trapezoidal  (Fig.  5  and  8),  triangular  (Fig.  6  and  9), 
and  Gaussian  (Fig.  7  and  10).  Error  comparisons  are  shown 
in  Table  1.  In  that  respect  the  Tagagi-Sugeno  approach  is  far 
superior  over  the  Zadeh  one.  The  Tagagi-Sugeno  algorithm 
has  noticeably  large  memory  requirements.  The  smoothest 
results  are  obtained  for  the  Gaussian  type  membership 
functions.  Unfortunately  it  is  very  difficult  to  implement 
Gaussian  function  on  microprocessor.  Computation  of 
Gaussian  function  is  very  time  consuming  and  it  can  be  used 
only  for  slow  controllers  where  time  is  not  the  critical  issue. 


Fig.  4.  Required  control  surface 


Fig.  5.  Control  surface  obtained  with  trapezoidal 
membership  functions  and  Zadeh  approach. 
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Fig.  6.  Control  surface  obtained  with  triangular  membership 
functions  and  Zadeh  approach. 


Fig.  8.  Control  surface  obtained  with  trapezoidal 
membership  functions  and  Tagagi-Sugeno  approach 


o  o 


.Fig.  9.  Control  surface  obtained  with  triangular 
membership  functions  and  Tagagi-Sugeno  approach. 


o  o 


Fig.  10.  Control  surface  obtained  with  Gaussian 
membership  functions  and  Tagagi-Sugeno  approach. 


3.  Implementation  of  Neurocontrollers  Using 
Microprocessors. 

Neural  network  implementations  usually  require 
computations  of  sigmoidal  functions  [7][8][12][13] 

/(net)  =  — - \ - -  (l) 


1  +  exp(-  net) 
for  unipolar  neurons,  or 

/(net)  =  tanh  {net)  = 


-1 


(2) 


1  -  exp(-  2  net) 
for  bipolar  neurons.  This  function  is  relatively  difficult  to 
compute  and  such  implementation  on  a  microprocessor  is 
difficult.  If  the  Elliott  function  is  used: 

,,  ..  net 

/(net)  =  —  ,  ■  (3) 

1  +  \net\ 
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instead  of  the  sigmoidal,  then  the  computations  are 
relatively  simple  and  the  results  are  almost  as  good  as  in  the 
case  of  sigmoidal  function.  Fig.  11  shows  comparison  of 
sigmoidal  and  Elliot  functions. 


Fig.  1 1  Various  shapes  of  activation  functions 


Neural  controllers  were  also  implemented  on  Motorola’s 
68HC711E9  microcontroller  with  the  code  written  in  C 
language.  Block  diagram  of  neurocontroller  is  shown  in 
Fig.  12. 


A-D  converters 

. jjlX . 

Calculation  of  netl 
values  for  the  first  layer 


Elliott  function  calculation 

lUiliZI 

Calculation  of  net2 
values  for  the  second  layer 

7~33kT7 

Elliott  function  calculation 


|  weighted  average 

Fig.  12  Block  diagram  of  neurocontroller  implemented  on 
Motorola  68HC71 1E9 

During  a  design  process  of  a  fuzzy  controller,  the 
designer  must  know  what  output  should  be  expected  for 
given  input  values.  More  precisely,  what  the  output  value  is 
for  a  given  combination  of  input  membership  functions. 
The  exact  same  information  can  be  used  to  train  the  neural 
network.  This  of  course  must  be  done  be  specially  written 
program,  or  by  using  ready  software.  In  our  case  we  have 
used  Stuttgart  Neural  Network  Simulator  SNNS  [18).  This 


software  is  available  free  of  charge  form  http://www, 
informatik.uni-stuttgart.de/ipvr/bv/projekte/snns/snns.html 
and  it  may  run  in  both  platforms  UNIX  and  Windows.  Tire 
Elliott  activation  function  also  implemented  in  the  program 
SNNS  (Stuttgart  Neural  Network  Simulator).  Using  the 
dedicated  software  and  the  proper  architecture,  the  required 
weights  for  each  neurons  can  be  found.  First,  the  pattern 
file  has  then  loaded.  SNNS  then  trains  a  network  for  the 
desired  control  surface.  Many  network  configurations 
were  tested.  The  goal  was  to  keep  the  network  as  simple 
as  possible  while  achieving  the  lowest  possible  error. 
Different  types  of  networks  that  were  tested  include  a) 
multiple  neurons  in  one  hidden  layer,  b)  multiple  neurons 
in  cascade  and  c)  multiple  neurons  in  multiple  hidden 
layers.  RProp  was  the  training  algorithm  used  to  train  the 
networks.  It  proved  to  have  the  fastest  convergence  time 
and  provided  the  lowest  errors. 

For  the  given  control  surface  shown  in  Fig.  4,  several 
different  controllers  that  are  shown  in  Table  2  were 
implemented  in  the  Motorola  68HC711E9 
microprocessor.  In  order  to  simplify  the  computation  for 
the  neural  architectures  with  limited  microprocessor 
functions,  the  Elliot  function  (3)  was  used  instead  of  the 
traditional  sigmoidal  activation  function.  Several  neural 
network  architectures  were  developed  and  optimized  with 
a  help  of  SNNS  -  Stuttgart  Neural  Network  Simulator. 
The  microprocessor  code  for  all  cases  was  obtained  using 
the  ICC  11  C-coinpiler.  Of  course  more  optimal  code  can 
be  written  directly  in  assembly  language,  but  relative  ratio 
of  codes  and  processing  times  should  be  similar. 

Depending  on  the  complexity  of  neural  networks, 
various  levels  of  accuracy  were  obtained.  Fig.  13  shows 
implemented  neural  network  architectures  and  Fig.  14 
shows  obtained  control  surfaces  for  these  architects 


(b) 
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Fig.  13  Various  neural  network  architectures  developed 
for  the  required  control  surface  of  Fig.  4:  (a)  with  3  hiden 
neurons,  (b)  with  5  hidden  neurons,  and  (c)  with  6  hidden 
neurons  organized  in  one  hidden  layer. 


(c) 

Fig.  14  Control  surfaces  of  neuralcontroller:  (a)  with  3 
liiden  neurons,  (b)  with  5  hidden  neurons,  and  (c)  with  6 
hidden  neurons  organized  in  one  hidden  layer. 


Table  2.  Error  comparison  for  various  type  of  neural 
controllers 


Approach  used 

error 

SSE 

error 

MSE 

1 

Neural  network  with  3  neurons 
in  cascade  (Fig  13  (a)) 

0.5559 

0.000578 

2 

Neural  network  with  5  neurons 
in  cascade  (Fig  13  (b)) 

0.0895 

0.000093 

3 

Neural  network  with  6  neurons 

0.2902 

0.000302 

|  1  in  one  hidden  layer  (Fig  13  (c))  | _ | _ | 

4.  Comparison  of  fuzzy  and  neural  approach 

ll  is  shown  that  it  is  much  simpler  to  implement  a  desired 
control  surface  in  a  microprocessor  using  a  neural  network 
structure  rather  than  using  a  fuzzy  type  of  controller.  The 
only  drawback  of  neural  controllers  is  that  the  design  process 
is  more  complicated  than  that  of  fuzzy  controllers.  However, 
tins  difficulty  can  be  easily  overcome  with  proper  design 
tools. 

It  was  proven  that  in  the  case  of  neural  controller 
implementation  using  a  microprocessor,  the  code  is  simpler, 
much  shorter,  the  processing  time  is  comparable,  and  the 
control  surfaces  obtained  with  neural  controllers  are  far 
superior.  Control  surfaces  obtained  from  neural  controllers 
also  do  not  exhibit  the  roughness  of  fuzzy  controllers  that 
can  lead  to  unstable  or  raw  control. 

5.  Conclusion 

Fuzzy  controllers  do  have  several  advantages  such  as 
simple  rule  based  design,  but  they  usually  produce 
relatively  raw  control  surfaces,  which  are  not  acceptable 
for  precision  control.  This  obstacle  can  be  overcome  by 
several  means.  Instead  of  the  triangular  or  trapezoidal 
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membership  functions,  Gaussian-like  functions  could  be 
used.  Better  results  are  also  possible  with  Tagagi-Sugeno 
fuzzy  controllers. 

One  severe  disadvantage  of  a  fuzzy  system  is  its  limited 
ability  of  handling  problems  with  multiple  inputs.  Fuzzy 
systems  work  well  with  two  inputs.  However,  with  an 


increase  in  the  number  of  inputs,  the  size  of  the  rule  table 
grows  exponentially.  Because  of  this,  three  inputs  is  a 
practical  limit.  In  the  case  of  neural  networks,  the  number 
of  inputs  can  be  much  larger. 


Table  3.  Comparison  of  various  fuzzy  and  neural  controllers 


Type  of  controller 

length  of  code  in 
bytes 

processing  time 
(ms) 

Error  MSE 

1 

Zadeh  fuzzy  controller  with  trapezoidal  membership  function 
(7*7  input  and  7  output) 

2324 

1.95 

0.945 

2 

Zadeh  fuzzy  controller  with  triangular  membership  function 
(7*7  input  and  7  output) 

2324 

1.95 

0.671 

~T~ 

Zadeh  fuzzy  controller  with  Gaussian  membership  function 
(7*7  input  and  7  output) 

3245 

39.8 

0.585 

4 

Tagagi-Sugeno  fuzzy  controller  with  trapezoidal  membership 
function  (7*7  input ) 

1502 

28.5 

0.309 

5 

Tagagi-Sugeno  fuzzy  controller  with  triangular  membership 
function  (7*7  input ) 

1502 

28.5 

0.219 

6 

Tagagi-Sugeno  fuzzy  controller  with  Gaussian  membership 
function  (7*7  input ) 

2845 

52.3 

0.306 

7 

Neural  network  with  3  neurons  in  cascade  (Fig  13  (a)) 

680 

1.72 

0.000578 

8 

Neural  network  with  5  neurons  in  cascade  (Fig  13  (b)) 

1070 

3.3 

0.000093 

9 

Neural  network  with  6  neurons  in  one  hidden  layer 

(Fig  13(0) _ 

660 

3.8 

0.000302 
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Abstract 

A  DSP-based  magnetic  flux  controlled  induction  motor 
drive  with  a  grey-fuzzy  controller  is  designed  and 
implemented.  This  paper  investigates  the  possibility  of 
applying  grey  and  fuzzy  algorithms  in  a  DSP-based 
induction  motor  controller ,  which  requires  faster  and 
more  accurate  response  compared  with  a  traditional  fuzzy 
controller.  The  proposed  controller  contains  two  main 
parts.  The  first  one  is  the  grey  prediction  controller , 
which  uses  the  modified  grey  model ,  and  the  other  is  the 
general  fuzzy  logic  controller.  The  experiments  are 
performed  by  controlling  the  induction  machines,  that  it  is 
executed  by  the  digital  signal  processor  (TMS320C30). 
From  the  experimental  results,  the  good  dynamic  speed 
performance  and  torque  regulation  characteristics  will  be 
obtained.  It  also  shows  the  robustness  of  the  fuzzy  logic 
and  predictive  ability  of  the  grey  model. 

1.  Introduction 

Because  the  high  speed  digital  signal  processor  has  the 
characteristic  to  access  a  high  accurate  computation 
within  a  short  sampling  period,  the  speed  and  the  torque 
control  with  a  high  precision  of  induction  machines  have 
been  widely  applied  to  various  areas  of  the  industrial 
control  system.  The  control  strategy  for  induction  motor 
is  usually  carried  out  by  the  proportional- integral  (PI) 
method.  The  strategy  will  be  good  enough  as  long  as  the 
speed  and  accuracy  requirements  of  the  control  system 
are  not  critical.  The  usual  way  to  optimize  the 
performance  is  to  tune  the  PI  gains,  but  this  cannot  cope 
with  a  varying  control  environment  or  system 
nonlinearity.  Therefore,  it  is  important  to  develop  an 
effective  control  technique  for  precise  speed  and  torque 
control. 

In  recent  years,  adaptive  theory  has  been  widely 
employed  to  design  a  controller  for  induction  motor  [1]- 
[3].  They  have  developed  a  new  control  scheme  which 
combines  the  direct  adaptive  control  law  with  a  variable- 
structure  control  law  for  nonlinear  induction  motor  drive. 
This  adaptive  control  combination  has  been  shown  to 
improve  the  transient  behavior  and  the  robustness  with 


respect  to  the  external  input  disturbance  and  unmodelled 
dynamics,  and  also  to  overcome  the  chattering  problem 
which  is  one  of  the  main  drawbacks  of  the  variable- 
structure  method.  However,  the  application  of  adaptive 
control  approach  still  needs  some  knowledge  about  the 
system  order  and  range  of  variations,  and  its  output 
response  has  oscillatory  features  during  the  initial 
learning  interval. 

Since  the  dynamic  model  of  vector  control  for 
induction  motor  with  decouple  is  very  complicated, 
model-based  traditional  controllers  face  on-line 
computation  problems  during  their  implementation. 
Hence  the  development  of  model  free  intelligent 
controllers  is  attracting  recent  attention,  in  particular. 
Fuzzy  logic  control  is  a  promising  approach.  However, 
the  application  of  fuzzy  logic  control  still  needs 
considerable  effort  to  find  the  appropriate  membership 
functions  and  fuzzy  rules,  especially  when  the  system  is 
complicated  or  rapidly  changing.  These  significantly 
increase  the  difficulties  in  designing  a  traditional  fuzzy 
controller  for  induction  machines. 

With  the  recent  development  of  grey  theory  and 
microprocessors,  the  grey  control  theory  is  being 
increasingly,  applied  in  various  fields  of  engineering  [4]- 
[6].  In  this  paper,  a  grey  prediction  algorithm  is  employed 
to  estimate  the  system’s  next  step  output  for  a  look-ahead 
fuzzy  controller,  which  is  based  on  the  predictive  speed 
error  and  error  change.  Therefore,  the  proposed  control 
strategy  will  reduce  the  difficulties  in  implementing  the 
fuzzy  controller  and  use  a  predictor  to  pre- com  pen  sate  the 
system  output  error  instead  of  post-compensation.  This 
strategy  also  effectively  improves  the  time  delay  problem 
of  control  system. 

2.  Derivation  of  the  Fuzzy  Controller  with 
Grey  Prediction  for  Motor  Drive 

Generally,  the  requirements  for  a  better 
performing  motor  drive  system  are:  (i)  fast  tracking  of  a 
set  point  without  overshoot;  (ii)  the  maximum  speed  dip 
and  restore  time  due  to  load  change  must  be  kept  as  small 
as  possible;  and  (iii)  the  steady-state  errors  must  be  zero. 
In  order  to  achieve  the  object  (i),  the  fuzzy  control 
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algorithm  is  suitable.  The  grey  predictive  ability  will 
respond  to  the  trend  of  the  error  in  advance  under  load 
disturbance  condition,  so  it  can  solve  the  problem  (ii).  To 
overcome  (iii),  the  outer  speed  feedback  loop  must  be 
added.  Since  the  dynamic  drive  system  model  is  not 
necessary  for  the  grey-fuzzy  controller  design,  and  the 
performance  of  the  grey-fuzzy  controller  is  robust,  it  is 
very  suitable  for  this  application.  The  block  diagram  of 
the  proposed  controller  is  shown  in  Fig.  1.  In  the 
following,  the  development  of  the  grey  model  and  the 
fuzzy  controller  is  described. 


operation,  where  Jf(l)  =  [a-1  (1),  jc1  (2), . . . ,  jc1  («)] 
and  is  derived  as  follows: 

x'(k)  =  Yjx(m).  (1) 

m= l 

Step  3:  From  X{\)  we  can  form  the  following  first  order 
differential  equation: 

dx]  ,  ... 

—  +  ax  =  u  .  (2) 

dt 


2.1  Grey  System  [7] 


Step  4:  From  Step  3  we  have 


Grey  theory  treats  any  random  variations  as  a 
variation  in  the  grey  value  in  a  certain  range,  and  the 
random  process  is  considered  as  a  time-varying  grey 
process  in  a  certain  range.  In  spite  of  using  statistical 
regulation,  the  grey  theory  employs  the  method  of  data 
generating  to  obtain  a  more  regular  generating  sequence 
from  the  initial  random  data.  The  grey  prediction  is  hence 
to  establish  a  grey  model  extending  from  the  past 
information  to  the  future,  based  upon  the  past  and  present 
known  or  undetermined  information.  Then,  the  grey 
model  can  be  used  to  predict  the  future  variation  in  the 
system.  The  specific  feature  of  establishing  a  grey  model 
is  the  use  of  discrete  time  sequence  data  to  build  up  an 
ordinary  differential  equation. 

During  the  operation  of  this  method,  accumulated 
generating  operation  and  inverse  accumulated  generating 
operation  are  the  basic  tools  for  searching  the  grey  model. 
The  orders  of  these  operations  depend  on  the  order  of  the 
differential  equation  of  the  grey  model  and  the  number  of 
grey  variables.  The  general  form  of  a  grey  model  is 
GM( «,/?),  where  n  is  the  order  of  the  ordinary 
differential  equation  of  the  grey  model  and  h  is  the 
number  of  grey  variables.  The  computing  time  of 
generating  operation  increases  exponentially  with  respect 
to  the  equation  order  n  and  variable  numbers  h .  In 
addition,  better  prediction  accuracy  is  also  not  assured  by 
using  large  n  and  h  values.  Hence,  the  GM(1,1)  model 
is  extensively  employed  in  numbers  of  grey  systems  for 
prediction  application. 

The  GM(1,1)  grey  model,  i.e.,  a  single  variable 
first  order  grey  model,  is  summarized  as  follows: 

Step  1: 

Given  the  initial  data 

JT(0)  =  [jc(1),  jc(2) . jc(w  - 1),  x(«)] ,  where  x(i) 

corresponds  to  the  system  output  at  time  X .  We 
try  to  predict  the  next  x{n  +  k),  k  >  1 . 

Step  2:  From  the  initial  X(0)  a  new  sequence  ^(1)  is 
generated  by  the  accumulated  generating 


x'(k  + 1)  = 


(3) 


x(* +  !)  =  *' (*  +  !)-*(*),  (4) 


where 


=  (BTBylBTyN , 


(5) 


~0.5(jc1(1)  +  x'(2))  1 

-  0.5(;cl(2)  +  x'(3))  1 

-0.5(xl(n-\)  +  x(n)  1 


yN  =[x(2),jc(3 ),•••, x(«)]7,  and  x(£  +  l)  is  the  predicted 
value  of  x(k  + 1)  at  time  k  + 1 . 

From  the  above,  we  know  that  the  GM  is 
restrictive  and  that  only  non-negative  information  is 
available  for  it.  Hence,  it  is  necessary  to  overcome  this 
drawback.  In  [8],  the  proposed  scheme  adds  a  bias  to  the 
series;  thus,  all  elements  can  be  pulled  up  and  be  avoided 
being  negative.  This  method  is  very  simple  and  is  easy  to 
implement  in  a  DSP  chip. 

Since  the  grey  model  prediction  is  a  local  curve 
fitting  extrapolation  scheme,  at  least  four  data  sets  are 
required  for  the  first-order  single-variable  grey  prediction 
model  to  offer  a  fairly  accurate  grey  prediction. 
According  to  the  literature  [9]  and  our  experimental 
experience,  five  data  sets  can  yield  a  more  accurate 
prediction  value  than  other  choices.  However,  the 
prediction  accuracy  is  not  proportional  to  the  number  of 
data  sets,  while  the  corresponding  computing  time 
increases  exponentially.  Therefore,  in  this  study,  the  most 
recent  five  output  data  are  accumulated  to  predict  the  next 
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step  output  by  using  a  grey  model.  These  data  sets  are 
substituted  into  equations  (5)  and  (6)  for  solving  the  grey 
parameters  a  and  u  recursively.  This  principle  is  similar 
to  that  of  the  forgetting  factor  parameter  of  identification 
algorithm.  Then  the  system  next-step  output  can  be 
predicted  by  substituting  the  grey  parameters  into 
equation  (3).  This  predicted  output  of  the  next-step  is 
employed  to  calculate  the  predicted  next-step  speed  error 
and  error  change  in  the  look-ahead  fuzzy  control  strategy. 

2.2  Fuzzy  Logic  Control 

The  control  performance  of  a  traditional  controller 
fully  depends  on  the  accuracy  of  the  known  dynamic 
model  of  the  system.  However,  since  the  complex 
systems  with  several  variables  are  difficult  to  model 
appropriately  for  their  dynamic  characteristics,  a  model- 
free  intelligent  controller  is  hence  introduced  here  to 
solve  this  kind  of  problem  by  using  the  fuzzy  set  theory. 
The  main  feature  of  fuzzy  logic  controller  is  to  choose 
appropriate  linguistic  fuzzy  rules  table  established  from 
human  control  experience  and  database,  by  using 
decision-making  process.  These  fuzzy  rules  are  then 
transformed  into  a  control  type  of  human  thinking 
according  to  fuzzy  logic  and  fuzzy  set  operation.  The 
fuzzy  set  theory  was  first  proposed  by  Zadeh  [10],  and 
subsequently  there  has  been  many  control  applications 
based  on  this  theory  [11-[14].  In  most  of  these 
applications,  the  main  design  objective  is  to  construct  a 
fuzzy  system  to  approximate  the  desired  control  action. 
The  fuzzy  set  theory  has  also  employed  as  an  alternative 
to  traditional  modeling  and  control  designs  for  providing 
suitable  representation  of  the  system  [15]. 

Fuzzy  logic  control  applies  the  fuzzy  set  theory  to 
simulate  the  logical  reasoning  of  human  beings.  The 
major  parts  of  the  fuzzy  controller  are  a  set  of  linguistic 
control  rules,  composed  of  fuzzy  rules,  and  an  inference 
engine  to  digest  these  rules.  These  rules  offer  a 
transformation  between  the  linguistic  control  knowledge 
of  an  expert  and  the  automatic  control  strategies  of  an 
activator.  Every  fuzzy  control  rule  consists  of  an 
antecedent  and  a  consequent;  a  general  form  of  the  rules 
can  be  expressed  as 

R, :  IF  X  is  A  and  Y  is  B ,  THEN  U  is  C , 

where  Rt  is  the  /  th  rule,  X  and  Y  are  the  states  of  the 
system  output  to  be  controlled  and  U  is  the  control  input, 
A  ,  B  and  C  are  the  corresponding  fuzzy  subsets  of  the 
input  and  output  universe  of  discourse. 

The  importance  is  the  outputs  of  each  rule  depends 
on  the  membership  functions  of  the  linguistic  input  and 
output  variables.  The  membership  functions 


corresponding  to  the  speed  error  and  flux  error  can  be 
divided  into  unequal  spans,  in  congruence  with  the 
system  characteristics.  The  range  of  the  fuzzy  variables 
can  also  be  adjusted  according  to  variations  in  the  system. 
According  to  the  characteristics  of  system  dynamics,  the 
appropriate  fuzzy  rules  can  be  established  and  adjusted  by 
using  a  prescriptive  algorithm  to  maintain  the  system  step 
response  within  a  band  [16].  In  addition,  the  fuzzy  rules 
and  scaling  factors  can  be  planned  and  adjusted  on  the 
basis  of  the  fuzzy  phase  plane  analysis  by  incorporating 
the  closed-loop  system  output  error  in  the  domain  of 
fuzzy  logic  control  rules  [17].  In  the  present  study,  49 
fuzzy  rules  are  employed  to  control  the  induction 
machine  as  listed  in  table  I. 

The  input  variables  of  the  fuzzy  controller  are  the 
predictive  speed  error  and  error  change  of  speed  error  for 
the  next  step.  The  membership  function  used  in  this  study 
the  fuzzification  is  of  triangular  type,  which  can  be 
expressed  as 

K*)  =  ^(r\x-P\  +  W)  (7) 

where  W  is  the  span  of  distribution  of  the  membership 
function  and  p  is  the  parameter  corresponding  to  value  1 
of  the  membership  function.  The  height  method  is 
employed  to  defuzzify  the  fuzzy  variables  in  order  to 
obtain  the  control  input.  The  equation  can  be  described  as 

y=YLw<y‘ IYLWi’  (8) 

i  !  i  j 

where  ^  (Xj)  is  the  linguistic  value  of  the  fuzzy  set 
variable,  wi  is  the  weight  of  the  corresponding  activated 
rule,  y.  is  the  resulting  fuzzy  control  value  of  the  i  th 
fuzzy  rule  and  y  is  the  net  fuzzy  control  action. 

3.  The  Magnetic  Flux  controlled  Induction 
Motor  Drive 

In  order  to  evaluate  the  performance  of  the 
proposed  controller  based  upon  the  system  response,  a 
magnetic  flux  controlled  induction  machine  is  designed 
and  constructed  in  our  laboratory.  The  magnetic  flux 
pulse-width-modulation  (PWM)  method  is  shown  in  Fig. 
2.  It  is  based  on  eight  kinds  of  space  vectors  from  an 
inverter  bridge.  In  the  figure,  the  on  (off)  state  of  the 
upper  inverter  bridge  denotes  “1”  (“0”).  There  are  six 
nonzero  vectors  V1-V6  that  have  length  and  two  zero 
vectors  VO  and  V7  that  have  no  length  because  the 
inverter  bridge  produces  zero  voltage.  The  space  voltage 
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vector  is  selected  at  every  time  interna!  so  that  the  flux 
vector  produced  from  the  inverter  output  line-to-line 
voltage  may  move  along  an  ideal  circle. 

To  select  proper  voltage  vectors,  the  selection  rule 
[18]  should  be  taken  into  consideration.  The  selection  of 
the  voltage  vectors  is  finished  in  advance  and  the  whole 
patterns  are  burned  into  a  ROM  table.  The  relation 
between  the  output  voltage  and  the  DC  link  voltage  is 
expressed  as  [19] 


Vrrnt  =  Jf x-x-x(l-r)x^-, 
V  2  7t  n  2 


(9) 


where  VPWK{  is  the  rms  fundamental  component  of  the 
output  voltage,  Edc  is  the  dc  link  voltage  of  the  inverter, 
and  r  is  the  portion  of  zero  vectors.  Let 


implemented  individually  to  an  induction  motor.  The 
block  diagram  is  shown  in  Fig.  1  where 

e(k)  =  wr(k)-wr(k), 

e(k  + 1)  =  G(five  error  data),  (11) 

A  e(k  + 1)  =  e(k  + 1)  -  e(k). 

Here  Wr  is  the  feedback  speed  of  the  motor  and  w]  is 
the  command  speed  of  the  motor.  In  addition,  e(k  + 1)  is 
the  predictive  value  of  e(k)  at  the  (k+l)th  sampling 
interval,  Ae(£  +  1)  is  the  difference  between  e(k  + 1)  and 
e(k) ,  and  G(  •  )  is  the  grey  operator.  The  actual  control 
input  for  this  fuzzy  control  with  grey  prediction  system  is 

w(*)  =  w(A-1)  +  Am(*)  (12) 


A 4  =  —  x  —  x  (1  -  r),  then 
k  n 

(10) 

where  M  can  be  termed  as  the  voltage  modulation  factor. 

Finally,  the  induction  motor  is  driven  by  a  PWM 
transistor  inverter  wherein  the  current  feedback  loop  is 
employed.  Fast  and  stable  current  controlled  by  PWM  is 
achieved  by  using  a  pulse  number  high  enough  to  limit 
the  current  ripple  to  an  allowable  value  with  the  help  of 
the  leakage  inductance.  The  frequency  is  determined  by 
the  rotating  speed  of  the  rotor  and  slip  frequency.  The 
rotor  speed  is  detected  by  an  encoder  mounted  on  the 
shaft. 

4.  Experimental  Results 

Since  induction  machines  exhibit  obvious 
nonlinear  behaviors,  the  application  of  a  traditional  PI 
controller  needs  intricate  gain  adjustment  for  each 
situation.  Hence  fuzzy  logic  control  is  here  employed  to 
improve  the  system  robustness  and  facilitation.  However, 
the  fuzzy  logic  control  still  needs  considerable  efforts  to 
find  the  appropriate  membership  functions  and  fuzzy 
rules  for  quick  speed  or  torque  control.  Hence,  the  grey 
predictive  theory  is  utilized  in  this  study  to  predict  the 
system  next-step  output  response  for  pre-correction  by 
using  a  look-ahead  fuzzy  control.  This  strategy  is 
expected  to  improve  the  robustness  of  the  fuzzy  controller 
and  to  reduce  the  difficulty  in  implementing  the  fuzzy 
controller. 

In  this  paper,  both  a  traditional  fuzzy  controller 
and  a  fuzzy  controller  with  grey  prediction  are 


In  order  to  demonstrate  the  performance  of  the 
proposed  fuzzy  controller  with  grey  prediction, 
experimental  results  obtained  from  this  controller  are 
compared  with  those  obtained  from  the  traditional  fuzzy 
controller.  The  membership  functions  of  speed  error, 
error  change  and  control  input  used  here  are  given  in  Fig. 
3,  where  a,  is  the  scaling  factor,  the  subscript  7=1,2, 
devotes  the  speed  and  flux  where  the  superscript  j  =  1,2,3 
is  used  to  devote  the  error,  error  change  and  control 
output.  The  parameters  of  the  membership  functions  used 
in  this  study  are  listed  in  table  II. 

The  hardware  of  the  drive  system  and  the  software 
realization  of  the  proposed  grey-fuzzy  controller  using  a 
TMS320C30  DSP  chip  is  performed.  The  motor  used  in 
this  drive  system  is  a  3-phase  60-Hz  Y-connected  4-pole 
220-V  3-HP  induction  motor.  Fig.  4  shows  the  dynamic 
speed  responses  of  both  the  proposed  grey-fuzzy 
controller  and  the  traditional  fuzzy  controller  in  case  of  a 
speed  command  at  80  rpm  (equivalent  voltage  8v).  The 
results  indicate  that  good  speed  following  response  is 
obtained.  It  also  shows  the  proposed  grey-fuzzy  controller 
has  a  faster  rising  time  than  the  traditional  fuzzy 
controller  does. 

One  may  say  that  the  fuzzy  controller  can  also 
obtain  the  same  rising  time  as  can  the  proposed  controller. 
After  tuning  the  parameters  of  traditional  fuzzy  controller, 
Fig.  5  can  be  obtained.  It  is  clearly  that  the  two 
controllers  have  the  same  rising  time,  but  an  overshoot 
occurs  in  the  pure  fuzzy  controller.  As  for  the  load 
regulation  characteristics,  the  same  load  disturbance  is 
used  both  in  the  proposed  drive  and  the  traditional  fuzzy 
controller.  Fig.  6  shows  the  dynamic  speed  responses  at  a 
load  disturbance  under  two  different  operating  conditions. 
Good  speed  regulation  characteristics  of  the  proposed 
system  are  also  observed  in  Fig.6.  The  experimental 


351 


results  show  that  the  proposed  grey-fuzzy  controller  gives 
a  better  result. 

5.  Conclusions 

The  control  system  design  and  implementation  of  a 
DSP-based  grey-fuzzy  induction  motor  drive  have  been 
presented  in  this  paper.  Based  on  the  proposed  scheme,  a 
very  fast  speed  dynamic  response  under  a  constant  flux 
condition  is  achieved.  Integrate  a  fuzzy  controller  with  a 
grey  predictor  is  designed  to  obtain  good  dynamic  speed 
responses  and  load  disturbance  regulation  characteristics. 
By  using  the  experience  with  the  motor  drive  system,  the 
fuzzy  algorithms  are  designed  easily  and  the 
mathematical  grey  processing  can  be  simply  carried  out 
by  the  DSP  chip.  The  experimental  results  also  show  that 
good  responses  is  achieved  by  the  proposed  grey-fuzzy 
controller. 
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Figure.  2  Voltage  vectors  made  by  the  inverter. 
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Figure.  1  The  block  diagram  of  the  proposed  grey-fuzzy  controller 
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Figure.  3  The  membership  functions  of  the  traditional 
fuzzy  controller 
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Figure.  4  Speed  responses  from  0  to  80  rpm  (8v)  of  grey- 
fuzzy  and  traditional  fuzzy  controller 
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Figure.  5  Speed  responses  with  the  same  rising  time  of 
grey-fuzzy  and  traditional  fuzzy  controller 


Figure.  6  Load  regulation  characteristics  of  grey-fuzzy 
and  traditional  fuzzy  controller 

Table  I.  The  fuzzy  control  rules  of  this  fuzzy  controller 
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Table  II.  The  scaling  factors  of  this  fuzzy  controller 
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Abstract 

This  paper  presents  a  fuzzy  control  scheme  for  the 
regulation  of  pulse-width  modulation  (PWM)  inverters 
used  in  uninterruptible  power  supplies  (UPS).  The 
performance  is  superior  to  a  conventional  UPS. 
Conventional  UPS  systems  operate  to  protect  against 
such  disturbances  using  complex  fdtering  schemes ,  often 
employing  large  passitive  components.  This  paper 
presents  the  fuzzy  control  [1,2]  of  UPS  systems.  A 
prototype  has  been  implemented  and  tested  to  verify  its 
performance.  The  experimental  results  justify  the  input 
and  output  performance  of  the  proposed  system  that  is 
better  than  the  conventional  control  method. 


1.  Introduction 

Uninterruptible  power  supplies  provide  electric  power  for 
critical  functions  and  equipment  when  the  quality  of  the 
normal  supply,  i.e.,  utility  power,  is  not  adequate  or  fails 
entirely.  With  the  rapid  growth  in  the  utilization  of  data 
processing  systems,  life  care  medical  equipment,  alarm 
systems,  and  safety  lighting,  the  demand  for  high  quality 
uninterruptible  power  is  increasing  [3-6].  Further,  with 
the  widespread  application  of  high  efficiency  static  power 
converter  devices,  many  electrical  loads  are  nonlinear, 
including  the  above  mentioned  critical  loads,  and  generate 
harmonics.  Therefore,  additional  harmonic  filtering 
techniques  must  be  applied  in  order  to  maintain  a  high 
quality  sinusoidal  UPS  inverter  output  voltage. 

A  typical  UPS  consists  of  a  rectifier  supplied  battery 
bank  for  energy  storage  and  a  static  inverter-filter  system 
to  convert  a  dc  voltage  to  a  sinusoidal  ac  output  (Fig.  1). 
The  inverter  is  typically  operated  with  a  PWM  strategy 
under  feedback  control  to  realize  the  desired  output 
waveform.  UPS  systems  can  be  off-line,  where  the  load  is 
connected  directly  to  the  utility  under  normal  operation, 
and  emergency  power  is  provided  by  the  UPS.  The  on¬ 
line  configuration  is  shown  in  Fig.  1,  where  the  UPS 
inverter  powers  the  load  continuously. 

Control  of  the  UPS  inverter  switching  is  important 
to  minimize  the  total  harmonic  distortion  (THD)  of  the 


output  voltage.  The  difficulty  in  successful  switching 
control  operation  stems  from  the  output  impedance  of  the 
inverter  filter  [4].  Much  attention  has  been  focused  on 
providing  a  near  zero-impedance  inverter  stage  which  in 
theory  would  provide  near  zero  distortion  of  the  output 
voltages,  independent  of  the  load  conditions.  Low  output 
filter  impedance  can  be  realized  via  a  high  inverter 
switching  frequency. 

Thus,  modem  UPS  systems  minimize  the  THD  of 
the  inverter  output  voltage  through  the  use  of  complex 
filtering  schemes  employing  large  passive  components.  In 
addition,  a  number  of  pulse  width  modulation  techniques 
have  been  investigated  to  compensate  for  the  filter  output 
impedance  and  reduce  the  output  voltage  distortion  [5]. 
Real  time  PWM  control  of  the  inverter  output  voltages 
provides  the  ability  to  dynamically  adapt  to  changing  load 
conditions. 

In  this  paper,  we  propose  a  fuzzy  controller  to 
control  the  inverter  switches  to  generate  the  required 
PWM  pattern  to  produce  low  THD  sinusoidal  output 
voltage.  The  feature  of  the  fuzzy  logic  control  schemes  is 
that  even  if  the  mathematical  model  of  the  considered 
system  is  unknown,  fuzzy  control  still  can  achieve  the 
control  object.  The  fuzzy  control  is  constructed  by  the 
rule-based  expert  knowledge  or  experience  that  is  simple 
to  design. 

This  paper  is  organized  as  follows.  In  Section  2,  the 
configuration  of  the  DC-AC  inverter  is  introduced.  In 
Section  3,  the  design  of  proposed  fuzzy  controller  for 
UPS  system  is  described.  The  experimental  results  are 
presented  in  Section  4.  Finally  in  Section  5,  the  general 
conclusion  is  formulated. 

2.  Configuration  of  the  dc-ac  inverter 

Single-phase  voltage  source  PWM  inverters  are 
commonly  used  in  low  and  medium  power  UPSs.  In  such 
applications,  the  output  of  the  inverter  is  usually  provided 
with  an  LC  filter  to  reduce  the  inverter  output  harmonics. 
In  the  following,  the  configuration  of  DC-AC  output 
stage  including  single-phase  PWM  inverter  and  LC  filter 
is  introduced. 
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2.1  Single-phase  PWM  inverter 

Switch-mode  DC-AC  power  inverters  are  used  in 
AC  motor  drives  and  UPSs  where  the  objective  is  to 
produce  a  sinusoidal  AC  output  with  controllable 
magnitude  and  frequency  [6].  Although  several  possible 
inversion  schemes  exist,  today’s  most  applications  are 
dominated  by  PWM  inverters  due  to  their  abilities  of 
providing  both  voltage  and  frequency  control  and 
generating  output  waveform  with  low  harmonic  distortion 
[7-11]. 

In  the  PWM  inverter  of  single  phase  systems,  the 
most  obvious  and  popular  configurations  are  the  half¬ 
bridge  and  the  full-bridge  inverters  shown  in  Fig.  2  [12]. 
With  the  same  DC  input  voltage,  the  maximum  output 
voltage  of  the  full-bridge  inverter  is  twice  that  of  the  half¬ 
bridge  inverter.  This  implies  that  for  the  same  power,  the 
output  current  and  the  switch  currents  are  one-half  of 
those  for  a  half-bridge  inverter.  At  high  power  levels,  this 
is  a  distinct  advantage,  since  it  requires  less  paralleling  of 
devices. 

The  full-bridge  inverter  is  shown  in  Fig.  2(b).  If  Sa 
and  Sd  are  turned  on  and  off  together  by  a  single  pulse 
width  modulator  and  a  single  modulation  waveform,  we 
can  observe  that  the  output  voltage  V0  switches  between 
-Vd  and  Vd  voltage  levels. 

2.2  LC  filter 

PWM  uses  an  analog  signal  to  modulate  the  width  of 
pulses  whose  durations  are  proportional  to  the  amplitude 
of  the  modulating  signal  at  that  instant.  Since  most 
electronic  loads  are  nonlinear  and  inject  harmonic 
currents  into  the  UPS,  additional  harmonic  filtering 
techniques  must  be  applied  and  the  inverter  must  allow 
for  almost  instantaneous  control  over  its  output  AC 
waveform  to  reduce  the  harmonic  distortion  to  acceptable 
levels.  The  application  of  a  second  order  LC  type  low 
pass  filter  shown  in  Fig.  3  is  common  practice  in  the 
attenuation  of  the  harmonics  [13,14].  Design  of  the  output 
LC  filter  is  dependent  on  the  inverter  switching  frequency 
[15].  Individual  component  size  selection  depends 
upon  the  type  of  load  being  applied  and  the  DC  bus 
voltage.  When  supplying  a  nonlinear  load,  the  filter 
output  impedance  should  be  minimized  by  maximizing 
the  capacitor  size.  Large  inductor  with  low  DC  bus 
voltage  will  make  the  output  voltage  feedback  sluggish. 
Too  small  an  inductor  will  result  in  large  ripple  in 
inductor  current,  resulting  in  increased  losses.  Optimum 
component  values  may  be  determined  by  using 
simulation. 


2.3  Feedback  control  of  PWM  UPS  inverter 

A  typical  specification  for  the  output  voltage  of 
a  UPS  is  5  percent  THD.  To  achieve  this  level,  a 
method  can  be  used  is  to  utilize  feedback  control  of  the 
output  voltage  waveform  to  cancel  the  harmonic 
distortion.  Theoretically,  the  output  impedance  of  a 
PWM  UPS  inverter  can  be  controlled  by  the  gain 
and  bandwidth  of  the  output  voltage  in  perfect 
coincidence  with  a  sinusoidal  reference  voltage,  in  the 
fact  of  nonlinear  load,  then  the  inverter  can  be  said  to 
have  zero  output  impedance  at  the  frequency  components 
of  the  load  current.  Some  characteristics  of  using 
feedback  control  are  listed  below: 

1 .  There  appear  to  be  inherent  limits  to  the  reduction  of 
output  impedance  and  bandwidth  when  using 
discrete-time  feedback  control.  Two  controllers 
have  been  used  effectively:  One  is  to  compensate  for 
harmonic  disturbances;  the  other  is  for  fast  response 
with  sudden  load  changes. 

2.  The  highest  possible  PWM  switching  frequency 
should  be  used,  so  that  the  break  frequency  of  the 
LC  filter  occurs  for  outside  the  range  of  the  load 
current  harmonics.  This  allows  the  highest  possible 
control  bandwidth. 

3.  The  inverter  must  be  designed  with  additional 
voltage  capability  to  compensate  for  the  voltage 
drop  across  the  filter  inductance. 

Because  zero  output  impedance  is  achieved  by 
compensation  of  forward  voltage  drop  on  the  output 
inductor,  it  will  be  possible  to  compensate  the  voltage 
drop  by  measuring  the  inductor  current,  and  thus  to 
eliminate  output  voltage  variation.  Inverter  with  desired 
small  output  voltage  variation,  must  have  an  additional 
circuit  for  compensation  of  the  inductor  voltage  drop. 
Equivalent  circuit  of  the  inverter  output  stage  is  shown  in 
Fig.  3  where  the  filter  real  inductor  L  is  represented  as  an 
equivalent  ideal  inductor  L  in  series  with  its  ohmic 
resistance  R*.  From  Fig.  4,  the  inductor  voltage  drop  VL  is 
given  as 

VL=IL(Re+L  )•  (1) 

at 

By  measuring  the  inductor  current  IL,  a 
corresponding  compensation  voltage  Vc  is  obtained, 
proportional  to  the  inductor  voltage  drop  VL.  By  adding 
this  compensating  voltage  Vc  to  the  input  signal  Vinv,  the 
influence  of  the  inductor  voltage  drop  to  the  inverter 
output  voltage  will  be  eliminated.  If  Vc  equals  to  VL,  VQ 
will  equal  to  Vinv.  Hence,  the  output  voltage  V0  depends 
on  the  fundamental  of  the  PWM  modulated  signal,  that  is 
proportional  to  the  sinusoidal  reference  signal  Vref  and 


compensation  signal  Vc.  Theoretically,  such  an  inverter 
will  have  zero  output  impedance. 

3.  The  fuzzy  controller  design  for  UPS  system 

The  most  important  three  inputs  of  the  controller  are 
Verr(k),  cVerr(k),  Vc(k)  and  one  output  AT{k  4*1)  is  used 
for  the  PWM  pulse  width  control  system.  In  view  of  this 
figure,  Verr(k)  denotes  the  error  between  the  reference 
sinusoidal  waveform  Vrej(k),  and  the  output  voltage 
Vinv(k)  at  time  instance  AT,  and  Vc(k)  denotes  the 
inductor  voltage  drop,  where  A:  is  a  positive  integer  and  T 
is  the  sampling  time;  cVerr(k)  denotes  the  change  of  the 
error;  A T(k)  denotes  the  output  change  of  fuzzy 
controller,  they  are  defined  as  follows: 


Verr(k)  =Vre/k)  - Vinv(k)  +Vc(k) 

(2) 

c  Verr(k)  —VerrflO  —  Verr(k  —1) 

(3) 

A  m  =  F[V^{k),cVtrr{k)} 

(4) 

where  Ff4]  denotes  the  fuzzy  controller. 

The  control  rules  for  the  UPS  system  are  shown  in 
Table  1.  In  these  control  rules,  seven  linguistic  sets  are 
applied  for  the  fuzzy  variables.  The  seven  linguistic  sets 
are  defined  as 

{  NB,  NM,  NS,  ZE,  PS,  PM,  PB  }  , 

where  N  is  negative,  P  is  positive,  B  is  big,  M  is 
medium,  S  is  small,  and  ZE  is  zero. 

Having  defined  the  linguistic  control  rules,  the 
membership  functions  corresponding  to  each  element  in 
the  linguistic  set  must  be  defined.  For  simplicity,  the 
trapezoidal  functions  shown  in  Fig.  5  are  proposed,  where 
the  universal  sets  Verr(v)  and  cVerr(v)  are  all  defined  at 
the  interval  (-3.6,-3.0,...,3.0,3.6)  and  (-1.8, -.15, ...,1.5, 1.8). 
After  the  above  operation  is  done,  then  we  can  get  a  fuzzy 
controller  in  Fig.  6. 

As  shown  above,  the  output  of  the  plant  will 
feedback  to  the  first  stage  with  the  reference  signal  to 
generate  the  error  signal.  Then,  the  fuzzy  mechanism  will 
generate  a  suitable  AT  to  control  the  PWM  pulse  width. 

4.  Experimental  results 

The  whole  system  has  been  established  and  tested 
with  a  laboratory  prototype.  In  all  cases  the  inverter 
system  is  supplying  a  single-phase  rectifier-type  nonlinear 
load.  The  PWM  switching  frequency  is  6  kHz. 


Fig.  7  and  Fig.  8  show  the  output  voltage  and  load 
current  waveforms  with  the  deadbeat  controller  and  fuzzy 
controller  respectively  for  Vd=200V,  where  the  DC 
voltage  source  is  adopted  the  switching  power  supply. 
The  total  harmonic  distortions  of  the  output  voltage  are 
equal  to  2.5%  and  1.2%,  respectively.  From  the  result, 
the  voltage  drop  has  significant  improvement  with  the 
fuzzy  controller.  When  the  DC  voltage  is  turned  more 
larger,  the  load  current  will  be  overload.  Thus,  in  the 
following  experiment,  the  DC  voltage  source  is  replaced 
with  an  autotransformer  and  a  rectifier. 

Fig.  9  and  Fig.  10  show  the  output  voltage  and  load 
current  waveforms  with  the  deadbeat  controller  and  fuzzy 
controller  respectively  for  Vd=300V.  The  total  harmonic 
distortions  of  the  output  voltage  are  equal  to  8.8%  and 
4.5%,  respectively.  It  is  apparent  that  the  fuzzy  controller 
works  better  than  the  deadbeat  controller. 

5.  Conclusions 

This  paper  describes  an  algorithm  which  applies  a 
fuzzy  controller  to  digital  control  of  the  single  phase 
PWM  inverter  for  UPS  application.  This  controller  is  used 
to  compensate  the  nonlinear  load.  The  output  is  fed  into 
the  fuzzy  logic  controller  to  get  a  proper  PWM  switching 
pattern. 

The  features  of  the  proposed  DC-AC  converter  are 
shown  below. 

1.  The  experimental  results  demonstrate  the  total 
harmonic  distortion  of  the  output  voltage  can  be 
reduced  to  less  than  5%  by  feedback  control  to 
achieve  zero  output  impedance  of  the  PWM 
inverter. 

2.  No  matter  what  the  input  DC  voltage  level  is, 
the  fuzzy  controller  has  the  better  performance 
than  the  deadbeat  controller. 

From  the  experimental  results,  the  proposed  fuzzy 
controller  has  better  transient  performance  and  low  total 
harmonic  distortion  for  nonlinear  loads  than  deadbeat 
controller. 
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<a)  Half-Bridge. 


(b)  Full-Bridge. 

Figure  2  Configuration  of  single-phase  inverter. 


Figure.  3  Single  phase  full-bridge  inverter  with  LC  filter. 


Figure  4  Real  inverter  output  circuit. 


Figure  1  Block  diagram  of  on-line  UPS  system. 
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Figure  5  The  membership  function 
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Figure  6  The  fuzzy  controller 


Figure.  7  Inverter  output  voltage  and  load  current 
waveforms  with  deadbeat  control. 


Figure  9  Inverter  output  voltage  and  load  current 
waveforms  with  deadbeat  control. 


Figure  10  Inverter  output  voltage  and  load  current 
waveforms  with  fuzzy  controller. 


Table  1.  The  control  rules 
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Figure  8  Inverter  output  voltage  and  load  current 
waveforms  with  fuzzy  controller. 
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Abstract 

Hydraulically  driven  crane  is  a  multibody  system,  which 
can  be  studied  by  the  means  of  simple  kinematics 
transformation  matrix  method  [1]  and  applied  for 
hydraulic  driven  crane .  However  if  the  elasticity  of 
mechanical  structure  need  to  be  taken  into  account  the 
number  of  degrees  of  freedom  increases  significantly  and 
special  methods  are  required.  ADAMS-software  is  a 
suitable  environment  for  simulating  and  analysing  such 
systems.  Elastic  structures  can  be  modelled  in  FEM- 
software  and  then  imported  to  ADAMS  where  the 
complete  system  can  be  assembled  and  simulated.  This 
approach  leads  to  precise  result  but  it  is  a  numerically 
heavy  method  and  requires  advanced  knowledge  on 
mechanics.  The  purpose  of  this  study  is  to  evaluate  the 
importance  of  mechanical  flexibility  in  the  case  of  a 
typical  forest  crane.  The  evaluation  is  made  by  comparing 
the  eigenfrequency  charts  of  cranes  where  flexibility  is 
and  is  not  taken  into  account. 


1.  Introduction 

Typical  hydraulically  driven  log  cranes  are  constructed 
with  light  and  long  boom  structures  that  can  present 
important  deflections  during  its  operation  time.  Until  the 
latest  years,  dynamic  simulations  of  cranes  where  carried 
using  rigid  models  of  the  mechanical  structures  of  the 
crane.  Therefore,  elastic  deformations  and  damped 
vibrations  where  omitted,  making  the  analysis  less 
accurate.  The  dynamic  simulation  of  flexible-bodies 
mechanisms  involves  solving  a  large  second  order  of 
coupled  differential  equation  system  for  each  one  of  the 
discretizations  of  the  flexible  bodies,  as  it  is  showed  in 
section  (2.3).  Therefore  coding  all  the  equations  of  motion 
for  each  model  is  not  encouraged. 

In  this  paper,  an  FEM-software  (ANSYS)  and  dynamic 
system  simulation  software  (ADAMS)  is  used  to  carry  out 
the  simulation  of  a  hydro-mechanical  system  with 
mechanical  flexibility  properties.  Firstly,  the  flexible 
components  of  the  mechanism  are  modelled  in  ANSYS. 


Then,  by  performing  on  them  a  Modal  Analysis,  the 
flexible  properties  of  each  body  can  be  extracted  and 
saved  as  a  binary  file  (Modal  Neutral  file)  directly 
importable  to  ADAMS  software.  The  figure  1  illustrates 
the  procedure: 


Figure  1 .  Layout  of  the  flexibility  transfer,  model  assembly  and 
analysis 

On  the  other  hand,  a  rigid  model  of  the  same  log  crane 
is  built  in  order  to  compare  the  simulation  results  between 
the  two  models  of  the  crane.  In  the  rigid  model,  the 
flexible  properties  of  the  booms  spread  along  the  nodes 
are  replaced  by  the  mass  and  inertia  concentrated  at  each 
centre  mass  of  the  bodies.  The  rest  parts  of  the  model  and 
the  fluid  power  remain  modelled  in  the  same  way  as  the 
flexible  model. 

2.  Method 

From  ANSYS-software  package,  every  single  flexible 
body  is  modelled  and  analysed  separately  from  the  others 
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by  using  the  ANSYS  Modal  Analysis  algorithm.  The 
ANSYS  Modal  Analysis  is  able  to  write  a  file  that 
contains  all  the  geometrical  and  mechanical  flexible 
properties  of  the  body  analysed.  Therefore,  this  file  can  be 
imported  to  ADAMS,  where  the  elastic  body  will  be 
created  automatically.  The  MNF  is  directly  importable  to 
ADAMS-software,  and  the  flexible  bodies  are 
automatically  created. 

The  next  steps  consist  on  creating  (in  ADAMS- 
software  environment)  the  cylinders  and  other  rigid 
bodies,  assembling  the  model,  and  implementing  the  fluid 
power  model  governing  the  actuators. 

Afterwards,  ADAMS/Solve  and  ADAMS/Linear 
modules  are  used  simultaneously  in  order  to  perform 
dynamic  analysis  and  to  perform  eigenanalysis.  Dynamic 
simulation  drives  the  model  to  an  operating  point  and  then 
the  eigenanalysis  extract  the  natural  frequencies  and  their 
mode  shapes.  The  operation  is  repeated  until  reaching  all 
the  points  desired. 

2.1.  Modal  Analysis 

Modal  Analysis  is  the  process  of  determining  the 
inherent  dynamic  characteristics  of  a  system  and  using 
them  to  formulate  a  mathematical  model  of  the  dynamic 
behaviour  of  the  system.  Modal  analysis  characterise  the 
deformation  of  a  body  due  to  external  vibration  input. 
When  the  body  is  excited  by  external  vibration  energy,  it 
is  deformed  in  a  number  of  well-defined  wave- like 
patterns  or  modes.  Each  mode  has  its  own  specific  natural 
frequency  (eigenvalue)  and  shape  (eigenvector).  The 
analytical  formulation  to  extract  the  modes  of  a  flexible 
body  is  the  following: 

The  equation  of  motion  for  an  undamped  body 
expressed  in  matrix  notation  is 

[mIu}+[k]{u}=0.  (1) 

Where  [M]  and  [K]  are  the  mass  and  stiffness  matrices 
respectively,  {u}  =  {ur..uN}  is  the  nodal  coordinate  matrix, 
N  is  the  number  of  nodes  defining  the  FEM  modelled 
body,  and  {ii}  is  the  nodal  acceleration  matrix. 

For  a  linear  system,  free  vibrations  are  harmonic  of  the 
form  that  nodal  coordinates  can  be  expressed  as 

{u}=E{(Pi}cos(ft)it).  (2) 

Where  {<p},  is  the  eigenvector  associated  to  the  i,h 
natural  frequency  and  C0j  is  the  ith  natural  circular 

frequency.  Using  the  equation  2  in  the  equation  of  motion 
1,  the  following  equation  is  obtained: 


(-  co,2  [m]+  [k])-  {cp, }  =  0 .  (3) 

The  equation  (3)  is  satisfied  if  the  determinant  of 
M  -co2[m])  is  zero.  This  is  an  eigenvalue  problem,  which 

may  be  solved  for  up  to  n  values  of,  cd2  and  n 
eigenvectors  {cpj,  where  n  is  the  number  of  degrees  of 
freedom  (DOF).  The  eigenvalue  problem  has  the  form 

(4) 

ANSYS  Modal  Analysis  computes  the  mass  and 
stiffness  matrix  of  the  body  using  lumped  mass 
approximation  and  solves  the  numeric  eigenvalue 
equation  (4),  extracting  the  natural  frequencies  and  the 

mode  shapes  cp(  for  each  mode  i. 

2.2.  Exporting  the  flexible  information  to 
ADAMS 

ANSYS  and  ADAMS  have  been  working  together  in 
order  to  build  the  capability  of  exporting  the  geometrical 
and  mechanical  flexible  properties  of  a  body  modelled  in 
ANSYS  to  ADAMS.  This  is  possible  due  to  a  binary  file 
named  Modal  Neutral  File  (MNF)  written  during  an 
ANSYS  Modal  Analysis  of  a  single  body.  The  MNF  is 
directly  importable  to  ADAMS-software  and  contains  the 
following  information  for  each  body: 

-  Nodal  coordinates,  nodal  masses  and  nodal  inertia 

-  Element  topology 

-  Generalised  stiffness  and  mass  matrix 

-  Nodal  displacements  (mode  shapes)  and  their  natural 
frequencies 

Description  of  the  compatibility  between  the  binary 
MNF  and  ADAMS  software  in  different  hardware 
platforms  can  be  found  in  [2]. 

The  three  arms  of  the  hydraulically  driven  log  crane 
presented  in  this  study  have  been  modelled  and  analysed 
separately  in  ANSYS.  Therefore,  three  MNF  have  been 
extracted  from  the  analyses  and  they  have  been  imported 
to  ADAMS  using  the  interface  module  ADAMS/Flex, 
which  uses  an  assumed  modes  method  of  modelling 
flexible  bodies. 

2.3.  Flexible  Bodies  in  ADAMS.  Lagrange 
formulation 

ADAMS  uses  an  assumed  modes  method  of  modelling 
flexible  bodies  called  Modal  Flexibility.  Modal  Flexibility 
assigns  a  set  of  mode  shapes  to  a  flexible  body.  During  a 
time  analysis  the  relative  amplitude  of  each  eigenvector  is 
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calculated  as  a  state  variable.  The  principle  of  linear 
superposition  is  then  used  to  combine  the  mode  shapes  at 
each  step  time  to  reproduce  total  deformation  of  the 
flexible  body.  This  is  executed  by  means  of  Lagrange 
formulation. 

The  nodes  of  the  finite  element  model  imported  to 
ADAMS  can  be  assumed  to  undergo  in  linear 
deformations  and  rotations  as  a  combination  of  mode 
shapes.  The  position  of  the  j*  node  is 

rj=x+A(sj+Ojq).  (5) 

Where  x  is  the  coordinates  vector  from  the  global 
origin  to  the  local  body  reference  (l.b.r.),  A  is  the 
orientation  between  the  l.b.r.  and  the  global  origin  in 
Euler  angles,  Sj  is  the  undeformed  location  of  the  jth  node 
in  the  l.b.r.,  is  the  contribution  of  the  mode  shape  to 

the  j*  node,  and  q  is  the  vector  of  modal  amplitudes.  Now 
the  vector  of  generalised  coordinates  for  the  flexible  body 
can  be  represented  as 

\  =  {x  v  q}  •  (6) 

According  to  Lagrange  formulation,  the  motion  of  a 
body  defined  by  the  generalised  coordinates  in  equation  6 
can  be  described  as 


d  | 

8L  SF 

~0O_ 

dt 

UJ 

+  di;  + 

_d$\ 

Where  L  is  the  Lagrangian,  O  is  the  matrix  of 
constraint  equations,  F  is  the  matrix  of  damping  forces, 
X  is  the  matrix  of  the  reaction  forces  vectors  in  the 
constraints,  and  Q  is  the  matrix  of  external  applied  forces. 

After  relating  the  derivative  of  the  Euler  angles  \j /  with 
the  angular  velocity  of  the  l.b.r.  relative  to  the  ground 
through  the  matrix  B,  the  velocity  of  a  node  j  on  the 
flexible  body  can  be  expressed  as 

Vj  =  [l  -A(sj+$jq)B  A®J.  (8) 

The  Lagrangian  L  can  be  expanded  as  L  =  T-V  with 

T=^Zmj{vj}T{vj}.  (9) 

2  j=l 

v  =  Vffe)+^TK§.  (10) 

Combining  the  equation  8  and  equation  9,  the  kinetic 
energy  of  the  Lagrangian  T  simplifies  into 

T  =  ^tm($.  (11) 

The  gravitational  potential  energy  Vg  in  equation  10 
can  be  expressed  as 


vg=Smj[xj+A(sj+®iqlrg- 

i=i 

Therefore,  the  gravitational  force  is 


(12) 


(13) 


The  matrix  of  damping  forces  F  depends  on  the  modal 
velocities.  According  to  Rayleigh  dissipation  function: 

F  =  |qTDq.  (14) 

Where  D  is  the  damping  matrix  of  the  body,  and  its 
coefficients  have  the  form 


dj  =  2tljA/kjmj  .  (15) 


Then,  the  final  expression  of  the  motion  of  the  body 
according  to  Lagrange  formulation  turns  from  equation 
(7)  to  the  following  in  terms  of  mass,  stiffness  and 
damping: 


0M  ■ 

04  * 


4+  K4+fg+D4+ 


X-Q  =  0. 


(16) 


2.3.1  The  extraction  of  modes 

ADAMS/Linear  allows  ADAMS  models  to  be 
represented  by  complex  valued  eigendata  (eiganvalues 
and  mode  shapes).  ADMS/Linear  carries  out  a 
condensation  of  the  flexible  structure  by  using  the  Modal 
Analysis  method  described  in  section  (2.1)  to  the  whole 
mechanism.  Therefore,  the  flexible  mechanism  is 
modelled  by  two  components:  The  Rigid  Body  component 
describing  the  motion  of  the  Body  reference,  and  a  set  of 
Differential  Equations  describing  the  structural 
deflections.  The  eigendata  solution  results  from  solving 
the  general  eigenvalue  problem  of  the  same  form  as 
equation  4.  A  developed  analytical  method  for  obtaining 
the  Linear  State  Equations  can  be  found  in  [4]. 

3.  The  model 

The  hydraulically  driven  log  crane  simulated  is  a 
typical  hydraulic  crane  used  in  harvesting  and  log 
handling  forestry  machines.  Three  booms  (swing  arm,  lift 
arm  and  stick  arm)  compose  the  crane.  Three  actuators 
drive  the  real  crane  by  means  of  electrically  directional 
valves,  conferring  the  mechanism  3  DOF.  However  the 
turning  actuator  has  not  been  modelled  as  the  contribution 
of  this  rotational  DOF  to  the  final  results  has  been 
observed  to  be  negligible. 
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3.1.  Model  of  the  boom  structure 


(17) 


The  three  booms  of  the  crane  structure  (swing  arm,  lift 
arm  and  stick  arm)  have  been  modelled  as  elastic 
structures  using  ADAMS  software.  4-node  shell  elements 
have  been  used  to  model  the  boom  structures  flexibility. 
The  swing  arm  is  modelled  with  434  nodes,  the  lift  arm 
with  961  nodes  and  the  stick  arm  with  599  nodes, 
conferring  to  the  whole  mechanism  almost  12  000  DOF. 

The  three  FEM-modelled  booms  are  analysed  and 
exported  separately  to  ADAMS,  where  the  rest  of  the  rigid 
parts  of  the  crane  (load,  cylinders  and  links)  are  modelled 
and  assembled  to  the  mechanism.  A  load  of  5  000  N  is 
applied  at  the  end  of  the  stick  arm. 

The  ANSYS  Modal  Analysis  has  been  set  up  to  extract 
30  modes  associated  to  the  first  30  natural  frequencies  of 
each  body.  ADAMS  by  default  enables  all  the  modes 
when  the  MNF  is  imported.  However,  in  order  to  increase 
the  efficiency  of  simulations,  only  the  modes  that 
contribute  to  the  motion  to  simulate  are  enabled.  An 
another  criterion  is  to  disable  the  modes  that  do  not 
significantly  contribute  to  the  total  strain  energy  of  the 
flexible  body.  This  task  can  be  carried  out  automatically 
by  ADAMS. 

3.2  Modelling  he  hydraulic  circuit 

The  hydraulic  cylinders  are  the  interface  between  the 
mechanism  and  the  fluid  power  system.  The  rest  of  the 
hydraulic  components  modelled  are  the  valves  and  the 
pump.  The  scheme  of  the  hydraulic  circuit  used  to 
transmit  power  to  the  log  crane  is  presented  in  figure  2. 


Figure  2.  Scheme  of  the  hydraulic  circuit  and  control  signal  for 
the  valves 

The  flexible  behaviour  of  the  fluid  power  is  described 
[4]  by  the  dynamic  pressure  fluctuation  in  the  circuit: 


P  =  ^(QC+Qv)- 

VC 

Where  Beff  is  the  effective  bulk  modulus,  Vc  is  the 
volume  in  the  active  chamber  of  the  cylinder  plus  the 
volume  compressed  between  the  actuator  and  the  valve. 
Qc  and  Qv  are  the  flow  to  the  actuator  chamber  and  the 
flow  through  the  valve  respectively,  and  computed  as 
follows 

QC=ACX.  (18) 


Qv  =|Rv|Kva/p„-P  •  (19) 


Where  Ac  is  the  section  area  of  the  cylinder,  and  X  is 
the  relative  speed  between  the  two  chambers.  On  the  other 
hand,  Rv  is  the  relative  control  signal  (see  figure  2),  PP  is 
the  supply  pressure  and  Kv  is  the  flow  coefficient  of  the 
control  valve  that  can  be  written  as  follows 


Kv 


Qvnom 

VapT 


(20) 


Where  Qvnom  and  Pv  are  the  flow  and  the  drop  pressure 
respectively  when  the  valve  is  fully  open.  The  resultant 
hydraulic  force  in  the  cylinder  can  be  set  from  the 
following  equation 


F  AplPpl  Am1Pmi  C*X .  (21) 

Where  the  term  C^X  is  the  friction  force  between 
seals  and  the  cylinder  wall.  In  this  model  the  viscous 
friction  Cvis  has  been  fixed  as  a  constant.  However  in  a 
more  accurate  approach  [5],  the  viscous  friction  would  be 
treated  as  a  variable  depending  on  the  sliding  velocity  and 
the  pressure  acting  in  the  cylinder. 


3.3  Design  of  the  simulation 

The  comparison  between  the  rigid  model  and  the 
flexible  model  of  the  log  crane  is  carried  out  by 
comparing  the  natural  frequencies  extracted  in  different 
operating  points.  Combinations  of  20  positions  in  both 
actuators  have  been  fixed  in  order  to  perform  the 
experiment. 

Therefore,  a  400-point  chart  of  natural  frequencies  is 
obtained  for  each  rigid  and  flexible  models  of  the  log 
crane.  Figure  3  and  figure  4  show  the  range  of  positions 
taken  by  the  crane. 
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4.  Results  and  conclusions 


Figure  3.  Minimum  and  maximum  stroke  among  the  20  positions 
of  the  lifting  cylinder 


Figure  4.  Minimum  and  maximum  stroke  among  the  20  positions 
of  the  piling  cylinder 

The  operating  points  have  been  reached  by  powering 
the  lifting  actuator  from  initial  position  (minimum  stroke) 
to  final  position  20  steps.  At  each  one  of  the  20  lifting 
cycles,  the  piling  actuator  takes  a  different  configuration 
according  to  the  figure  4  (from  horizontal  end  arm  to 
vertical  end  arm).  Both  actuators  have  been  powered 
according  to  the  control  ramps  showed  in  figure  2. 
Therefore,  a  linearization  of  the  dynamic  equations  of 
motion  is  carried  out  along  the  combination  of  20  equally 
spaced  positions  of  both  piling  and  lifting  actuators, 
obtaining  the  modes  of  the  system  at  400  operating 
positions  of  the  crane. 


Firstly  are  presented  the  modes  extracted  concerning 
the  two  DOF  of  the  crane  mechanism.  Both  rigid  and 
flexible  models  show  identical  mode  shapes  but,  on  the 
other  hand,  their  natural  frequencies  present  some 
differences. 

4.1  First  mode 

The  chart  in  figure  5  shows  the  natural  frequencies 
belonging  to  the  first  mode  of  the  rigid  model.  The  values 
of  natural  frequencies  are  strongly  dependent  on  the 
position  of  the  stick  arm,  however  the  position  of  the 
lifting  cylinder  is  not  very  sensitive  to  the  natural 
frequencies  of  the  system. 


cy12  (m] 


Figure  5.  Chart  of  the  natural  frequencies  of  the  first  mode  of  the 
rigid  model. 

A  similar  chart  (figure  6)  is  obtained  from  extracting 
the  natural  frequencies  in  the  flexible  body. 
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Figure  6.  Chart  of  the  natural  frequencies  of  the  first  mode  of  the 
flexible  model. 
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The  only  detectable  difference  between  rigid  and 
flexible  models  can  be  found  in  a  smooth  variation  of  the 
slope  of  the  surface  of  the  flexible  model  chart,  around  an 
intermediate  position  of  the  piling  actuator.  This  fact  can 
be  observed  in  the  next  chart  (figure  7)  where  the  relative 
error  between  both  natural  frequency  extractions  has  been 
plotted.  The  maximum  difference  occurs  when  the  stick 
cylinder  has  a  stroke  of  1330  mm.  (intermediate  position 
according  to  figure  4)  and  the  lifting  cylinder  is  in  the 
lowest  position  (according  to  figure  3).  In  that  position  the 
relative  error  between  the  natural  frequencies  of  both  rigid 
and  flexible  models  is  4.7  %. 


Figure  7.  Chart  of  the  relative  error  between  the  first  natural 
frequencies  of  the  rigid  and  the  flexible  model. 

4.2  Second  Mode 

The  natural  frequencies  obtained  from  the  extraction  of 
the  second  mode  in  the  400  positions  of  the  crane  are 
showed  in  the  following  charts.  In  figure  8  the  second 
natural  frequency  chart  of  the  rigid  model  is  showed. 


Figure  8.  Chart  of  the  natural  frequencies  of  the  second  mode  of 
the  rigid  model. 


Figure  9  shows  the  second  natural  frequency  chart  of 
the  flexible  model.  As  in  the  first  mode,  the  position  of  the 
lifting  cylinder  has  not  an  important  influence  on  the 
discrepancies  of  the  natural  frequencies  between  both 
models.  Comparing  the  value  of  the  natural  frequencies  in 
both  rigid  and  flexible  charts,  it  can  be  observed  that 
major  discrepancies  are  obtained  in  the  second  mode. 
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Figure  9.  Chart  of  the  natural  frequencies  of  the  second  mode  of 
the  flexible  model. 

The  phenomena  can  be  better  observed  in  figure  10, 
where  the  relative  error  between  the  natural  frequencies  of 
both  models  is  plotted.  For  extended  positions  of  the  stick 
arm  (minimum  stroke  of  the  piling  actuator)  the  natural 
frequencies  in  the  flexible  body  are  up  to  14.8%  higher 
than  in  the  rigid  model.  The  divergence  decreases  when 
the  position  of  the  piling  actuator  makes  the  stick  arm  to 
reduce  the  total  length  of  the  crane. 

The  relative  error  chart  obtained  in  the  extraction  of 
the  second  mode  (figure  10)  seems  to  be  more  logical  than 
the  one  obtained  in  the  first  mode  (figure  7).  The  torque 
arm  of  the  crane  reach  the  maximum  value  for  extended 
positions  of  the  stick  arm  (when  the  stick  arm  is  parallel  to 
the  lift  arm).  In  those  positions  the  stress  in  the  boom 
structure  is  also  maximum  and  deformations  of  the 
flexible  modelled  booms  are  induced.  These  deformations 
have  a  non-negligible  rule  on  the  equation  of  motion  of 
the  system,  and  consequently  they  are  contributing  to  the 
values  of  the  natural  frequencies. 
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Figure  10.  Chart  of  the  relative  error  between  the  second  natural 
frequencies  of  the  rigid  and  the  flexible  model. 

The  two  first  modes  presented  before  were  associated 
to  vibrations  of  the  system  along  the  equilibrium  position 
of  the  two  actuators.  However,  between  the  two  natural 
frequencies  associated  to  the  two  DOF  of  the  mechanism, 
the  flexible  model  presents  an  intermediate  mode.  This 
mode  is  associated  to  vibrations  of  the  log  crane  due  to 
the  deformations  of  the  mechanical  boom  structure.  Figure 
1 1  shows  the  natural  frequency  chart  of  the  mode  shape. 
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Figure  11.  Intermediate  mode  of  the  flexible  model  due  to 
mechanical  deformation  of  the  booms. 

The  mode  shape  associated  to  this  intermediate  mode 
is  characterised  by  the  deformations  of  the  boom  structure 
due  to  the  bending  of  the  lift  and  stick  arm.  Figure  12 
shows  the  deformation  taken  by  the  structure  when  it  is 
excited  by  an  arbitrary  natural  frequency  belonging  to  the 
intermediate  mode  shape  (figure  11). 


Figure  12.  Mode  shape  of  an  intermediate  mode  of  the  flexible 
model. 

4.3  Conclusions 

The  natural  frequencies  of  a  mechanical  system  use  to 
be  a  design  variable  during  the  machine  design  process. 
From  the  analyses  of  the  results  obtained  in  this  paper,  it 
can  be  concluded  that  the  flexibility  component  plays  an 
important  role  in  the  determination  of  this  dynamic 
property.  The  results  show  that  hydraulic  flexibility 
governs  the  problem  and  mechanical  flexibility  deviates 
the  natural  frequencies  up  to  a  15  %.  Moreover,  many 
other  modes  can  be  identified  in  the  mechanical  flexible 
model.  For  instance  a  dozen  of  mode  shapes  (with  natural 
frequencies  contained  between  5  Hz  and  200  Hz)  due  to 
the  bending  (of  lifting  and  sticking  booms)  and/or  torsion 
(of  the  lifting  boom)  has  been  found.  Other  local  modes  of 
higher  frequency  can  be  also  detected  although  they  do 
not  contribute  to  the  general  motion  of  the  crane. 

The  modelling  of  the  flexibility  properties  should  be 
taken  into  account  when  analysing  such  hydraulically 
driven  mechanisms.  Therefore,  virtual  testing  technique 
can  be  very  suitable  as  an  optimisation  tool  for  drive 
technology  systems  of  complex  kinematics  and  dynamic 
equations.  The  use  of  both  FEM  and  ADAMS  software  to 
model  the  prototype  is  the  most  suitable  choice  to  achieve 
accurate  results. 
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Abstract 

paper  reveals  theoretical  and  experimental  results , 
concerning  the  dynamic  experimental  identification  of  a 
proportional  valve,  destined  to  replace  the  moving  coil 
valves  (an  old  generation),  belonging  to  the  speed 
governors  of  the  hydraulic  turbines.  It  was  made  the 
experimental  validation  of  the  mathematical  model, 
written  for  the  servomechanism  equipped  with  the 
previously  mentioned  proportional  valve.  It  was  proposed 
a  synthesis  method  for  the  PI  controller,  with  additional 
state  feedback ,  corresponding  to  a  classical 
servomechanism  (containing  a  moving  coil  valve). 

7 he  experiments,  the  numerical  simulations  and  the 
proposed  algorithm  of  synthesis  revealed  important 
practical  conclusions,  concerning  the  dynamic  behaviour 
of  the  proportional  valve  by  himself  and  within  the 
sen’omechanism  and  the  advantages  of  the  proposed 
synthesis  algorithm. 


L  Introduction 

The  presence  of  electronic  equipment  in  the 
electrohydraulic  servomechanisms  confirms  the  modern 
concept  of  mechatronics  [3]  Flexibility  of  these 
servomechanisms  depends  on  the  hardware  and  software 
devices,  highly  integrated  in  the  concept  of  total  automatic 
processes  (hierarchical  control  structures). 

Modem  theoretical  and  experimental  research  are 
focused  on  structural  and  design  improvement  for  elements 
and  systems,  in  association  with  a  coherent  methodology 
for  analysis  and  synthesis  of  the  system  controllers. 

A  very  important  device  in  every  servomechanism  is  the 
electrohydraulic  interface  (proportional  valves  and  servo 
valves).  This  ones  were  spectacular  developed  recently. 


great  companies  (Bosch,  Rexroth,  Parker)  offering 
proportional  valves  with  high  dynamic  performances,  by 
using  local  position  controlling  loops.  The  position 
feedback  loop  from  the  spool  valve,  completed  with 
additional  feedback  state  control  increases  the  stability  and 
improves  the  transient  response  [7]. 

The  servomechanisms  destined  to  speed  governors  for 
hydraulic  turbines  include  two  fundamental  non  linear 
aspects  (different  acting  areas  of  the  guide  vanes  hydraulic 
cylinder  and  the  ‘  dead  band”  of  the  characteristic  curve  for 
the  proportional  valve).  The  asymmetry  of  behaviour  can 
be  eliminated  by  using  two  different  controllers, 
corresponding  to  each  moving  direction.  A  self-learning 
controlling  algorithm  is  able  to  compensate  different 
influences  on  the  width  of  the  “dead  band”,  increasing  the 
steady-state  and  dynamic  performances  [5]. 

This  paper  reveals  some  theoretical  and  experimental 
results,  obtained  by  the  authors  in  the  following  areas  of 
interest  [4]: 

-  the  dynamic  experimental  identification  of  a 
proportional  valve,  destined  to  replace  the  moving  coil 
valves  (an  old  generation),  belonging  to  the  speed 
governors  of  the  hydraulic  turbines;  it  was  determined  the 
equivalent  transfer  function,  using  the  experimental  data, 
previously  mentioned; 

-  the  experimental  and  numerical  simulation  validation 
of  the  mathematical  model,  written  for  the  servomechanism 
equipped  with  the  previously  mentioned  proportional  valve; 

-  the  conception  of  a  coherent  synthesis  method  of  the 
PI  controller,  with  additional  state  feedback,  corresponding 
to  a  classical  servomechanism  (containing  a  moving  coil 
valve). 

All  numerical  simulations  implied  graphical  networks, 
specific  to  the  software  package  MATLAB-  S1MULINK 
and  TOOLBOX  CONTROL  MATLAB  [9] 
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2.  Experimental  dynamic  identification 
of  the  proportional  valve 

The  proportional  valves  steady-state  characteristics  are 
non  linear,  with  a  greater  hysteresis  than  the  moving  coil 
valves  [4,5].  The  low  sensibility  for  impurities,  the  modern 
fabrication  technology  and  the  cheaper  price  impose  this 
valve  category.  In  order  to  diminish  the  insensitivity  and 
the  hysteresis,  over  the  command  signal  it  is  superposed  a 
“dither”  signal,  or  it  can  be  controlled  the  position  of  the 
spool  valve  within  a  local  servomechanism. 

The  reference  model  of  the  3/3  proportional  valve  (fig. 
1)  includes  a  solenoid  and  a  critical  centre  flow  valve  with 
circular  hydraulic  inlets  on  the  sleeve  [4] 
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Fig.  1  The  symbolic  functioning  scheme 
for  the  proportional  valve 


Response 


Fig.  2  Step  response  and  the  graphical 
identification  scheme 


In  accordance  with  the  experimental  step  response 
(fig.  2)  we  propose  an  approximate  third  order  transfer 
function: 


h(s)= 


K 


Qi 


(l) 


An  automatic  methodology  for  dynamic  identification  is 
implemented  in  the  software  package  MATLAB- 
TOOLBOX  NCD  (NCDBLOCK  [9]).  This  way  we 
determined  precisely  the  values  of  the  coefficients, 
introduced  by  equation  (1):  K.^95,7533  1/min/A; 

con3=  109,3  rad/s;  £=0,6928;  T3=16,6  ms. 


3.  Experimental  identification  of  a 
servomechanism  with  proportional  valve 

The  experimental  stand  scheme  of  the  servomechanism 
with  proportional  valve  is  presented  in  Figure  3. 

The  transfer  function  of  the  PD  electronic  controller  can 
be  written  as  follows: 


1  + 


(k1,  +  I0)xl0  's 
Trs  +  1 


(2) 


K|>  1  + 


Tds 


Tfs  +  1. 


where:  K  P  =  0...J00  is  a  gain  coefficient;  Kn  =  0...100  is 
an  accordance  parameter  for  the  derivative  component;  Tf 
—  1 0"3  s  -  “filtering”  time  parameter,  active  at  low 
frequencies  and  at  starting  of  a  transient  regime;  KP  -  gain 
coefficient  for  the  proportional  component;  Tj>  -  time 
derivative  coefficient. 


Fig.  3  Experimental  testing  stand 
of  the  servomechanism 

The  mathematical  model  of  the  studied  servomechanism 
(fig.  3)  contains  the  following  equations  [4]: 

-  equations  corresponding  to  the  proportional  valve; 

-  the  continuity  equation  applied  to  the  flow  domain 
placed  between  the  “A”  port  of  AEH  and  the  high  surface 
chamber  of  CHDE; 

-  the  dynamic  equation  applied  on  the  moving  part  of 
CHDE. 

With  the  upper  mentioned  equations  we  assemble  a 
graphical  simulation  network  and  determine  the  input  step 
response  of  the  servomechanism.  The  numerical  values  of 
the  coefficients  are:  Aj  =  1963,5  mm  2;  Rj,c  =  20  MN/m; 
mcn-  1 2  Kg;  A2  =  1159  mm 2;  ps  =  30  bar;  FjC'n  -  40  N. 
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A  comparison  between  the  simulated  and  experimental 
step  responses  is  presented  in  Figure  4.  The  experimental 
identified  values  of  the  accordance  parameters  tor  the  PD 
electronic  controller  are:  Kp  =  8,9  and  T»  =  46  ms.  A  very 
good  resemblance  exists  between  the  two  curves,  excepting 
some  differences  in  the  beginning  of  the  transient  regime; 
this  differences  are  connected  to  the  influence  of  length  of 
the  supply  line  (some  pressure  variations)  [4]. 


step  responses 

4.  Synthesis  algorithm  for  the  controller 
of  an  electrohydraulic  servomechanism 

The  classical  electrohydraulic  servomechanism  destined 
to  the  speed  governors  for  hydraulic  turbines  contains  a  3/3 
moving  coil  valve  CEHBM,  a  4/3  power  mechanohydraulic 
proportional  valve  AMH,  a  position  transducer  TPZ  and  an 
electronic  controller  REZ  (fig.  5). 

The  mathematical  model  corresponding  to  the  dynamics 
of  the  servomechanism  are  [4]: 

-  the  transfer  function  for  the  moving  coil  valve,  with 
state  feedback  and  integrator  bloc  on  the  command 
channel; 

-  flow  equation  for  the  open  centre  moving  coil  valve; 

-  the  continuity  equation  written  on  the  flow  domain  (£) 

(%  5); 

-  the  dynamic  equation  on  the  spool  of  the 
mechanohydraulic  proportional  valve  AMH. 

Starting  from  these  equations  we  propose  a  state 
controller;  the  transfer  function  for  the  fixed  part  of  the 
system  can  be  written  as  follows: 

H(s)  =  HCEM(s)H,(s) 

= _ goKoK-'a _  (3) 

(b0s  +  K0K')(p3s3  +  P2s2  +  P,s  +  p0) 

On  the  command  channel  we  choose  a  PI  controller 
configuration  (fig.  6),  in  order  to  ensure  the  steady-state 
precision. 
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Fig.  5  The  structure  of  the  classical 
electrohydraulic  servomechanism 


Fig.  6  The  state  controlled 
servomechanism  equivalent  scheme 


The  transfer  function  of  the  state  controlled 
servomechanism  can  be  written  as  follows: 


H«(s)  = 


goKpK'a 

z(s)  1*0^3 


KPzs-f 


1 


(4) 


where: 


up(s)  Numitor2(s) 

|  KpK’P;  +g0K0K'«K„  +  bq|3,  3  ( 

K0K,p1  +  g0K0K'aKvz  j-  b0p0  ^ 
fiop3 

,  K0K-p0+gQK0K-aKp/G|  g0K0K'q 

b0P3  W3 

With  the  transfer  function  (4)  it  is  realised  a  good 
steady-state  precision  (zero  error  for  input  step  response). 
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The  problem  to  solve  is  the  placement  of  the  proper 
values  corresponding  to  the  following  form  of  the  transfer 
function  H06  (s): 


-“(s  +  z2) 
Z2 

(s  +  p52) 


(5) 


x  _ P12P22PJ2P42 

(s+p12)(s+p22)(s  +  p32Xs  +  p42) 

With  some  preliminary  numerical  simulations  we 
conclude  the  existence  of  two  dominant  poles, 
corresponding  to  a  second  order  system,  completed  with 
the  presence  of  two  complex  related  poles,  characterised  by 
a  very  low  damping  factor.  The  additional  pole  and  zero, 
p52  and  z2  were  chosen  very  close  to  the  origin  (a 
“dipole”). 

We  choose  the  poles  p32  and  p42  like  having  the  modulus 
of  the  imaginary  part  very  high  (in  accordance  with  the 
preliminary  numerical  simulations),  the  real  part  being 
calculated  with  the  formula: 


Re(p32 )  =  Re(p42 )  =  ElL±Eu  (6) 

From  the  identification  of  the  other  coefficients  from  the 
transfer  functions  (4)  and  (5)  we  obtain  the  equations  for 
the  accordance  parameters  of  the  controller: 

_  Pl2P22P32P42Ps2k<)p3 

z2g«KoK’a 

1  (8) 
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P|2P22(P32  "l"  P42) 
b0P3|  "*"P32P42(Pl2  +  P22) 

f  P12P22  P32P42  ^ 

\(pl2  +  P22XP32  +  P42). 
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PS2 
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goKoK'a 
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(9) 


P12P22  +  P32P42 
^oP?|  +(Pl2  +  P22XP32  +  P42) 
+(Pl2  +  P22  +  P32  +  P42)P5: 
K0K'p2  +  b„P, 


(10) 


By  accepting  the  dynamic  behaviour  of  the 
servomechanism  with  a  second  order  delay  element  and 
imposing  Q2  and  tr2  we  can  calculate  the  values  for  the 
according  parameters  with  the  equations  (8),...,(i  1)  [4], 


5,  Conclusions 

The  experiments,  the  numerical  simulations  and  the 
proposed  algorithm  of  synthesis  for  the  controller  of  the 
servomechanism  lead  to  some  practical  conclusions: 

-  the  dynamic  behaviour  of  the  proportional  valve 
corresponds  to  a  third  order  delay  element,  over  damped;  it 
is  possible  to  accept  a  second  order  or  even  a  first  order 
transfer  function,  depending  on  the  dynamics  of  the  entire 
system; 

-  the  according  controller  parameters  KP  and  TD  of  the 
servomechanism  with  proportional  valve  are  identically  for 
the  mathematical  model  and  in  the  experimental  tests. 
Choosing  a  right  balance  between  the  parameters  we  can 
satisfy  the  compromise  precision-stability,  depending  on 
the  imposed  performances; 

-  the  proposed  synthesis  algorithm  for  the  classical 
servomechanisms,  dedicated  to  the  speed  governors  for 
hydraulic  turbines  isn’t  unique,  concerning  the  structure  of 
the  controller  and  the  position  of  the  proper  values  on  the 
equivalent  transfer  function.  The  proposed  analytical 
equations  facilitate  a  rapid  first  dimensioning  of  the 
controller  and  the  implementation  of  a  self-learning  control 
structure. 
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Abstract 

This  paper  presents  the  realisation  of  force  transducers 
with  0  —  6300  kN  measure  range,  designed  to  work  with 
hydraulic  cylinders  in  automatic  positioning  systems  with 
real  force  control.  These  systems  are  used  in  materials  * 
stress  testing  installations  as  well  as  in  vibrations  testing 
installations  for  industrial  equipment.  The  constructive 
solution  chosen  by  the  authors  is  to  use  displacement 
sensors  with  variable  reluctance  in  order  to  measure  the 
central  arrow  of  an  elastic  steel  membrane.  The  paper 
contains  an  accessible  presentation  for  calculating  the 
sensible  items  and  it  presents  some  experimental-obtained 
conditions  required  to  obtain  high  linearity.  There  is  also 
presented  the  automated  hydraulic  testing  system  of  these 
transducers  considering  that  the  measuring  ranse  of  force 
islOkN ...  630  kN. 


1.  Introduction 

In  the  last  10-15  years  the  hydraulic  and  pneumatic 
equipment  became  wide-spread  in  the  industry  and  the 
trend  is  to  grow  even  more  and  to  replace  another  type  of 
systems  ( e.g.  electrical  and  mechanical  ones ).  This  trend 
is  due  to  their  great  advantages  -  sturdiness  and 
adaptability  to  a  wide  range  of  working  conditions,  high 
level  forces  and  torque  obtained  with  relatively  low  cost 


prices  and,  most  of  all,  the  intregrability  in  complex 
automation  systems.  Using  the  proportional  elements’ 
techniques  and  electronically  controlled  servo-  devices, 
the  hydraulic  systems  can  be  integrated  in  computer- 
controlled  automation  systems  with  high  precision, 
excellent  response  time  and  smaller  dimensions,  making 
them  unbeatable  in  industrial  control  and  robotics.  A 
typical  example  is  represented  by  the  hydraulic 
servocylinders.  The  compact  actuator-type  variant  (which 
includes  the  cylinder,  the  servovalve,  the  position 
transducer  and  the  electronic  regulator  )  is  quite  used  in 
mass  production  by  several  companies.  This  solution  is 
based  on  electrical  position  feedback  and  represents  the 
typical  mechatronic  subsystem.  Basically,  electric 
feedback  represents  an  automated  system  for  position 
adjusting  that  becomes  able  to  realise  a  strictly  controlled 
translation  displacement  using  a  specific  transducer  and 
an  electronic  regulator.  In  many  cases  it  is  necessary  to 
control  also  the  realised  force  by  introducing  a 
supplementary  electrical  feedback  that  uses  a  force 
transducer.  In  some  cases  it  is  necessary  to  be  able  to 
measure  the  pressure  difference  between  the 
servocy finder’s  chambers,  the  acceleration  or  the  velocity, 
so  there  are  needed  specific  transducers  for  these 
mechanic  variables.  On  the  other  hand,  due  to  the  trend  of 
miniaturisation,  the  electronic  blocks  for  conditioning  the 
transducers  signals  are  integrated  in  transducers  bodies 
and  this  is  the  way  to  obtain  integrated  electronics 
transducers  which  offer  the  big  advantage  of  a  high  level 
immunity  to  electro-magnetic  interference  (because  the 
output  signal  is  a  current  and  not  a  voltage  ). 
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Considering  these  concepts,  The  Research  Institute  for 
Hydraulics  and  Pneumatics  (IHP)  has  started  some  years  a 
go  the  design  of  a  full  range  of  sensors  with  integrated 
electronics  that  are  used  in  hydraulic  and  pneumatic 
systems.  At  this  time  we  have  pressure  transducers  till  500 
bars,  200  bars  differential  pressure  transducers,  400  mm 
linear  displacement  transducers,  5000  RPM  rotation 
transducers,  100  kN  force  transducers  and  ,  in  the  future, 
200  Nm  torque  transducers.  As  a  common  aspect  of  these 
transducers,  all  of  them  are  made  with  variable  reluctance 
inductive  sensors.  The  advantage  of  this  kind  of  sensors  is 
their  high  sensibility  ,  the  fact  that  movable  core  is  also 
the  elastic  item  and  it’s  not  necessary  to  be  in  mechanical 
contact  with  it. 

The  present  paper  describes  the  100  kN  inductive  force 
transducer,  designed  and  made  by  the  authors  in  order  to 
use  it  in  hydraulic  installations  of  a  stress/vibrations 
testing  machine. 

2.  Theoretical  aspects 

Force  transducer  made  at  IHP  contains  elastically 
diaphragm  and  variable  reluctance  displacement  sensor. 
This  type  of  transducer  has  three  sensible  items: 

-  one  circular  elastic  clamped  diaphragm  made  from 
stainless  steel;  the  measured  force  is  applied  to  the  centre 
of  diaphragm 

-  the  measuring  sensor  represented  by  a  14  millimetres 
diameter  ferroxcube  potcore  placed  in  front  of  the 
diaphragm  with  an  initial  gap  80 

-  the  reference  sensor,  identically  with  the  measurement 
one,  but  placed  in  its  behind;  the  magnetic  circuit  is 
closed  through  a  stainless  steel  armature;  there  is  also  an 
adjustable  gap  for  the  symmetry  of  the  two  sensors. 

The  main  problem  is  the  correct  expression  of 
mechanical  condition  of  the  diaphragm  from  the  point  of 
view  of  elasticity  theoiy. 

For  the  elastically  diaphragm  loaded  with  a 
concentrated  force,  it  is  recommended  to  use  following 
calculation  formulas: 

-  for  equivalent  unit  stress  calculation: 

cre  =  KxF/h2  (1) 

-  for  centred  deflection  of  the  diaphragm: 

Xo  =  K,  xFR2/Eh3 

(2) 

where  F  -  central  force,  E  -  elastic  modulus  of  the 
diaphragm’s  steel,  h  -  diaphragm’s  thickness,  R  - 
diaphragm’s  outer  radius,  K  and  Kj  -  R  /r0  ratio 
depending  coefficients,  where  r0  is  a  radius  of  the  central 
coupling  bolt . 

The  calculation’s  premises  must  to  foresee  the 
realisation  dimension’s  size  witch  covered  domain 
10. ..630  kN  using  a  unitary  construction  and  a  commune 


electronics.  For  industrial  development  it  used  the 
solution  in  witch  diaphragm’s  radius  and  R/  r0  ratio  is 
choosing  in  three  steps  and  diaphragm’s  thickness  is 
choosing  in  depending  of  load  F  and  radius  R. 

The  transducer  presenting  in  this  paper  is  of  thick 
diaphragm  type  witch  has  the  ratio  xmax/  h  <0,2.  For  this 
ratio  range,  the  relation  (2)  is  linear. 

The  changes  of  diaphragm’s  central  deflection  cause  the 
increase  or  decrease  of  the  sensor’s  gape  and  therefore  of 
the  inductance  value,  according  to  formula: 


N2 

— k — + 2 

MoMrSpe  M0Sa 

where  L  -  sensor’s  inductance,  N  -  number  of  the  spool 
wire,  lm  -  magnetically  circuit’s  length,  Sfe  and  Sa 
magnetically  field  areas,  po  and  p,-  -  magnetically 
permeability  for  air  and  magnetically  core,  50  -initial  air- 
gape  and  x0  -  diaphragm’s  central  deflection.  For 
reducing  of  the  errors,  it  is  used  a  reference  sensor.  For 
simplicity  it  considers  equal  both  areas  Sfe  and  Sa,  both 
initial  air-gape,  too  and  consider  lm  /  p*  «  50.  The 
inductance  of  the  measure  sensor  can  be  calculated  with 
formula: 


L(x0)=L0 
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(4) 

where  Lo  is  the  initial  inductance  value. 

If  the  two  inductive  sensors  are  half-bridged  connected 
and  supplied  with  a  sinus  voltage  with  o>0  frequency  and 
U0  amplitude  and  we  neglect  the  ohmic  resistance  of  the 
coil,  then  span  voltage  can  be  calculated  with  formula: 


A  U  =  LL 
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2zn 
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A L 
2  U 


=  ±Un 
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(5) 

With  a  view  to  obtain  a  satisfactory  linearity,  it  was 
necessary  to  take  the  some  steps:  it  choose  an  initial  air- 
gape  80  »xmax  ,  a  ferromagnetic  material  with  high 
value  of  pr,  and  a  high  excitation  frequency  to  obtain 
cooLo  »  r0  thus  is  possible  to  linearize  equation  (5) 
Because  x0  is  proportional  to  force,  the  bridge  output 
voltage  is  proportional  to  force,  too.  The  measurement 
system  assure  a  conversion  linearity  better  as  1%. 


3.Construction 


The  force  transducer  consist  in  circular  clamped 
diaphragm  made  from  stainless  steel  with  a  central 
coupling  bolt  to  apply  the  measured  force. 
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The  diaphragm  is  mounted  on  the  cylindrical 
undeformable  body  (fig.  1).  This  transducer’s  body 
contain  the  two  sensors  placed  in  front  of  centre  of 
diaphragm.  An  aluminium  box  fixed  on  the  transducer’s 
body,  contain  the  electronic  module  for  measurement 
conversion  and  the  output  connector. 

The  more  constructive  problems  must  was  resolved 

-  choose  of  diaphragm’s  steel  . 

-  realisation  of  rigidity  of  diaphragm’s  mounting 

-  realisation  mounting  of  the  central  coupling  bolt 

-  fixing  of  the  diaphragm’s  dimensional  tolerances 

-  fixing  of  the  initial  value  of  the  air-gape  to  obtain  a 
best  linearity 


amplified  by  a  third  operational  amplifier  and 
nonlinearities  compensated.  A  fourth  operational 
amplifier  is  configured  as  an  active  low-pass  filter  and 
provide  to  output  connector  a  voltage  in  the  range  - 
10V...+10V  (  unified  signal  ).  The  four  operational 
amplifiers  are  integrated  in  the  same  chip  LM324  which 
allows  a  small  size  PCB.  It  is  possible  a  zero  adjust  and  an 
amplifier  gain  adjust  for  calibration  of  the  transducer; 
after  calibration  the  transducer  is  sealed.  The  output  stage 
can  be  eventually  configured  as  current  generator  in  order 
to  obtain  an  current-unified  (4... 20  mA)  output  signal. 
Power  supplies  are  +15V  and  _15  V  stabilizated. 


Figure  1.  Force  transducer 


4.  Electrical  structure 

The  functional  block  diagram  of  the  electronic 
amplifier  of  the  force  transducer  is  given  in  Fig.  2.  The 
electronic  module  is  realised  on  a  42X52  millimetres  PCB 
and  contains  one  quadrature  oscillator,  achieved  with  two 
operational  amplifiers,  providing  a  veiy  stabile  5  Vrms  at 
5  KHz  sinus  signal  for  the  measuring  bridge.  A  cosines 
signal  from  second  output  of  the  oscillator  is  passed 
through  two-transistors  zero-crossing  detector  then 
differentiated  by  the  third  transistor,  thus  obtaining  pulses 
required  by  a  sample  and  hold  circuit  made  by  a  FET  and 
a  capacitor.The  sample  pulses  are  situated  on  the  peak  of 
the  sinus  signal.  The  output  measure  bridge  voltage  is 


Figure  2.  Transducer  bloc  diagram 


In  Fig.3  is  shown  the  PCB  of  the  module, 
componentlayer.  The  transducer  has  an  output  4  pins 
circular  connector  for  a  3  wire  shielded  cable. 


Figure  3.  Printed  circuit  board 


5.  Experimental  works 

Force  transducer  made  by  IHP  was  tested  in  a 
calibrating  stand  in  conjunction  with  a  high  precision 
reference  transducer.  Calibrating  curve  was  obtained 
using  a  modem  data  acquisition  system.  The  block 
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schematics  of  the  hydraulic  testing  system  is  shown  in 
Fig-4. 

The  calibrating  stand  contains  a  hydraulic  servocylinder 
with  automatic  control  of  both  displacement  and  force. 
The  special  electronic  control  module  was  also  made  in 
our  testing  laboratory  and  it  can  be  connected  to  the  stand 
computer.  Using  a  ramp-type  command  is  possible  to 
obtain  on  the  computer  screen  the  calibrating  curve  of  the 
transducer  as  well  as  the  force-deflection  curve  of  the 
elastic  membrane.  This  last  curve  allowed  us  to  study  the 
behaviour  of  different  types  of  membranes  in  order  to 
achieve  optimal  solution  and  to  find  initial  gap  between 
sensor  and  membrane  for  optimal  global  linearity  of 
transducer. 


Figure  4.  Test  stand  schematic 

The  process  computer  of  the  stand  has  a  DAS  1701  AO 
data  acquisition  board  from  Keithley.  Using  TestPoint 
software  package  there  were  developed  virtual  instruments 
for  calculating  the  mentioned  curves.  In  Fig. 5  is  shown 
the  virtual  instrument  for  calculating  calibrating  curve  of 
force  transducer. 


Figure  5.  Calibration  diagram 


6.  Conclusions 

The  experiments  on  the  test  stand  proved  that  this  type 
of  transducer  has  high  sensibility,  linearity  (  global 
linearity  under  1%  )  and  reliability  and  veiy  good 
dynamic  behaviour,  despite  its  constructive  simplicity  and 
its  low  cost  .One  big  advantage  of  this  device  is  the 
presence  of  conversion  electronics  into  the  body  of  the 
transducer,  which  means  high  noise  immunity.  In  the 
future  the  electronics  will  be  completed  with  a 
microprocessor  subsystem  in  order  to  increase  precision 
and  linearity  of  the  transducer,  as  well  as  to  use  a  serial 
communication  bus. 
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Abstract 

This  paper  presents  the  different  phases  from  the  design  of 
an  in  pipe  micro  robot  to  its  implementation .  The 
mechanical  part  designed  in  order  to  satisfy  several 
criteria  of  performances  is  detailed.  In  a  second  time  the 
modelling  and  identification  of  each  parameter  have  been 
done,  simulated  and  compared  to  experimental  results. 
Finally  the  control  law  of  the  robot  is'  investigated  to 
optimise  the  positioning  accuracy  and  the  carried  load. 
Perspectives  are  given  to  improve  the  task  of  the  in  pipe 
locomotion  by  reducing  the  size  of  the  robot  for  example. 

1.  Introduction 

Micro  robotics  is  now  a  growing  field  which  has 
been  pushed  ahead  by  the  developments  of 
microelectronics  over  the  last  few  years.  Micro  machining 
such  as  chemical  etching  has  been  a  starting  point  for 
novel  technologies  such  as  micro  stereo  lithography  or 
LIGA  with  those  very  small  pieces  (few  tens  of  microns) 
can  be  machined  very  accurately. 

In  the  same  time,  smart  materials  have  been 
discovered  and  developed  in  order  to  realise  many  micro 
actuators  and  micro  sensors. 

The  miniaturisation  of  macroscopic  robots  allows 
the  realisation  of  micro  tasks.  Most  of  applications  are 
based  on  the  manipulation,  the  locomotion  or  the  handling 
of  tools. 

Domains  most  interested  by  these  micro  tasks  are 
medicine,  defence,  biologic  and  nuclear. 

Micro  systems  closed  than  micro  robots  are 
widely  used  in  medicine  for  endoscopy  for  example. 
Micro  robots  designed  as  ants  or  flies  are  studied  in  order 
to  watch  over  a  hostile  land.  Such  systems  need  to 
communicate  with  each  other  and  take  a  decision 
according  to  several  cases.  Micromanipulation  is  an 
important  domain  of  investigation,  because  micro  robots 
are  useful  to  assembly  other  micro  systems  such  as  micro 
motor  or  else.  This  research  can  lead  to  the  creation  of 


whole  micro  workshops.  For  genetic  or  biologic 
applications  it  is  required  to  catch,  transport  and  set  down 
very  small  organic  pieces. 

It  remains  a  specific  task  that  is  the  locomotion  in 
restraint  environment.  Our  investigations  specially 
concern  the  in-pipe  locomotion.  Many  applications  of 
maintenance  such  as  those  of  pipelines,  canalisation,  or 
nuclear  generators  of  power  plant  or  submarine  are 
targeted.  We  chose  to  study  the  general  case  of  nuclear 
generator  pipes. 

2.  In  Pipe  Micro  Robot 

Many  researches  are  leaded  to  accomplish  an  in 
pipe  micro  robot.  Several  micro  robots  based  on  a 
piezoelectric  have  been  designed  for  in  pipe  locomotion. 

Matsuoka  has  designed  a  robot  that  has  3  legs 
passive  legs  always  in  contact  with  the  pipe  side  [2].  Its 
piezoelectric  actuator  is  placed  in  the  body  of  the  robot.  It 
constitutes  the  only  link  between  legs  and  a  fore  mass. 
This  mass  is  actually  a  micro  sensor.  Then  the  robot 
moves  forward  by  doing  a  slow  stretching  of  piezoelectric 
actuator  and  a  quick  contraction  hence  legs  stick  and  slip 
on  the  pipe  side.  The  advantages  of  this  technology  are  its 
small  size  (pipe  is  10  mm  in  diameter),  its  velocity  that 
can  reach  5  cm/s  and  this  system  can  move  reward. 

Another  kind  of  micro  robot  is  developed  in  the 
French  Japanese  laboratory  LIMMS  or  in  LMA  by 
Minotti.  The  micro  robot  has  many  active  legs  that  push 
against  the  pipe  side  according  one  direction.  The 
performances  of  this  robot  are  closed  to  the  precedent  one. 
Piezoelectric  actuators  usually  need  a  high  voltage  of 
supply  but  can  be  controlled  with  high  frequency. 

Actuators  based  on  the  Shape  Memory  Alloy  are 
very  used  for  in-pipe  micro  robotic.  Indeed  two  French 
laboratories  work  on  this  technology.  In  order  to  inspect 
rectilinear  pipe  of  10  to  15  mm  in  diameter,  the  inchworm 
motion  seems  to  be  the  movement  the  most  suitable. 
Touaiba  has  designed  an  inchworm  micro  robot  that  is 
constituted  by  a  stretching  actuator  in  the  body  placed 
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between  two  parts  with  legs  [1].  The  main  actuator  creates 
a  step  and  legs  are  pushed  against  the  pipe  sides  to  assure 
the  support.  Actuators  are  linear  and  based  on  a  spring  of 
a  Shape  Memory  Alloy  (SMA)  that  can  be  stressed  and 
stretched  according  to  the  temperature  of  the  wire. 

Each  actuator  is  controlled  by  direct  current 
supplied  through  the  SMA  wire.  To  accelerate  SMA  to 
reach  their  initial  position,  each  actuator  works  in  opposite 
or  against  a  natural  spring. 

According  to  another  principle,  Libersa  has 
designed  an  inch  worm  micro  robot  composed  by  five 
modules  as  a  train  [8].  Each  module  is  able  to  push  against 
the  pipe  side  to  support  the  body  or  to  be  expanded  to 
create  a  small  motion  forward.  Each  module  is 
successively  excited  that  create  a  wave  hence  contact 
point  against  the  pipe  moves  forward  as  a  snake. 

A  module  is  actuated  by  SMA  wires  that  deform 
a  passive  flexible  square  structure  that  is  pre-stressed.  In 
this  case  the  displacement  create  by  actuator  is  amplified 
by  the  mechanical  structure  and  each  wire  works  in 
opposite  with  another  one. 

Such  actuators  are  very  interesting  for  micro 
robotic  because  they  have  a  good  power  to  weigh  ratio 
and  they  can  be  machined  with  the  desired  shape.  SMA 
provide  very  large  displacement  contrary  to  piezoelectric. 
However  their  main  inconvenient  is  their  control  where  it 
is  quite  easy  to  heat  the  wire  to  control  its  shape  but  it  is 
more  difficult  to  cold  it.  This  is  why  they  have  a  long 
response  time. 

The  last  main  technology  employed  for  in  pipe 
micro  robotics  is  fluidic.  Indeed  the  pneumatic  power  can 
provide  high  force.  For  this  reason  several  actuators  have 
been  designed  to  profit  of  this  characteristic. 

In  the  macroscopic  world,  artificial  muscles 
exemplify  an  interesting  kind  of  actuator  [3].  Indeed  such 
actuators  provide  a  stretching  motion  when  they  are 
supplied  by  pneumatic  pressure.  As  real  muscles,  the 
actuator  is  inflated  hence  its  section  is  growing  while  its 
length  is  decreasing.  When  these  actuators  are  mounted  in 
parallel  structure  and  work  in  opposite  each  other,  they 
can  supply  linear  displacement  in  their  main  axis  and 
rotation  movement  around  several  axis.  This  characteristic 
is  enable  because  whole  actuator  is  compliant.  This 
actuator  could  be  used  to  actuate  micro  robot  arm  where 
each  joint  can  create  several  rotations. 

The  Flexible  Micro  Actuator  (FMA)  is  shaped 
like  a  finger  [4].  3  long  chambers  that  can  be 
independently  supplied  by  pneumatic  pressure  compose 
its  section.  When  the  3  chambers  are  excited  in  the  same 
time  the  actuator  is  stretched,  in  any  other  cases  it  is 
bending  in  a  direction  of  the  chamber  that  is  the  least 
supplied  in  pressure. 

A  repeating  sequence  of  the  different  movement 
provides  an  elliptic  displacement  of  the  tip  of  the  actuator. 
Hence  it  can  reproduce  the  movement  of  a  foot  on  the 
ground.  This  actuator  has  been  miniaturised  to  realise  an 


in  pipe  robot  with  5  legs  by  section  in  contact  with  the 
pipe  side.  It  is  composed  by  twenty  legs. 

Main  advantages  of  this  actuator  are  the 
compliance,  its  various  motion  and  the  available  force.  Its 
main  inconvenient  is  the  supply  of  the  pneumatic  pressure 
that  involves  a  servo  valve  and  a  supply  pipe  for  each 
chamber  that  requires  space. 

The  last  kind  of  actuator  employed  for  micro 
robotics  is  bellows.  An  in  pipe  micro  robot  has  been 
designed  by  KATO  [5]  to  inspect  pipes  of  18  mm 
diameter.  It  is  composed  of  five  rubber  bellows 
independently  driven  and  it  is  based  on  the  same  principle 
of  locomotion  as  snake  detailed  above.  It  reaches  77  mm/s 
in  velocity  but  its  support  being  assumed  by  the  stick  slip 
motion  of  the  bellows  it  can  not  push  or  pull  a  mass  higher 
than  200  g. 

An  inchworm  micro  robot  was  conceived  with 
three  metal  bellows  actuators  [6].  It  moves  in  circular 
section  25  mm  in  diameter  and  can  cross  a  junction  in  Y. 
Its  module  of  elongation  is  carried  out  by  a  system  with 
pulley  actuated  by  two  metal  bellows.  Both  modules  of 
blocking  ensure  the  contact  against  the  walls.  The  robot 
progresses  to  1  mm/s  approximately. 

According  to  many  implementations  developed 
in  our  laboratory  we  have  chosen  the  metal  bellows  to 
create  an  actuator  for  the  in  pipe  locomotion  [7].  Indeed 
every  robot  actuated  by  this  technology  has  allowed  to 
control  accurately  a  position  or  the  available  force.  It  is 
why  a  XY  table  conceived  in  LAI  has  been  able  to  create 
displacement  with  accuracy  better  than  2  microns.  In 
another  case  the  metal  bellows  actuator  have  been  used  to 
control  the  compliance  of  a  robot  arm. 

3.  Design 

The  main  constraint  is  the  very  confined 
environment  ;  this  is  why  it  must  be  precisely 
characterised. 

In  our  case  the  pipe  is  rectilinear  and  has  a 
constant  circular  cross  section  of  18  mm  of  diameter.  The 
actual  material  of  the  pipe  is  steel  but  for  our 
experimentation  we  used  a  transparent  pipe  made  of  PVC 
that  allows  observing  of  each  motion  of  the  robot. 

Finally  we  chose  to  study  the  worst  case  where 
the  pipe  is  vertical  and  the  robot  has  to  move  up. 

An  inchworm  micro  robot  is  composed  by  2 
blocking  modules  that  assume  the  support  of  die  body  and 
one  stretching  actuator  that  creates  the  step  displacement 
(flg.l).  The  fore  blocking  module  is  actuated  by  micro 
motor  that  drives  a  screw  /  nut  system  to  brace  three  legs 
against  the  pipe  sides.  The  rear  blocking  module  makes 
moving  3  legs  made  as  pistons  that  are  actuated  by  air 
pressure  [7]. 

The  main  actuator  is  composed  with  metal 
bellows.  It  acts  like  a  pneumatic  jack  except  it  has  no  joint 
between  the  rod  and  the  cylinder.  Such  technology  limits 
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the  friction  that  involves  a  more  accurate  positioning.  In 
this  case  contrary  to  Yoshida's  micro  robot,  we  chose  to 
supply  pressure  out  side  of  the  metal  bellows.  This  is  the 
reason  why  a  solid  cylinder  machined  from  a  bulk  of 
Aluminium  is  placed  around  the  bellows  to  carry  out  the 
chamber.  This  characteristic  makes  that  bellows  always 
work  in  stressed  way.  Hence  that  allows  to  ensure  a  longer 
life  of  the  metal  bellows  and  a  constant  stiffness. 

The  displacement  of  the  rod  represents  a  step. 
Inchworm  movement  is  a  succession  of  such  steps.  A  Hall 
effect  sensor  has  been  placed  on  the  back  of  the  cylinder 
in  front  of  a  magnet  glued  on  the  rod  back.  This  allows  to 
measure  the  magnitude  of  displacement  and  so  the  step 
length. 

The  general  design  has  shown  few  difficulties  to 
obtain  wanted  performances.  The  most  important  is  the 
pneumatic  and  electric  power  supply.  Indeed  the  robot  is 
supplied  by  air  pressure  through  2  different  pipes  that 
involves  a  good  airtightness  between  both  chambers  (rear 
blocking  module  and  stretching  actuator).  The  micro 
motor  needs  to  be  supplied  with  current ;  this  is  why  the  3 
mm  diameter  rod  has  been  drilled  end  to  end  to  allow  the 
insertion  of  a  wire  from  the  back  to  the  head  of  the  robot. 
Thus  this  wire  is  placed  through  the  chamber  of  the  main 
actuator  where  it  is  necessary  to  avoid  leakage.  The 
problem  is  the  same  with  the  3  wires  of  the  effect  Hall 
sensor. 

Note  that  the  robot  has  been  built  with  standard 
materials  and  components  then  it  is  relatively  cheap. 


Figure  1.  Drawing  of  the  micro  robot 
4.  Design  calculations 


Design  of  a  complex  system  such  as  a  micro 
robot  needs  to  follow  several  courses.  Indeed  the  most 
important  criterion  is  the  small  size  because  the  pipe 
imposes  its  internal  diameter.  Power  available  to  carry 
heavy  loads  has  been  chosen  as  second  characteristic. 
Accuracy  of  positioning  is  placed  at  the  same  level  of 
priority.  Velocity  appears  as  the  last  performance 
required.  Many  studies  work  at  the  opposite. 
Classification  of  criteria  is  very  important  for  the  design 
of  complex  systems  to  optimise  the  device. 

4.1.  Stretching  actuator 

In  our  case  the  bellows  have  been  chosen 
according  to  the  supply  pressure,  the  stroke  and  the 
stiffness.  Stiffness  must  be  high  enough  to  move  up  the 
rear  mass  with  tubes  and  wires.  Then  bellows  must  be 
shorter  as  possible  to  be  stiff  while  knowing  the  stroke  is 


proportional  to  the  length.  The  solution  held  is  to  place  an 
added  spring  in  parallel  of  bellows  to  ensure  a  strong 
stiffness.  Thus  bellows  could  be  very  flexible  and  not  too 
long  to  provide  large  step.  Note  that  pressure  is  high  to 
stress  bellows  that  have  to  be  enough  resistant.  The  length 
of  the  robot  and  the  step  size  depend  of  loads  that  it  can 
pull  up. 

4.2.  Rear  blocking  module 

This  part  has  to  support  the  whole  mass  of  the 
robot  plus  carried  loads.  It  depends  on  supply  pressure, 
cross  section  of  piston  and  friction  between  tip  of  legs  and 
pipe  sides.  The  small  diameter  of  pistons  (legs)  involves 
once  more  supplying  under  high  pressure. 

4.3.  Fore  blocking  part 

.  It  has  the  same  function  as  the  rear  blocking  part 
described  above.  The  micro  motor  and  its  speed  reducer 
have  been  chosen  firstly  for  their  diameter,  secondly  for 
the  maximal  torque  supplied  during  a  short  time.  Indeed 
the  high  reducing  ratio  plus  the  screw  /  nut  system  make 
the  device  irreversible.  Then  the  motor  is  not  supplied 
while  legs  are  braced  against  pipe  sides.  Strength  provided 
at  contacts  is  therefore  function  of  screw  characteristics 
(diameter,  thread,  screw  angle)  and  inclination  of  guides 
on  the  nut.  This  device  is  used  in  all  or  nothing.  Length  of 
this  module  depends  on  the  whole  mass  transported. 

4.4.  Hall  effect  Sensor 

This  sensor  estimates  the  distance  between  the 
rod  and  the  cylinder  back  -  i.e.  position  of  the  rod  -  by 
measuring  the  magnitude  of  the  magnetic  field.  Indeed 
this  field  is  provided  by  the  magnet  glued  on  the  rod  back. 

The  magnitude  of  magnetic  field  measured  is 
decreasing  when  distance  between  magnet  and  sensor  is 
growing.  We  searched  to  obtain  a  significant  signal  with  a 
good  resolution  along  die  whole  stroke  of  the  rod.  That 
depends  on  the  magnetisation  or  size  of  magnet  and  the 
initial  distance  to  sensor.  The  sensitiveness  of  the  Hall 
effect  sensor  reaches  1  micrometer  but  according  to  its 
characteristic  modelled  it  can  ensure  measures  reliable 
with  an  error  less  than  7  micrometers. 

5.  Modelling  and  identification 

5.1  Mechanical  identification 

This  work  has  been  done  in  two  times  where  we 
firstly  have  estimated  the  parameters  of  the  mobile  part 
and  secondly  the  pneumatic  part  -  i.e.  the  servo  valve  and 
the  supply  pipe  to  the  robot. 
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The  other  modules  that  actuate  the  legs  are  not 
detailed  in  this  paper  because  they  function  on  all  or 
nothing  basis.  They  have  been  designed  to  brace  legs 
against  the  pipe  side  in  order  to  support  a  heavy  mass  that 
involves  choosing  a  high  power  motor  and  a  supply 
pressure  of  20  bars. 

The  figures  2  and  3  show  a  rising  step  and  a 
falling  step.  The  robot  is  controlled  in  opened  loop.  The 
bold  line  represents  implemented  points  whereas  the 
continuous  line  represents  simulated  points. 


x  IQ'3  Simulation  of  the  mechanical  part 


Fig.2.  Simulation  of  the  whole  mechanical  part 
In  opened  loop 


This  implementation  can  verify  the  accurate 
estimation  of  the  moving  mass  (whole  the  fore  blocking 
part  with  legs),  the  stiffness  of  the  metal  bellows  actuator 
(metal  bellows  plus  the  added  spring),  the  viscous  friction 
coefficient,  and  the  friction  force  certainly  du  to  the 
guidance  of  the  rod. 

As  these  characteristics  are  known  those  of  the 
pneumatic  part  can  be  identified.  The  figure  4  illustrates 
the  difference  between  the  modelling  and  the  real  process. 
This  graphic  confirms  a  good  identification  of  the  gain  of 
flow  (Kv)  and  the  gain  of  pressure  (Kp)  of  the  servo 
valve.  The  gap  between  simulated  points  and  implemented 
points  is  certainly  du  to  the  characteristic  linearised  of  the 
servo  valve. 

5.2  Modelling  of  the  whole  process 

This  part  has  been  designed  to  be  very  stiff  to 
allow  the  pulling  up  of  a  heavy  rear  mass.  Supply  pressure 
is  controlled  in  order  to  allow  the  pushing  up  of  the  fore 
mass  with  accurate  positioning.  This  relation  gives 
dynamic  behaviour  : 


Fig.3.  Simulation  of  the  mobile  part  in  opened  loop 


Where  : 

x0  :  initial  rod  position, 

x  :  actual  rod  position, 

S  :  bellows  section, 

Patm :  atmospheric  pressure, 

P :  internal  pressure, 

Ms :  mass  in  translation, 

b  :  viscous  friction, 

ke :  equivalent  stiffness, 

f :  friction. 

Metal  bellows  actuators  do  not  use  joint  to  guide 
the  rod  contrary  to  classical  jack  then  friction  is  very 
limited.  Pneumatic  behaviour  is  given  by  the  relation 
based  on  the  mass  conservation  given  below  : 

dM  V  dP  dxSP_ 
dt  yrT  dt  dt  rT 

Where  : 
dM  . 

- :  mput  massic  flow, 

dt 

T :  temperature, 

y  :  specific  heat  ratio, 

r :  air  constant, 

V  :  chamber  volume. 

As  the  system  is  adiabatic  and  the  variations  of 
temperature  are  very  low,  model  can  be  linearised  at  an 
equilibrium  point.  Thus  it  can  be  represented  as  a  closed 
loop  block  diagram  drawn  below  (fig  5). 


Figure  4.  Block  diagram  of  the  stretching  actuator 

Notations  : 

V7>  ■  K£r  M 

1  V  V  •  3  V 

r0  Ko  K0 

6.  Implementation  and  performances 


the  Hall  effect  sensor,  error  of  positioning  can  not  exceed 
10  micrometers. 


x  i  o'3  Zoom  on  the  simulated  line 


Our  device  is  currently  controlled  by  D-Space 
environment  that  uses  Matlab  and  Simulink  software. 
Firstly  the  micro  robot  has  been  tested  to  realise 
displacement  stairs  shaped.  During  this  task  the  control 
law  employed  is  a  simple  PID  correction.  The  figures  5 
shows  the  gap  between  the  simulated  lines  and 
experimental  points.  The  figure  6  is  a  zoom  where  the  line 
with  stars  is  implemented  points  and  continuous  line  is  the 
simulated  result.  This  small  gap  confirms  us  the  good 
modelling  of  each  part  of  our  device. 


X  10’3  Simulation  of  the  control  law 


Figure  5.  Comparison  of  the  modelling  to 
experimental  results  in  closed  loop 


After  the  validation  of  our  whole  modelling  a 
control  law  has  been  established.  Firstly  we  have  chosen 
to  optimise  the  positioning  accurate  of  the  robot  in  the 
pipe.  The  matching  of  gains  of  the  corrector  has  allowed 
reaching  a  very  good  positioning  accuracy  on  every  step 
better  than  3  micrometers.  According  to  the  reliability  of 


The  micro  robot  realises  8  steps  of  0.5  mm  in  size 
before  one  change  of  supports.  Currently  a  step  is  done 
each  0.25  seconds  but  the  figure  5  shows  that  the  very 
short  response  time  of  our  device  could  allow  to  move  up 
very  faster  while  keeping  the  good  positioning  accuracy. 

Changes  of  support  constitute  the  most  serious 
problem  that  limits  the  global  positioning  accuracy. 
Indeed  with  a  low  load  carried  (less  than  1  kg)  the 
supports  can't  slip  along  the  pipe  sides  but  during  the 
change  of  supports  the  rod  can  change  a  bit  of  orientation 
(few  degrees  around  a  vertical  line).  This  phenomenon  can 
involve  disturbances  in  measurement  of  the  position  of  the 
rod.  This  is  the  reason  why  we  currently  obtain  a  0.5  % 
error  of  positioning  during  a  course  80  mm  long. 

The  locomotion  can  be  faster  than  the  precedent 
one.  Indeed  the  micro  robot  can  move  up  quickly  by 
creating  only  steps  of  4  mm  size.  By  this  way  the  velocity 
exceeds  5  mm/s  without  added  load.  At  this  moment  this 
criteria  is  optimised  by  the  improving  of  a  states  feedback 
control  law. 

According  to  the  pipe  material  hence  its  friction 
coefficient  with  legs,  the  robot  is  able  to  support  added 
loads  higher  than  2  kg  without  observing  a  slipping 
displacement  at  the  contact  points.  The  main  actuator  has 
pushed  a  load  of  5  kg  while  ensuring  the  positioning 
accuracy  better  than  10  micrometers.  Obviously  the 
response  time  is  increasing  in  this  case. 

The  robot  can  pulled  an  added  load  but  the 
efficiency  of  the  stroke  is  decreased  if  the  metal  bellows 
actuator  is  not  enough  stiff.  The  moving  up  of  the  rear  part 
is  only  assumed  by  the  general  stiffness.  The  design  of  the 
robot  has  foreseen  this  case  ;  this  is  why  it  is  able  to 
replace  the  added  spring  by  a  stronger  one  and  to  supply 
pneumatic  pressure  up  to  20  bars. 
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Figure  7.  Picture  of  the  micro  robot 


7.  Conclusions  and  perspectives 

A  robot  for  in-pipe  locomotion  has  been  designed 
to  move  up  vertical  pipes  of  1 8  mm  diameter  (locomotion 
is  able  in  pipe  from  16  mm  to  20  mm  in  diameter).  Its 
main  advantage  is  to  carry  heavy  loads  higher  than  2  kg 
with  an  accurate  positioning.  Modelling  of  each  part 
(actuators,  sensors)  has  been  carried  out  and  implemented 
results  have  been  compared  with  behaviour  simulated. 

Now  the  control  law  using  states  feed  back  is 
implemented  to  control  the  positioning  of  the  robot  in  the 
pipe  and  to  avoid  disturbances  due  to  the  delay  of  pressure 
along  the  supply  pipe. 

Behaviours  of  the  micro  robot  will  be  still 
investigated  with  different  conditions  (load,  velocity...). 
Another  prototype  has  been  designed  to  realise  the  same 
task.  This  industrial  project  is  shorter  (about  80  mm 
length)  than  the  current  one  because  the  fore  blocking 
module  is  composed  with  a  micro  motor  of  3  mm 
diameter.  When  it  will  be  machined,  it  will  test  as  the 
same  way  as  the  current  robot. 
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Abstract 

A  stochastic  program  is  de\>eloped  for  adaptive 
control  and  identification  of  industrial  mobile 
robot  Our  program  is  executed  at  two  levels:  a 
stochastic  trajectory  planner  (STP)  in  the  first 
place  then  a  follower  trajectory  en-line  (FTE) 
based  on  the  complete  stochastic  dynamic  model  of 
the  mobile .  The  modelization  is  first  done  in  the 
deterministic  case  based  on  the  Lagrangien 
formalism .  This  gives  the  stochastic  model  of  the 
mobile  robot  The  mobility  of  the  robot  is  besides 
considered;  first  a  static  mobility  is  given.  Then  we 
consider  a  dynamic  mobility.  After  that  the  mobility 
is  randomized  and  taken  as  an  output  of  our 
dynamic  system.  Our  program  is  one  of 
identification  of  the  doubly  stochastic  process  of 
hidden  Markov  chaines  minimizing  the  function  of 
information  of  Kullback-Leibler.  The  convergence 
and  the  consistence  of  functions  of  parameters 
evaluation  as  well  as  simulations  on  the  case  of  the 
SARAH  robot  are  given  to  demonstrate  the 
efficiency  of  our  algorithms 

Key  words: 

Dynamic  Navigation,  Dynamic  Mobility, 
Information  of  Kullback-Leibler,  Identification  and 
Stochastic  Control. 


1.  Introduction 

Until  now  the  mobile  robotics  knew  some 
important  developments  on  the  plan  material 
Hardware,  notably,  systems  of  vision.  Whereas 
decision  level  of  intelligence  and  command  is 
reduced  to  an  assessment  from  the  geometric  or 
Idnematics  models  of  the  mobile  generating  an 
organized  trajectory  of  right  lines  or  com  spiral 
connections.  [1],  [2],  [3]  and  where  the  curvature  or 
the  acceleration  is  taken  like  command  One 
perceives  while  the  dynamic  effects  as  inertia  and 
mass  and  various  disruptions  can  not  be  neglected, 
that  is  wished  for  the  high  performances. 

In  [4]  the  authors  had  developed  a  deterministic 
dynamic  model  based  on  the  principle  of  Lagrange 
-Euler.  We  follow-up  here  of  a  stochastic  dynamic 
model.  An  original  contribution  of  the  present  work 
is  the  quantification  of  the  mobility  and  capacity  of 


maneuverability  of  the  robotics  system.  Indeed  a 
static  mobility  function  has  been  considered  in  [10]. 
However  such  a  function  of  mobility  is  valid  only 
in  the  case  of  the  stationary  robots  or  manipulators. 
In  the  second  section  one  will  develop  deterministic 
and  stochastic  dynamic  models  of  the  mobile.  In  the 
third  section  the  first  level  of  the  stochastic  strategy 
for  identification  and  command  is  developed.  An 
Iterative  Global  Algorithm  (IGA)  based  on  the 
minimization  of  the  function  of  Kullback-Leibler 
information  of  processes  of  doubly  stochastic 
hidden  Markovs  is  considered  This  algorithm  has 
been  applied  to  the  identification  of  parameters 
meaning  the  various  disruptions,  notably  the 
variance  of  the  white  additive  gaussian  noise 
representing  uncertainties  of  measures.  This 
algorithm  has  been  applied  with  success  to  ecology 
[5],  to  the  speech  recognition  [6] [7],  to  the  pictures 
Markovien  models  [8],  to  the  numeric 
communications  [9]  and  in  short  to  robot 
manipulators  [10].  In  the  fourth  section,  we  will 
develop  the  application  of  an  algorithm  of  recursive 
identification  of  parameters  and  adaptive  control 
(IRCA)  to  the  mobile  robots.  This  algorithm  is  a 
truncation  of  Birkoff  ergodic  theorem.  The  interest 
carried  by  the  application  of  this  strategy  to  the  case 
of  the  mobile  robots  SARAH  type.  Convergence 
analysis  based  on  martingal  proprieties  is  followed 
by  developing  a  new  model,  new  space  and  new 
probability  measure.  We  noted  the  fast  convergence 
of  parameters  identified  and  the  optimality  in 
precision  in  the  scheduling  by  IGA  and  the  pursuit 
by  IRCA  of  the  trajectory.  The  SARAH  robot  being 
the  robot  developed  by  the  unit  Genius  Advanced 
Robotics  of  the  atomic  energies  center  Paris  [12]. 

2.  Dynamic  model  of  the  mobile 

The  mobile  is  assimilated  to  a  parallelepiped  The 
long  of  the  present  work  will  be  referred  that  it  is 
the  body  of  the  robot.  The  wheels  of  the  robot  are 
assimilated  to  two  identical  cylinders.  Let’s 
consider  the  following  notation  (  see  figures  1,  2) 

{a,  b,h) Lengths  of  the  axis,  and  height  of  the 
robot  respectively. 

( )  The  radius  and  the  thickness  of  the  wheel 

(0,X,Y,Z)  The  axis  3D  that  origin  is  at  the 
center  of  gravity  of  the  robot 
Ox  Center  of  the  right  wheel 
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01  Center  of  the  left  wheel 
/j  and  1 2  points  of  wheel  contact  with  the  ground 
CO  Angular  speed  of  the  w  heel  around  the  Z  axis 
V  Linear  speed  of  the  center  of  gravity  of  the 
robot 

Vx  Linear  speed  of  the  right  wheel 
V2  Linear  speed  of  the  left  wheel 
COj  Angular  speed  of  the  right  wheel 
CO j  Angular  speed  of  the  left  wheel 

2.1  Deterministic  model 

xi+\  =xi  +V/cosfe) 

}/+i  =  J/  +  V/Sinfe) 

0/+i  =  e,  +¥i 


3.  Mobility: 

The  static  and  dynamic  mobilities  are  defined  in  the 
previous  articles!  1 0], [  1 3] . 

Let's  recall  however  their  characterizations.  Indeed, 
the  static  mobility  is  expressed: 

r,<2x(  r(t)  ^ 


M(r)=K$ 


ro(0 


dr 


VOV/  J 
The  dynamic  mobility ?  is : 

f 

M(r)=  KIq71  p{r.v.6.\\>) 


A1  . 

v/o(0~ . 


v* 


dt 


P{r,v,d,w)  =  (l v)cos 2  t  +  ~b^h~2 (l w)sin2 1 


With 


t  =  arctg 


f 


a 

vga 


\ 

J 


And  a  =  k  +  - 


w  n 
rmax  2 


If 

v  =  0  hl{t,v)  =  hi(t,w)  =  \ 
A  =  b  =  r(t) 


If 


''  ^  0  h,  (t,  v)  =  —  for  [/;r(l  -  5gn(v))l  >  — 
3  2 

hlCv)  =  1  -  ~7“[r'T(l  -  sgn(v))] 

3  3  x 

Elsewhere 


3.1  Stochastic  mobility 

A  quantification  of  the  mobility  as  uncertain 
.  variable  becomes  necessary  as  we  have  inherent 
^unavoidable  disruptions  because  of  to  the  intrinsic 
and  extrinsic  w  orking  space  of  the  mobile. 

The  intrinsic  disruptions  are  owed  to  the  robot 
because  of  proper  deteriorations  of  the  machine. 
The  extrinsic  disruptions  are  for  the  main  thing 
owed  peripheral  system  (camera,  sensors...)  and  to 
the  environment  in  general  as  obstacles,  other 
robots  and  targets. 

So  the  continuous  stochastic  mobility  can  be 
defined  from  the  dynamic  mobility  as  follows 


=  Ayfo  ^  P(r,  v.  0 .  w 


Y7 

-  I  dt  +  dwj 
) 


With  the  functions.  Pj\r0 .  to  identify  U)  a 
Brownian  movement. 

How  ever,  the  mobile  agents  are  for  the  main  tiling 
systems  to  discrete  evolution,  so  a  discrete 
treatment  of  these  systems  is  necessary. 

One  is  going  to  consider  the  discrete  version  of  the 
model  of  the  stochastic  mobility  of  our  mobile 
which  w  ill  be  given  as  the  output  of  the  system. 

A/s(r);+i  =A&(r)j  +  Ks  I  p-(rrv,6,w) 

7=1  J 


lroO) 


+  Pm"' 


4.  Stochastic  dynamic  model 

Tlie  stochastic  dynamics  mobile  generalizes  the 
deterministic  mobile  dynamic  of  the  robot: 

-V/+1  =Xi  +  V/C0S  (<?,)+Pxi'/ 

}i+l  =  Yi  +hivi  sin(^/  )+  Pyvf 
6i+ 1  =di+  hi°i  +  Po'i 
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one  has  the  following  result 

5  -  The  first  level  of  the  program:  The  Parameters  estimated  are  martingals 

Iterative  Global  Algorithm  (IGA):  _ 

Demonstration; 


For  A  =  (px)  the  IGA,  one  is  going  to  define  an 
extension  of  the  method  proposed  by  Baum  [5] 
based  on  the  utilization  of  the  function  of  ■ 
information  of  Kullback  -  Leibler  that  defines  a 


distance  between  two  values  of  parameters  A 

and  A .  This  is  function  of  entropy. 

The  algorithm  for  one  parameter  is  the  next  one 


»  i  N 
Py=—  Z  Z  Z 

X  *«  =  1X  X  t 
n  n  - 1 


\x  -X  -h  /  lCos e  ,r 

1  n  n  - 1  h-1  w-1  n-  H 


l\ 

f  > 

mn,x  X  /a, 

'  1  .  if- 1  nf  > 

) 

Z  2 

V  V 

:  l\ 

>) 

5.2  the  control  law 


Let’s  suppose  given  the  initial  conditions  to  the  state 
n+1  the  law  of  command  is  given  while  using  the 
stochastic  gradient  by: 


<r  =c,*+J-„£i;i 


»=1V„.  V„ 


K 

h  rM 


c: 


6.  The  second  level  of  the  algorithm 
Recursive  Identification  and  Adaptive 
Control  (RIAC) 

The  parameter  estimation  is  given  for  the  same 
parameter  by: 

PXn+\  =  Pxn  +  Sn  X„ lX"  "  "  hnV»~' 


Similar  to  the  one  in  [13].  One  is  going  to 
demonstrate  that  this  integral  is  squares  integrables 

Theorem  2  (A.Khoukhi) 

Under  hypothesis  adhoc,  pXn  is  square  integrable 

A 

E  ((  px*  )2)  is  bounded 

But  the  demonstration  of  this  theorem  is  a  direct 
consequence  of  the  theorem  of  J.Neuveu[14] 


Theorem  5  (Kabanov,  Lipser  ,Shrayer) 

Being  given(Q,F)a  measurable  space  with  a 
following  of  sigma-algebra 

Fn,n>  0  Fn<  Fn+l  n  0  and  being  given 

P„and  P„ 

A 

Let's  suppose  Pn  «  P„  and  let's  put 
a  =  L„ /4_,  With  L„  =  dPnjdPn  Then  « 

A 

Pn  «Pn  (absolutely  adjoining) 

Theorem  6  (A.khoukhi) 

Under  hypothesis  adhoc  the  law  of  probability 
P  relative  to  the  initial  model  and  the  relative 
probability  law  to  the  new  model  are  absolutely 
adjoining 

Demonstration:  See  [13] 

Corollaries:  The  parameters  converged  under  P 

8*  Control  algorithm: 


7.  Convergence  of  the  stochastic 
evaluation: 

7.1.  The  new  model: 

In  [13],  the  author  defines  a  new  model,  new  space 
and  new  probability  measure 

7.2  Theorems 

7.2.1  Martingales  proprieties 
Theorem  1  (A.  Khoukhi) 


It  is  a  stochastic  algorithm  of  type  corrective- 
predictive.  On  the  one  hand  a  precise  model  of  the 
robot  identified  in  real  time  that  gives  torque  to 
apply  to  wheels  to  assure  the  desirable  movement 
while  holding  amount  of  gaussian  structure  of  the 
noise.  It  is  called  command  predictive.  On  the  other 
hand  a  proportional  integrator  derivator  determined 
by  nonlinear  stochastic  approximation  with 
adaptive  and  chosen  gain  going  asymptotically 
toward  zero. 
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9.  results  of  simulation: 

Studies  of  simulation  have  been  done  on  the  model 
of  the  SARAH  robot  (Autonomous  Robot 
developed  by  the  unit  genius  robotics  Advanced  of 
the  CEA  for  the  cleaning  of  the  embankment  of  the 
subway  of  the  Parisian  Metro  RATP)  on  a  Pentium 
II  using  the  software  of  automatic  Matrixx 

We  simulated  the  movement  producing  the 
trajectory  starting  of  (0,0)  and  reaching  the  target 
(4,4)  (10,10). 

This  diagram  of  simulation  is  done  at  a  time  for 
iterative  global  algorithm  (AIG)  and  for  the 
algorithm  of  recursive  identification  and  the 
adaptive  control  (IRCA). 


10.  Efficiency  of  mobility 

We  show  many  cases  study  of  mobility.  In  fact,  we 
have  two  important  cases:  The  case  of  weak 
mobility  where  the  mobile  has  a  limited  workzone  . 
Mainly,  in  the  presence  of  obstacles  and  other 
machines.  The  second  case  is  the  case  of  strong 
mobility  where  the  robot  has  more  freedom  in  the 
workzone. 

11.  Conclusions 

In  this  article  two  strategies  of  identification  and 
command  global  Iterative  and  recursive  adaptive 
are  presented.  We  got  some  more  effective  results 
by  the  strategies  IGA.  IRCA.  A  new  function 
stochastic  mobility  has  also  been  developed.  Our 
mobility  is  also  taken  as  an  output  function  in 
identification  and  control  process. 
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Abstract 

The  project  of  a  two-legged  walking  robot  is  presented. 
First  the  mechanical  design  is  described  with  particu¬ 
lar  emphasis  on  the  sensors  and  actuators  of  the  robot. 
For  this  system  a  nonlinear  control  scheme  is  proposed 
that  is  based  on  the  method  of  feedback  linearization.  In 
order  to  allow  for  a  “ dynamically ”  stable  motion,  the 
reference  trajectories  are  computed  so  that  the  system 
remains  controllable  throughout  the  entire  gait  cycle.  In 
simulations  the  robot  is  controlled  in  a  running  motion, 
including  phases  without  ground  contact .  The  simula¬ 
tion  results  show  the  good  performance  of  the  control 
strategy  in  a  purely  dynamic  motion. 

1.  Introduction 

In  recent  years  growing  scientific  effort  has  been  de¬ 
voted  to  the  development  of  two-legged  robots  that  can 
move  similar  to  human  beings.  This  trend  is  stimulated 
by  the  rapid  development  of  electronics  and  microcom¬ 
puting  allowing  for  the  implementation  of  more  sophi¬ 
sticated  control  schemes.  In  spite  of  worldwide  efforts 
in  the  area  of  human  walking,  only  few  machines  with 
acceptable  performance  have  been  realized  [4,  12,  13]. 
Still  there  is  no  machine  that  can  walk  at  higher  speeds 
and  there  is  no  comprehensive  solution  to  the  control 
problem. 

In  this  paper  the  principles  of  dynamically  stable 
walking  are  discussed  and  a  control  strategy  for  a 
two  legged  robot  is  presented.  The  control  scheme 
is  designed  for  a  biped  robot  being  developed  at  the 
TU-Munich. 

2.  Design  of  the  Robot 

The  geometry  of  the  robot  is  based  on  anthropome¬ 
tric  data  taken  from  a  male  human  of  the  size  of  180  cm 
and  a  weight  of  75  kg  [3].  The  joint  structure  (see  figure 
4)  has  been  chosen  to  provide  the  required  degrees  of 


freedom  for  walking  and  running  on  plain  and  uneven 
ground.  The  upper  body  is  connected  to  the  pelvis  by 
a  rotational  joint.  Each  hip  joint  has  three  degrees  of 
freedom.  The  knee  joint  has  one  degree  of  freedom, 
whereas  the  ankle  joint  allows  two  degrees  of  freedom, 
pitch  and  thrust.  Furthermore,  the  arms  are  connec¬ 
ted  to  the  upper  body  with  one  active  or  optional  two 
passive  (damped)  degrees  of  freedom,  each. 

The  structure  will  mainly  consist  of  aluminum,  which 
has  particular  advantages  in  terms  of  the  heat  conduc¬ 
tion  and  the  weight  properties.  As  the  weight  of  the 
structure  itself  will  be  low  compared  to  that  of  the  ac¬ 
tuation  and  the  electronics,  weight  reduction  of  the  ac¬ 
tuation  has  been  emphasized. 

Therefore,  hydraulic,  pneumatic  and  electric  drives 
have  been  considered.  It  has  been  proven  that  electric 
actuation  has  the  highest  performance-to-weight  ratio 
in  the  weight  region  of  interest.  The  good  controllabi¬ 
lity  was  another  reason  for  choosing  electric  actuation. 
The  chosen  joint  components  are  Neodym-Bor  motors 
with  Harmonic  Drive  gears  and  ball  screws,  respective¬ 
ly- 

To  obtain  an  optimized  design,  several  iterations 
have  been  performed.  Figure  1  shows  the  design  of 


Figure  1 :  Design  of  hip  joint 
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the  hip  joint.  The  hip  contains  three  active  degrees  of 
freedom  and  has  to  transmit  high  torques.  The  availa¬ 
ble  space  is  very  limited  and  the  demand  for  integrating 
three  motor  -  gear  -  combinations  makes  it  difficult  to 
find  an  appropriate  arrangement.  The  joint  for  the  in¬ 
ternal  and  external  rotation  of  the  shank  is  integrated 
in  the  shank  (not  shown)  and  attached  to  the  flexor 
joint.  The  adductor  joint  is  actuated  by  a  motor  in 
combination  with  a  toothed  belt,  which  is  integrated 
in  the  pelvis.  The  flexor  joint  and  the  rotational  joint 
both  have  a  coaxial  arrangement  of  motor  and  Harmo¬ 
nic  Drive  gear  which  is  very  space  efficient.  Figure  2 


Upper  Body 
Joint 


Flexor 

Joint 


Connection  to 
Shank 


Adductor 

Joint 


Figure  2:  Design  of  the  pelvis 


shows  the  assembled  pelvis  including  both  hip  joints 
and  the  upper  body  joint.  The  whole  unit  weights 
about  4,600  g.  Table  1  shows  the  transmittable  tor¬ 
ques. 


Table  1:  Drive  torques  of  the  leg 
joints 


Average 

torque 

Maximum 

torque 

Hip  adductor  joint 

35,3Nm 

147Nm 

Hip  flexor  joint 

20,4Nm 

147Nm 

Hip  int.  rot.  joint 

7,8Nm 

86Nm 

Knee  flexor  joint 

32.5Nm 

255Nm 

Foot  adductor  joint 

26,0Nm 

64Nm 

Foot  flexor  joint 

26,0Nm 

288Nm 

Another  important  aspect  is  the  design  of  the  : 

Since  fast  walking  as  well  as  running  is  planned,  im¬ 
pacts  have  to  be  reduced  to  a  minimum  and  the  time 
gap  between  the  sensor  input  and  the  motor  response 
has  to  be  bridged.  Therefore,  a  damping  mechanism 
is  required.  It  will  be  realized  by  integrating  elastic 
elements  in  the  foot. 


The  foot  actuation  is  realized  by  two  linear  drives 
based  on  ball  screws  (see  figure  3).  This  way  the  high 
torques  in  the  ankle  joint  can  be  realized  using  two 
motors  in  parallel. 

3.  Sensor  System 

The  sensor  system  consists  of  angular  velocity  and 
position  sensors  in  each  joint.  Furthermore,  the  Toot 
contains  three- axis  force  sensors  to  measure  the  ground 
reaction  forces  in  each  of  the  three  contact  points.  To 
measure  the  spacial  orientation  of  the  upper  body,  gy¬ 
roscopic  sensors  in  combination  with  inclinometers  are 
installed.  To  verify  the  measured  results  and  to  de¬ 
termine  external  forces,  three  acceleration  sensors  are 
used  in  addition. 

The  machine  hardware  will  consist  of  a  central  PC- 
board,  which  is  connected  to  decentral  C167-controllers 
by  a  CAN-bus  system. 

4.  Control  of  the  Robot 

The  mathematical  model  of  the  plant  corresponds  to 
the  design  of  the  robot  and  consists  of  a  system  of  16 
bodies  connected  by  m  =  15  rotational  joints  (figure  4). 
Then  the  overall  system  has  n  =  m  +  6  =  21  degrees  of 
freedom.  When  the  robot  is  standing  on  one  leg,  the 
6  degrees  of  freedom  of  the  upper  body  correspond  to 
a  specific  orientation  of  the  6  rotational  joints  of  the 
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Figure  4:  Plant  mode! 


supporting  leg.  Furthermore  there  is  one  joint  in  the 
pelvis  and  one  joint  for  each  arm,  which  can  be  used 
to  compensate  dynamic  forces. 

The  controller  is  designed  to  not  only  realize 
“dynamically”  stable  walking,  but  also  to  allow  for 
“running”,  which  includes  “ballistic”  phases  without 
ground  contact.  Since  the  latter  is  more  complicated 
than  pure  walking,  we  only  concentrate  on  this  case  in 
the  following.  During  the  gait  pattern  of  “running”, 
the  support  phases  of  the  feet  are  alternating  with 
ballistic  phases. 

4.1.  Feedback  Linearization 

The  control  of  the  robot  is  based  on  the  method  of 
feedback  linearization  [10,  9],  which  starts  out  from  the 
equations  of  motion: 

Mq  +  W1\  =  h  +  Qe  (1) 

Here  M  is  the  mass  matrix,  q  £  lRn  is  the  vector 
of  generalized  coordinates  and  h  and  Qe  are  the  gy¬ 
roscopic  and  external  forces,  respectively.  The  vector 
A  £  IRm  comprises  the  applied  torques  of  the  joints 
that  are  projected  on  the  generalized  coordinates  with 
the  Jacobian  W\ .  Given  a  set  of  control  inputs  A,  a  li¬ 
near  behavior  can  be  prescribed  in  m  coordinates.  Now 
we  first  take  a  look  at  the  “support  phase”,  in  which 
one  foot  is  in  contact  to  the  ground,  while  the  other 
one  is  swinging  to  its  next  stance  point.  Then  the  6  de¬ 
grees  of  freedom  of  the  supporting  foot  are  determined 
by  the  ground  contraints.  Therefore  all  n  degrees  of 
freedom  of  the  system  are  exactly  defined.  In  general 
the  trajectory  of  the  robot  can  be  prescribed  in  any  m 


coordinates,  as  long  as  there  exists  a  unique  transfor¬ 
mation  to  the  generalized  coordinates.  Here  a  linear 
behavior  is  prescribed  for  vector  xc ,  which  consists  of 
the  height  of  the  center  of  gravity  zs ,  the  spacial  orien¬ 
tation  of  the  upper  body  au,  yu,  the  orientation  of 
the  arms  xa  and  of  the  pelvis  joint xp  and  the  position 
and  orientation  of  the  foot  ( Xfi )  that  is  lifted  off  and 
swinging  to  its  next  stance  point. 


A  linear  PD  control  law  is  prescribed  in  xc  with  con¬ 
stant  matrices  C  and  D.  The  deviation  from  the  re¬ 
ference  trajectory  is  denoted  Axc  =  ( xc  —  xc,ref)  for 
the  velocity  and  Axc  =  ( xc  -  xC)T .e/)  for  the  position, 
respectively. 

Xc  =  Xc^ref  -  DA±C  -  C Axc  (3) 

When  the  system  is  in  a  non-singular  configuration, 
the  linear  control  law  can  be  projected  on  the  vector  of 


generalized  coordinates: 

xc  =  Jq  (4) 

xc  =  Jq  +  Jq  (5) 

Introducing  (5)  in  (3)  leads  to: 

Jq  =  xc,ref  ~  DAxc  -  CAxc  -  jq  ‘(6) 

With  abbreviations  w  and  W2 

"w  —  xCiref  DAxc  C  Axc  j  q  (7) 

W2  =  J  (8) 

we  obtain  the  equations  of  motion  in  standard  form: 

I\Jq  -{-  W\  A  =  h  +  Qe  (9) 

W2q  +  w  =  0  (10) 

Solving  (9)  with  (10)  for  the  applied  torques  A  leads 
to: 

A  -  (W2M-1W1)-1(w  +  W2M~l{h  4-  Qe))  (11) 

4.2.  Trajectory  Planning 

As  long  as  the  mathematical  model  of  the  plant  is 


close  to  the  real  system,  stability  of  the  linearized  sy¬ 
stem  can  be  achieved  by  placing  the  closed  loop  poles 
with  matrices  C  and  £>.  However  it  is  not  sure  that  the 
supporting  foot  does  not  move  relative  to  the  ground 
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throughout  the  entire  support  phase.  While  simula¬ 
tions  show  that  slipping  does  not  occur  since  the  nor¬ 
mal  forces  are  high  enough  to  ensure  sticking,  problems 
arise  from  the  limitation  of  the  maximum  torques  that 
can  be  transmitted  to  the  ground.  When  the  corre¬ 
sponding  torques  of  the  ankle  joint  reach  their  limits, 
the  system  becomes  underactuated.  Practically  this 
means  that  the  robot  starts  tipping  over  in  the  sagit¬ 
tal  or  lateral  direction.  Therefore  it  is  not  possible  to 
prescribe  a  completely  arbitrary  reference  trajectory  in 
«c,  but  the  ground  constraints  have  to  be  considered  in 
the  trajectory.  Defining  the  torques  that  act  between 
.  foot  and  ground  with  respect  to  the  center  of  the  foot, 
the  constraints  on  torques  Tx  and  Ty  are  denoted: 

\Ty\  <  0.5  Fzlx  .  (12) 

\TX\  <  0.5  Fzly  (13) 

Here  lx  and  ly  are  the  length  and  width  of  the  foot  and 
Fz  represents  the  normal  force.  In  order  to  ensure  that 
there  exists  some  region  around  the  reference  trajecto¬ 
ry  in  which  the  system  does  not  become  underactuated, 
functions  for  Tx  and  Ty  are  prescribed  for-  the  reference 
trajectory.  This  procedure  is  similar  to  prescribing  the 
path  of  the  “zero  moment  point”  as  known  from  litera¬ 
ture  [11].  For  the  lateral  plane,  the  trajectory  can  be 
chosen  such  that  the  center  of  gravity  is  always  between 
the  lateral  edges  of  the  supporting  polygon.  For  exam¬ 
ple  Tx  can  be  chosen  to  Tx  ~  0,  so  that  equation  (13) 
is  satisfied.  However  the  motion  in  the  sagittal  plane 
cannot  be  treated  in  the  same  way,  since  the  projection 
of  the  center  of  gravity  always  moves  out  of  the  suppor¬ 
ting  polygon  during  a  natural  gait  cycle.  Therefore  a 
function  for  Ty  is  prescribed  dependent  on  the  position 
of  the  center  of  gravity  xc : 


Ty  = 

-kFzlx 

for  xc  —  Xf  <  -Ax 

(14) 

& 

II 

Fz(xc  - 

X/)  for  | xc  -  Xf  |  <  Ax 

(15) 

T  — 
1y  ~ 

kFJx 

for  xc  —  Xf  >  Ax 

(16) 

Ax  = 

klx 

(17) 

k  = 

const. 

with  0  <  k  <  0.5 

(18) 

Here  xf  denotes  the  position  of  the  center  of  the  foot. 
With  a  larger  value  of  k ,  the  system  is  getting  closer 
to  the  underactuated  state  \Ty\  ~  Fzlx.  While  this 
reduces  the  domain  of  attraction  of  the  controller,  a 
large  value  of  k  leads  to  a  more  human-like  gait  pattern 
and  reduces  the  variations  of  the  velocity  in  the  sagittal 
plane. 

Besides  these  constraits,  the  trajectory  has  to  satisfy 
additional  conditions  to  ensure  that  the  robot  performs 
a  periodic  motion.  The  trajectory  of  the  support  pha¬ 
se  has  to  be  calculated  such  that  the  system  reaches 
a  well  defined  state  at  the  end  of  the  support  phase. 


Then  each  ballistic  phase  starts  with  the  same  initial 
conditions. 

*c(^i)  =  ®c,  1  (19) 

®c(^i)  —  ®c,  1  (^9) 

A  trajectory  has  to  be  found  for  xc  such  that  equations 
(14)-(20)  are  satisfied  for  the  equations  of  motion  (1) 
when  the  ith  support  phase  ends  at  some  time  U.  In 
general  there  is  no  analytical  solution  to  this  problem, 
we  therefore  suggest  a  numerical  approach. 

For  a  given  time  ti  the  trajectories  of  m  -  1  =  14 
elements  of  xc  are  prescribed  in  fifth  order  polynomials 
such  that  equations  (19)  and  (20)  are  satisfied.  Then 
one  component  of  xc  results  from  the  constraints  (14)- 
(16)  and  is  therefore  dependent  on  U .  Here  the  velocity 
of  the  center  of  gravity  in  the  sagittal  plane  is  chosen 
as  dependent  variable.  Then  the  equations  of  motion 
can  be  solved  numerically  for  U  so  that  all  constraints 
are  satisfied. 

The  numerical  solution  requires  iterative  integration 
and  is  therefore  computationally  expensive.  However 
the  determination  of  the  trajectory  has  to  be  done  only 
once  per  gait  cycle  and  does  not  need  to  be  calculated 
at  the  frequency  of  the  digital  controller.  Furthermore 
simulation  results  show  that  a  simplified  model  of  the 
system  is  sufficient  for  the  calculation  of  the  reference 
trajectory. 

4.3.  Ballistic  Phase 

When  the  robot  has  no  contact  to  the  ground,  the 
motion  of  the  system  is  not  influenced  by  external 
forces  except  for  the  gravitational  force.  Still  a  set  of 
m  generalized  coordinates  can  be  controlled  with  the 
m  driven  joints,  while  6  degrees  of  freedom  of  the  robot 
cannot  be  influenced  by  the  controller.  A  possible  set 
of  controllable  variables  is: 

•  spacial  orientation  of  the  upper  body  {au,  (3Ui  yu)T 

•  position  of  the  arms  and  the  pelvis  xarxp 

•  orientation  and  position  of  the  foot  that  is  swin¬ 
ging  to  its  next  stance  point  Xfi 

•  orientation  of  the  previous  stance  foot 

(afgiPfgilfg)T 

By  choosing  the  trajectories  of  these  variables  the  mo¬ 
tion  of  the  robot  can  be  adapted  to  the  human  gait 
pattern.  Furthermore  it  is  important  to  define  the  po¬ 
sition  of  the  foot  that  is  swinging  to  its  next  stance 
point,  so  that  the  subsequent  support  phase  starts  out 
with  a  well  defined  initial  configuration. 
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Figure  5:  Trajectory  z 


Figure  6:  Trajectory  x 


While  these  variables  can  be  controlled,  the  center 
of  gravity  follows  a  ballistic  curve  defined  by  the  initial 
conditions.  Add  to  this  the  position  of  the  previous 
supporting  leg  cannot  be  controlled,  but  it  results  from 
the  overall  moment  of  momentum. 

Based  on  these  assumptions  the  control  scheme  of 
the  ballistic  phase  is  implemented  analogous  to  that 
of  the  support  phase  using  the  method  of  feedback 
linearization.  Only  the  transformation  from  control 
variables  to  generalized  coordinates  has  to  be  adapted. 

5.  Simulation  Results 

The  presented  control  algorithm  is  evaluated  in  a 
multibody  simulation  program.  The  corresponding  mo¬ 
del  of  the  plant  has  the  same  structure  as  the  model  for 
the  controller.  In  order  to  obtain  realistic  simulation 
results,  the  contact  situation  between  foot  and  ground 
is  modelled  in  detail.  The  stick-slip  effects  are  modelled 
with  Coulomb  friction. 

Here  typical  results  are  given  for  a  gait  at  a  speed 
of  ly-  In  order  to  demonstrate  the  dynamic  stability, 
the  robot  is  in  a  “jogging”  motion  including  phases 


without  contact  to  the  ground.  The  time  for  one  step 
is  about  0.65  with  a  ballistic  phase  of  0.15.  Figure  6 
shows  the  sagittal  position  and  velocity  of  the  center 
of  gravity  and  the  corresponding  reference  for  two 
steps.  The  reference  is  not  defined  for  the  ballistic 
phases  and  is  therefore  set  to  zero  for  these  periods. 
Due  to  a  simplified  model  used  for  the  calculation 
of  the  reference  trajectory,  the  actual  velocity  differs 
from  the  reference.  However  the  system  is  controllable 
so  that  the  reference  velocity  can  be  reached  at  the 
end  of  the  steps.  This  leads  to  an  approximately  linear 
increase  of  the  position  in  the  sagittal  plane.  The 
vertical  position  and  velocity  are  shown  in  figure  5. 
Since  the  model  is  known  exactly  in  the  simulation, 
the  resulting  velocity  is  very  close  to  the  reference 
velocity. 

The  corresponding  torques  of  the  driven  joints  are 
shown  in  figure  7  for  the  knee  and  hip  joint.  From 
about  O.O5  to  0.75  the  foot  is  not  in  contact  to  the 
ground.  Therefore  the  torques  are  relatively  low. 
When  the  foot  touches  the  ground  at  t  =  0.7s  the 
torque  of  the  knee  joint  reaches  its  maximum  and 
stays  on  a  high  level  during  the  support  phase.  The 
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Figure  7:  Driving  torques 


design  of  the  machine  and  the  choice  of  the  motors 
complies  with  these  requirements.  According  to  the 
simulation  results,  the  robot  is  dynamically  stable  and 
can  handle  disturbances  effectively.  When  the  system 
is  excited  by  external  forces,  the  stance  points  are 
adapted  so  that  the  robot  remains  stable. 

6.  Summary 

The  design  and  control  of  a  humanoid  walking  machi¬ 
ne  are  presented.  Using  the  method  of  feedback  linea¬ 
rization,  dynamically  stable  motion  can  be  achieved. 
The  trajectories  of  the  system  are  calculated  at  each 
step  numerically.  The  computational  effort  is  reduced 
by  using  a  simplified  model  of  the  plant.  This  way  the 
control  problem  can  be  solved  in  real  time  and  is  sui¬ 
table  for  implementation  on  a  real  robot.  In  the  near 
future  friction  effects  and  sensor  transfer  functions  will 
be  considered  in  the  control  loop  so  that  model  uncer¬ 
tainties  can  be  reduced.  With  the  completion  of  the 
real  robot,  the  control  strategy  will  be  verified  practi¬ 
cally. 
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Abstract : 

In  this  paper  the  problem  of  motion  control  of  biped  is 
considered.  We  develop  a  new  method  based  on  multi¬ 
agent  strategy.  This  method  deals  with  organization  and 
coordination  aspects  in  an  intelligent  modeling  of  human 
motion.  We  propose  a  cooperative  multi-agent  model, 
based  on  this  model.  We  develop  a  control  kernel  named 
IMCOK  ( Intelligent  Motion  COntrol  Kernel  ),  which 
consists  of  a  controllor,  cordinnator  and  executor  of 
different  cycles  of  motion  of  biped.  When  walking, 
IMCOK  receives  messages  and  sends  offers .  The 
articulator  agents  partially  planify  the  motion  of  non¬ 
articulator  agent  associated.  The  system  is  hybrid  and 
distributed  functionally,  that  is  to  say  that  it  is 
constituted  of  a  hybrid  agent  (cognitive  and  reactive) 
which  cooperate  to  accomplish  functions  of  control  and 
command  of  biped.  Cognitive  agents  commuincate  with 
reactive  agents  (non  articulator)  in  order  to  generate  the 
motion. 

1.  Introduction 

The  study  of  the  motion  of  walker  robots,  especially  the 
locomotion  of  bipeds  has  been  studied  by  many 
researchers.  The  biped  can  navigate  on  surfaces,  so  their 
applications  will  be  increasing,  but  the  control  and  the 
coordination  of  their  movements  remain  a  problem  that 
must  be  adapted  according  to  methods  proposed  in  the 
literature[10;],  such  as  Autonomous  locomotionfl;], 
optimization  based  methods  of  energy  and  time-energy 
Adaptive  control  algorithms  [2]  [3] [4],  methods  based 
phase  of  movement[7].  Recently,  many  researches 
investigated  biped  robots  and  walking  on  surface,  as 
robot  that  can  climb  a  sloping  surface[16].  Presented 
hereafter  is  using  the  distributed  artificial  intelligence 
(DAI).  So  researchers  in  DAI  are  concerned  with 
understanding  and  modeling  action  and  knowledge  in 
collaborative  enterprises  [17].  Hence,  we  consider 
principles  and  foundments  of  Distributed  Artificial 
Intelligence  (D.A.I)  in  order  to  build  cooperation  and 
communication  system  between  agents. 

Multi-Agent  Systems  (MAS)  offer  a  way  to  relax 
the  constraints  of  centralized,  planned,  sequential 
control,  they  offer  production  systems  that  are 
decentralized  rather  than  centralized^  emergent  rather 
than  planned,  and  concurrent  rather  than  sequential  [18] 
concerned  with  the  behavior  of  a  collection  of 


autonomous  agents  aiming  at  solving  a  given 
problem.  So,  these  intelligent  entities  (agents)  cooperate 
in  order  to  solve  a  problem  [6,7],  The  autonomous  agent 
approach  replaces  a  centralized  data  base  and  control 
computer  with  a  network  of  agents,  each  endowed  with  a 
local  view  of  its  environment  and  authority  to  respond 
locally  to  that  environment  [8].  Several  taxonomies  of 
multi-agent  systems  have  been  published  In  different 
areas  :  production  and  design  [11,12;]  Robotics[9],  urban 
traffic  [13,14],  ethology  and  biology  [15;],  .  So,  multi¬ 
agent  proposes  an  intelligent  method  for  control  because 
of  its  powerful  theories[7].  This  part  is  an  extension  of 
article  [10].  The  model  is  called  Multi-agent  Biped 
System  (MABS)  and  the  control  kernel  is  called 
Intelligent  Motion  Control  Kernel  (IMCOK).  Actually, 
the  coordination  between  the  biped  members  makes  its 
movement.  The  present  scheme  is  a  representation  of  a 
biped . 


Figure  1  :  Schematic  representation  of  biped 
2.  System  description 
2.1  Definition 
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Figure  2  :  knowledge  of  MABS 
The  MABS  system  is  a  multi-agent  system  within  a 
basis  of  actors  conceived  to  assure  the  control  over  the 
activity  of  movement  of  the  biped.  The  system  is  hybrid 
and  distributed  in  terms  of  function  and  scheduling  ,i.e. 
it  constitutes  of  a  set  of  hybrid  agents  (cognitive  and 
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reactive)  that  cooperate  to  accomplish  functions  of 
control  of  biped.  Agents  use  a  dynamic  organization 
strategy  based  on  the  use  of  the  negotiative  technique 
based  on  communication  protocol  inter-agent  .  This 
technique  allows  messages  exchanging  between  agents. 

2.2  MASS  Model 

The  main  features  of  this  model  is  to  permit  a 
scheduling  distributed  of  several  agents,  and 
facilitates  the  communication  with  the  user  via  the 
interfacing  of  communication.  Mainly,  it  gives  a  new 
strategy  of  organization:  the  parallel  communication 
between  agents,  the  execution  of  several  operations 
(intra-agents  or  inter-agents)  and  the  flexibility  and 
extensibility.  It  allows  a  distributed  scheduling  too.  It 
also  admits  new  articulator  agents. 


2.3  Operation  levels 


Figure  3  :  MABS  levels 

The  scheme  below  shows  the  different  levels  of  our 
architecture  ,  the  user  can  communicate  with  MABS 
via  interfacing  of  communication  witch  is  the  means  of 
communication  necessary,  for  outside  knowledge 
receipt,  to  the  working  of  the  system. 

2.3  Procedure  of  function  : 

It  is  based  on  the  following  algorithm,  followed  by 
every  agent: 

Message  receiving. 

Scheduling. 

Sending  results. 

3  Architecture  of  MABS  system 

MABS  constitutes  of  (figure  4): 

-  Kernel  (IMCOK)  "  Intelligent  Multi-agent  COntrol 
Kemer  that  is  composed  of:  supervisor  agent,  percr 
agent,  and  equilibrator  agent  and  regulator  agent. 

-  Interfacing  of  communication 

-  Articulator  agents 

-  Non-articulator  agents 


3.1  Articulator  agents 

These  agents  are  homogeneous  in  their  structures, 
but  differentiate  by  their  associated  functions 


Figure  4:  MABS  and  IMKOC 

Architecture 

Movement.  Our  articulator  agent  has  three  processes: 
Process  of  communication,  process  of  decision  and 
Process  of  scheduling. 

-  Process  of  communication:  It  receives  the  message 
coming  from  the  supervisor  or  other  components  by  the 
Process  of  Input  Communication  (PIC),  it  shapes  it  into 
Internal  Message  File  (IMF),  and  transmits  the  results 
of  scheduling  by  Process  of  Output  Communication 
(POC). 

-  Process  of  decision:  at  the  receipt  of  a  message 
from  the  regulator  agent,  the  articulat-or  agent  decides 
after  consulting  the  scheduling  process,  whether  it 
sends  the  control  to  the  non-articulator  agent  or  no. 

This  process  also  decides  on  sending  the  answer 
(success  or  failure)  to  the  supervisor,  percr  and 
equilibrator. 

’*  Process  of  scheduling:  this  process  is  constituted  of 
library  of  plans  and  memory  of  storage:  the  necessary 
data  allowing  the  process  of  decision  either  to  send  a 
signal  of  activation  or  not  to  the  non-articulator  agent, 
are  contained  in  the  library  of  plans  (like  max  and  min 
limit  for  rotation  of  the  non-articulator  agent  partner). 
The  ability  of  articulator  agents  resides  in  their  plans. 
In  case  of  fire,  the  signal  of  activation  is  going  to 
automatically  reactivate  in  the  memory  of  storage, 
what  permits  to  optimize  the  time  of  scheduling. 

Noting  that  the  three  processes  communication¬ 
scheduling-decision  function  in  parallel. 
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3.2  Non-articulator  agents 

It  doesn’t  arrange  a  process  of  decision.  Its  primary 
function  is  clearly  the  execution  of  the  movement. 
So,  it  represents  a  reactive  agent.  It  constitutes  of 
Process  of  communication :  it  receives  the  activation 
signal  of  articulator  agent  via  PIC,  and  transmits  the 
result  of  movement  to  the  main  articulator  agent  via 
PIC. 

•  Process  of  scheduling:  The  necessaiy  data 
allowing  the  non-articulator  agent  to  plan  its 
trajectory  of  reference  and  then  to  follows  it,  are 
contained  in  the  library  of  plans.  After  having 
executed  its  function,  the  non-articulator  agent 
sends  a  signal  of  result  to  the  articulator  agent 
partner,  which  transmits  it  to  the  percr  and 
supervisor. 

3.3  Regulator  agent 

It  constitutes  of:  process  of  communication, 
process  of  regulation,  motor  process,  process  of 
scheduling. 

* Process  of  Regulation :  its  function  is  the 
regulation  of  speed  of  the  members  given  by  the 
supervisor.  The  compliant  speeds  to  the  movement 
are  initialized  by  the  programmer  under  a  shape  of 
a  speed  interval  in  order  to  adjust  the  speeds  of  non¬ 
articulators  agents  coming  from  the  supervisor. 

Motor  process:  its  unique  function  is  the 
generation  of  a  signal  (message)  called  message 
motor,  as  well  as  the  speed  of  the  movement 
associated  to  these  members. 

•  Process  scheduling:  this  process  contains  the 
necessary  information  to  the  treatment. 

3.4  PERCR  agent 

.  After  having  received  signals  of  the  environment 
(reversed  picture),  the  cerebellum  of  our  system  is 
going  to  construct  the  real  picture  of  the 
environment,  it  is  the  perception  of  the 
environment.  Then  the  resulting  signal  is  going  to 
be  transmitted  to  the  supervisor’s. 

PERCR  constitutes  of:  Process  of  communication, 
process  of  scheduling,  Process  of  percept-ion  and 
Process  of  decision. 

Peer  receives  two  types  of  signals,  one  from 
sensors,  the  other  from  the  agent’s. 

•  Process  of  perception  :  this  process  ensures  the 
perception  of  the  environment  signals. 

This  process  arranges  a  SYstem  of  Picture 
Treatment  (SYPT)  permitting  a  real  reconstruction 
of  the  received  virtual  picture  by  sensors. 

•  Process  of  scheduling:  it  contains  a  library  of 
plans  :  One  distinguishes  two  essential  data  parts: 
data  specified  for  the  perception,  which  ensures  the 
management  of  the  system  of  picture  treatment 
(SYPT).  And  data  specified  for  the  control,  which 
ensures  the  management  of  the  control  system  by 


sending  data  concerning  states  of  the  different 
members  in  progress  of  each  cycle  of  movement 
(single  support,  and  double  support). 

•  Process  of  control  (SYMOC)  :  SYstem  of 
Movement  Control  :  agent  transmits  him 
information  on  states  of  agents  to  control  ...  at  all 
instants,  SYMOC  has  the  monitoring  of  messages 
coming  from  the  articulator  agents,  then  treats  them 
transmits  a  result  signal  to  the  supervisor  (to 
inform  the  user  of  the  state  of  march).. 

3.5  Supervisor  agent 

The  supervisor  receives  the  already  quoted  signals 
from  agents ,  treat  them  and  generate  new  signals  of 
command  or  answer  (structures  of  solution  to 
problems  met  by  the  other  agents  of  MABS).  The 
supervisor  communicates  with  the  environment  or 
agents  via  the  process  of  communication.  One 
distinguishes  two  types  of  communications,  at  the 
level  of  the  supervisor  : 

communication  intra-supervisor  (between  its 
different  constituents)  effected  via  the  Internal 
Messages  File  (IMF),  and  the  inter-supervisor 
communication  (the  exchange  of  messages 
between  the  supervisor  and  the  other  components  of 
the  system)  effected  via  DISI  DIspatcheur  of 
Signals,  and  PIC  process  of  exit  communication. 
Signals  coming  from  the  different  components  are 
identified  and  recognized  by  RISI.  All  the 
components  of  MABS  system  are  managed  by  their 
respective  managers. 

MC:  manager  of  communication 

ME:  manager  of  equilibrator 

MR:  regulating  manager 

MO:  manager  of  control 

MA:  manager  of  articulator  agent 

MN:  manager  of  non-articulator  agent 
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We  explain  little  process  :  The  library  of  plans  contains 
as  the  command  or  the  necessary  tools  for  the 
management  of  the  components  of  the  MABS  system. 

3.5.1  Basis  of  knowledge 

It  is  the  data  basis  of  MABS.  Containing  all 
information  on  the  different  components  of  the  system 
(IMCOK  ,  articulator  and  non-articulators  agents),  and 
of  each  cy  cle  of  movement  (4  cycles,  two  for  double 
and  two  for  single  support ). 

3.5.2  Monitor  of  Series  (MS) 

Its  main  role  is  the  coordination  between  the 
supervisor  actions.  This  process  is  still  in  contact  with 
the  executor  (while  providing  him  with  actions  to 
execute)  after  consultation  of  knowledge  basis  and 
process  of  scheduling.  One  can  associate  different 
strategies  to  plan  activities  of  this  monitor: 

--  With  static  priority:  once  a  priority  is  affected  into  a 
program,  this  one  won't  change  until  the  end  of 
execution  of  this  program. 

“  With  dynamic  priority:  the  priority  affected  into 
program  changes  according  to  the  environment  of 
execution  of  this  one  (load  of  system,  types  of  actions 
to  be  executed...). 

3.5.3  Executor 

it  is  the  loaded  process  of  instructions  provided  by  the 
monitor  of  series.  The  strategy  of  scheduling  data  to 
IMCOK  is  said  to  be  not  interrupted  [if  once  the 
executor  is  allocated  to  a  program,  this  one  will  keep  it 
until  the  end  of  its  execution  or  that  it  is  blocked  on  a 
resource  not  yet  ready].  The  allowance  of  executor  to 
the  program  can  make  it  according  to  the  affected 
priorities  to  program  them  automatically  (for  example 
one  is  going  to  give  to  every  signal  coming  from  an 
organ  a  priority,  with  of  course  their  organization),  or 
by  an  external  manner  by  the  person  responsible  of 
exploitation,  the  priority  can  be  allocated  statically  or 
dynamically. 

After  execution,  the  result  will  be  transmitted  to  the 
dispatcher  of  signals  in  order  to  transmit  it  to  the 
concerned  agent. 

3.5.4  Receiver  and  Identifier  of  the  Signals  (RISI)  : 

Contrarily  to  the  other  agents  of  MABS,  the  supervisor 
has  a  superior  workload.  That  requires  a  loaded 
element  of  identification  and  to  send  received  messages 
from  the  outside  into  process  or  the  corresponding 
manager  in  the  supervisor.  This  element  is  called  RISI. 

3.5.5  Dispatcher  of  Signals  (  BIST) 


Play  an  important  role  in  the  diffusion  of  signals  (either 
signals  that  have  just  been  executed,  or  those  that  has 
just  been  managed  without  being  executed). 

3.5.Managers 

ensures  the  management  of  agents  of  the  MABS 
system.  The  manager  sends  a  message  of  working  to 
the  agent  managing.  It  arrives  that  during  its  work,  the 
agent  undergoes  a  problem  of  state,  in  this  case  it 
informs  the  corresponding  manager  in  order  to  solve 
the  problem. 

3.5.6  Equilibrator 

This  process  ensures  the  balance  of  the  biped  during 
the  march  or  during  the  station.  The  physical 
conditions,  so  that  the  robot  is  in  state  of  balance,  are 
that  the  center  of  gravity  of  the  body  is  on  the  surface 
of  lift  (surface  taken  between  elements  in  contact  with 
the  ground).  It  consists  of  a  process  of  communication, 
process  of  scheduling  and  process  of  equilibration. 

The  equilibrator  receives  the  answer  from  articulator 
agents  [this  answer  is  the  result  of  the  articulator  agent 
decision  on  information  of  command  (for  example  the 
degree  of  rotation,  the  degree  of  bending  the  degree  of 
extension...)  constituting  the  signal  of  command  that  it 
receives  from  the  supervisor]  via  the  process  of 
communication,  after  a  treatment,  the  process  of 
equilibration  gives  the  state  of  balance  of  the  biped, 
and  transmits  them  to  the  supervisor.  The  plans 
contain  all  data  allowing  the  equilibrator  to  ensure  its 
function,  among  these  data. 
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The  scheme  bellow  shows  the  relationships  that 
exist  between  different  agents  already  quoted . 

4.  Management  of  the  communication: 

The  MABS  system,  according  to  its  architecture 
and  its  function,  it  appears  like  a  very  necessary 
system  in  matter  of  communication  needs.  Indeed, 
MABS  agents  communicate  between  them  by 
sending  messages. 

Every  signal  (message)  represents  the  information 
needed  by  the  agent  (a  control  to  be  achieved)  and 
as  the  result  of  treatment  of  the  previous  agent. 

5*  Simulation : 

Inter-agents  message  exchanging  protocol  via 
DDE(Dynamic  Data  Exchange)  through 
conversation  with  the  server,  there  by  he  transmits 
(transactions  )  to  the  server,  the  latter  answers  by 
providing  data  or  services.  We  choose  then  , 
asynchronous  transactions.  The  general  form  of 
algorithm  is : 

Client  agent 

{ 

xtype_connect; 
put  message  in  case 
transaction  of  emission 

read  answer 
send  xtyp_disconnect 

} 

server  agent 

{ 

xtyp_connect_confirm 

xtyp_poke 

read  data  from  case 
send  answers 

} 
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Abstract 

We  introduce  here  our  works  in  development  of  a 
snake-like  redundant  micro-robot  based  on  twenty 
identically  segments  structure  for  pipe 
investigations.  Each  segment  is  a  parallel 
mechanism  (3  DOF)  with  3  DC  motors  associated 
with  its  sensors  and  controlled  by  a  16-bit  micro¬ 
controller.  The  distributed  control  system  is 
supervised  by  a  real-time  kernel  on  the  host 
computer  communicating  via  a  Can  network. 

1.  INTRODUCTION 

The  paper  presents  the  development  of  a  mobile 
robot,  acting  like  an  earth-worm,  if  both  ends  are  free, 
or  like  an  elephant  trunk,  if  one  end  is  fixed,  within  the 
European  COPERNICUS  project  941306-FMAM.  The 
work  was  performed  by  international  collaboration 
between  University  of  Science  Metz  (France), 
“POLITEHNICA”  University  of  Bucharest  (Romania) 
and  Warsaw  University  of  Technology  (Poland). 


Fig.  1  Robot  mock-up 


Current  evolution  of  micro  and  mini-robotics  is 
directed  to  continuous  miniaturization,  to  increasing  of 
autonomy  and  flexibility!  for  accomplishing  complex 
tasks  in  narrow  or  hardly  accessible  spaces.  Today, 
industry  is  faced  with  the  need  to  increase  reliability 
and  reduce  the  maintenance  costs  for  the  complex 
mechanical  systems  and  equipment  (power  plants, 
airplane  engines,  etc.). 

This  means  a  tremendous  requirement  for 
technology,  which  makes  possible  to  perform 
inspections  and  repairs  in  tight  spaces,  without  having 
to  dismantle  the  plant  equipment.  If  the  newly 
developed  robots  will  be  able  to  remain  inside  various 
installations,  they  could  monitor  and  repair  it  and 
maintenance  work  will  be  performed  without  shutting 
down  or  dismantling  complex  machinery.  For  these 
purposes,  the  solution  is  to  transpose  the  motion 
principles,  which  are  specific  to  living  world,  into 
technical  achievements.  The  first  investigations 
concerning  development  of  mobile  robots,  with  a  snake¬ 


like  locomotion,  were  made  in  Japan,  by  Shigeo  Hirose, 
in  the  early  ‘70-s  but  there  are  reported  only  a  few 
achievements  in  this  field  (Japan,  USA,  Germany). 
Then,  waving  still  remains  an  interesting  way  of 
locomotion  for  grounds  with  variable  geometiy  or 
difficult  to  travel  through,  such  as  pipes,  while  it  is  also 
an  actual  subject  for  research  and  development. 

2.  TYPES  OF  MOTION 

The  first  stage  consist  in  define  future  modes  of 
motion  of  the  robot  in  autonomous  working,  the 
tentacular  working  leading  to  other  type  of  problem. 

Grass-snake  type  displacement  undulation  in  a 
horizontal  plan. 

Advantages :  no  problem  of  stability,  reduced 
consumption  of  energy. 

Disadvantages :  not  negligible  rubbing. 

Adder  type  displacement  undulation  in  a  vertical  plan. 
Advantages', ;  medium  energy  consumption,  reduced 
rubbing. 

Disadvantages :  problem  of  stability. 

Cobra  type  displacement  undulation  and  extension  in  a 
horizontal  plan. 

Advantages :  no  problem  of  stability,  speed  raised. 
Disadvantages :  important  energy  consumption, 

important  rubbing. 

Earthworm  type  displacement :  extension  in  a 
horizontal  plan. 

Advantages:  no  problem  of  stability,  reduced  energy 
consumption,  negligible  rubbing,  medium  speed. 
Disadvantages',  necessary  module  elongation. 

Some  investigations  have  sown  that  the  most 
interesting  solution  was  a  multiple-module  robot,  able 
to  move  like  an  earthworm.  On  file  basis  of  simulations 
undertaken  on  one  model  in  two  dimensions,  we  have 
developed  this  principle  of  locomotion. 

This  requirement  had  led  to  providing  the  module 
with  3  degrees  of  freedom:  2  rotations  (0,v|/)  and  1 
translation  (p).  The  rotations  was  necessary  for 
generating  the  snake-like  or  earth-worm  like  motion 
with  bending  waves,  while  the  translation  was  useful 
for  generating  the  earth-worm  motion  with  elongation 
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waves,  which  enables  the  robot  to  move  into  narrow 
spaces  or  to  increase  its  velocity  on  straight  trajectory. 

In  reality,  this  principle  obliged  to  realize  a  more 
complex  mechanics,  but  it  would  be  able  to  support  to 
support  all  types  of  snake-like  locomotion  presented 
previously.  We  left  therefore  the  open  way  to  other 
types  of  utilization,  particularly  in  tentacular  or  trunk 
mode. 

3.  MECHANICAL  STRUCTURE 

The  design  of  the  module  had  several  possibilities  to 
be  carried  out,  but  the  adopted  solution  was  to  link  2 
platforms  with  a  special  3  DOF  joint  (Cardan  + 
translation)  and  3  linear  actuators,  ended  with  spherical 
joints.  As  linear  actuators,  screw-drives  and 
micromotors  with  gearheads  and  encoders  were  used. 

This  articulation  (a  scale  2  model  has  been  realized) 
eliminates  mechanical  problems  like  radial  parasitic 
effort  and  axial  torque.  The  precision  is  good  since  one 
uses  kneecaps,  as  well  as  the  robustness.  The  overall 
dimensions  of  each  module  are  diameter-  30  mm, 
height-  50  mm.  The  robot  body  consists  of  20  similar 
modules  that  mean  60  DOF  to  be  controlled.  But  they 
are  not  yet  instrumented. 


Amplitude  of  angles  0  and  \\i  are  limited  to  20 
degrees  and  p  to  1/3  of  total  length  of  the  module.  The 
reverse  mathematical  model  is  simple,  which  reduce 
times  of  calculation.  However,  the  direct  mathematical 
model  has  no  analytic  solution,  since  it  is  a  parallel 
architecture. 


A  significant  amount  of  our  research  was  oriented 
on  the  module  comportment  simulation.  The 
geometrical  models  are  now  operationally  with  respect 
of  the  mathematical  problems  of  resolution.  The 
calculation  model  of  torque  and  effort  components  is 
also  prepared  (see  Reflex  level)  and  we  are  actually 
working  on  the  algorithm  of  pipe  forward  mode.  But 
the  problems  concerning  the  effort  threshold  sampling 
could  be  resolved  only  in  the  phase  of  identification  on 
the  complete  prototype. 

Direct  Model:  p,0,v[/  =  F(dl,d2,d3) 

dl  =  \f(-2  R2to$0-lR./>.sin0*2R2  +  />2) 

d2  =  a/ <V5  R./>cosfl  sinp  ♦  R2  sinff  sin£-  1  R2  cosfl  +  R.p  sinfi-  |  R2  cosp  ♦2.R*+/>2) 

ci3=  a/ ( - V* R./>. cosfi. siny  •  ^  ■  r2  sin®- siny ■  7 r2  cos ^ +  RA sinfl  ■  | . R2  cosf  ♦  2.R2 +/>2 ) 

Reverse  Model  (linearized)  :  dl,d2,d3  =  F-l(p,0,i[/) 

p  —  -  1/3  dl2  -»-3  d22+3.d32 

e  _  _ 2.  dl2  -  d22  -  d32 

2.R  V  3.  dl2  3  d22  +3.d32 

ip  =  -  2.  a/3  (  d22  -  d32  )  3  dl2  +  3  d22  -+-  3.d32~ 

3.R  (2  dl2  +5  d22  +5.  d32  ) 

Another  solution  for  direct  model  is  to  consider  that 
h  is  approximately  equal  to  dl.  Then  we  can  iterate. 

4.  CONTROL  ARCHITECTURE 

The  developed  architecture  of  the  control  system  of 
the  robot  is  hierarchically  distributed  on  two  main 
operational  levels.  At  the  higher  level,  a  real  time 
nucleus,  implemented  on  a  computer  plays  the  role  of 
the  supervisor  and  deals  with  the  trajectory  planning,  as 
well  with  the  user  interface.  At  the  lower  level,  a  local 
control  system,  distributed  between  the  robot  modules, 
allows  the  control  of  each  module,  in  order  to  reach  the 
imposed  trajectoiy.  This  system  provides  some  degree 
of  autonomy  for  each  module  and  also  presents  the 
advantages  of  modular  construction,  consisting  of 
identical  control  closed  loops  for  each  module, 
possibility  of  connecting  and  disconnecting  the  modules 
and  putting  out  of  operation  the  defective  ones,  without 
stopping  the  robot  action. 


Utilization 

Application 

Level 

-Task  planning 

-  Piloting 

-  Decision  taking 

Task 

1 

Central 

Calculator 

J 

1 

Equipment 

Cooperation 

Level 

*  Environmenl  interaction 

•  Dialog  management 

-  Peripheral  devices  management 

f? 

£ 

% 

Robot 

Effectors 

Coordination 

Level 

•  Coordination  and  modules  control  by  function 

•  Gripper  management 

Actuators 

Action 

Level 

-  Combinations  of  elementary  actions  by  operation 

-  Informations  processing 

-  Corrections  and  adaptation 

Module 

Network 

J 

*1 

Micro- 

Controler 

J 

Reflex 

Level 

-  Asservissements  of  actuators 

-  Taking  of  raw  Informations 

Hardware 

Table  1:  Control  system  architecture 

The  communication  between  the  two  hierarchical 
levels  is  accomplished  by  means  of  a  field-bus  of  CAN 
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type.  The  choice  of  this  one  was  recommended  by  its 
guaranteed  performances  of  the  communication 
protocol,  especially  for  real-time  applications.  The  two 
levels  of  the  control  system  are  organized  in  several 
layers,  as  specified  in  the  next  table.  For  a  better 
visualization,  we  present  the  functional  analysis 
structure  used  and  the  layers  covered  by  the  Copernicus 
project,  which  we  have  developed. 

Application  level 

This  level  is  that  of  the  tasks  planning.  It  is  here  that 
been  defined  objectives  to  reach,  modes  of  snake-like 
locomotion,  functionment  modes,  etc. 

This  level  is  not  included  in  our  project, 
nevertheless,  a  minimal  application  will  be  developed  to 
govern  the  robot,  with  a  joystick  or  a  mouse,  by  an 
operator  ahead  the  screen  connected  to  the  camera  of 
the  robot. 

Coordination  level 

This  layer  is  an  interface  between  the  robot  and  its 
utilization.  It  allows  an  application  to  access  to  all 
functions  of  the  robot  via  requests  to  the  exploitation 
system. 

It  manages  equally  all  computer  peripheral  devices 
or  other  (lectors  of  discs,  camera,  keyboard,  screen, 
printer,  inputs-outputs,  etc.),  as  well  as  man-machine 
and  machine-machine  interfaces. 

Cooperation  level 

At  this  level  are  realized  commands  to  the  effectors, 
such  as  gripper  and  the  whole  body  itself. 

This  level  is  implanted  in  the  central  computer  and 
dialogue  with  the  inferior  layer  by  a  network  that  it 
manages.  It  is  therefore  comparable  to  the  spinal  cord 
that  transmits  orders  and  executes  automatic  actions, 
here,  crawling  along  of  a  path. 


The  principle  of  interpolation  currently  used  is 
developed  here:  (note  that  all  the  figures  are  in  2D  for 
more  clearness,  but  calculations  are  in  3D). 


Fig.  4  :  Interpolation  principles 

to  each  segment  of  the  robot  is  associated  two 
frames,  one  on  the  below  plate  Rbi,  and  one  on  the 
above  plate  Rai,  expressed  in  an  absolute  frame 
associated  to  the  head  above  frame  Rh.  This  frame  is 
then  floating.  Note  that  the  below  frame  Rbi  of  one 
module  is  the  same  as  the  above  plate  Rai+1  of  the 
backward  module.  All  frame  are  expressed  in  a 
homogeneous  matrix  form,  i.e.  position  +  orientation. 

The  trajectory  to  follow  is  given,  by  the  superior 
level,  in  terms  of  position  Rg  to  reach  by  the  frame  Rh 
associated  to  the  head  and  expressed  in  herself.  When 
an  order  of  displacement  is  received,  the  frame  of  the 


head  and  the  goal  frame  allow  interpolating  a  curve 
tangent  to  the  Z-axis  of  the  two  frames  and  passing  by 
their  origins. 

We  actually  use  two  3 -degree  equations  for  this 
curve.  Since  the  curves  are  expressed  in  head  frame,  we 
assume  that  Z  can  only  increase.  If  the  vector  Zg  is 
oriented  in  backward  direction,  another  (intermediaiy) 
frame  Rg  must  be  defined  before  by  the  superior  level. 

The  head  has  to  follow  this  curve  with  a  relative 
precision  of  0,1mm  and  to  remain  constantly  tangent  to 
the  path,  but  the  precision  have  no  real  signification 
when  the  robot  move. 

Next  elements  have  to  follow  "on  the  best"  this 
curve,  without  obligation  of  tangency.  So  as  to  not  to 
overload  calculations,  positions  of  next  elements  are 
calculated  by  linear  interpolation  between  points  of 
passage  of  the  head.  However,  this  method  induces  to 
memorize  a  considerable  number  of  points.  We  prefer 
therefore  to  memorize  only  20  points  in  a  circular  table 
(one  point  by  segment). 

A  point  is  added  to  the  table  each  time  that  the 
second  segment  reach  or  exceeds  its  aimed  point.  The 
added  point  corresponds  to  the  theoretical  position  of 
the  head  at  this  moment.  A  point  is  suppressed  each 
time  the  last  segment  reach  or  exceeds  its  objective. 
Each  segment  that  reach  or  exceeds  its  objective  aims 
then  the  following  point  in  the  table.  The  table  is 
initialized  with  original  positions  of  20  segments  at 
power-on. 

The  amplitude  of  the  movement  is  realized  by  a 
unique  wave  going  through  the  head  to  the  tail  of  the 
robot.  With  half-sinusoid  sampled  in  n  intervals,  we 
obtain  a  flowed  movement.  To  each  temporal  sample, 
the  amplitude  of  an  articulation  pij  is  equal  to 
A.SIN(j.7i/n)  +  pmin,  where  pmin  is  the  minimal  length 
of  one  segment.  The  speed  of  the  movement  is  a 
function  of  n  and  A  is  the  maximal  segment  elongation. 
Only  five  modules  are  moving  at  the  same  time  to  the 
maximum,  so  n  is  a  multiple  of  five. 


For  positioning  the  head,  the  amplitude  obtained  for 
the  first  articulation  gives  p.  The  intersection  of  a 
sphere  of  radius  p  centered  in  Rbi,  with  the  two 
equations  of  the  trajectory  path  gives  the  position  Rnl 
to  reach  for  the  next  sample  by  the  head  above  frame 
Rh  (calculated  by  successive  approximations).  Note  that 
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Rbl  is  not  the  actual  Rbl  but  the  future  position  of  Rbl 
for  the  next  sample.  Then  the  tangent  to  the  curve  at  this 
point  give  the  orientation  of  this  frame  in  terms  of  0  and 
V- 

For  positioning  the  backward  elements,  and  only 
while  the  head  module  move,  the  amplitude  of  each 
articulation  gives  also  pi.  The  intersection  of  a  sphere  of 
radius  pi  centered  in  Rbi,  with  the  straight  line  to  the 
aimed  point  Pj  of  segment  i  gives  the  position  to  reach 
for  the  next  sample  by  the  frame  Rai  (calculated 
analytically).  In  the  example  below,  the  above  frame 
plate  Rai  of  the  module  i  is  initially  in  Oi.  It  must  go  to 
Ai.  Another  methods  consist  to  not  aim  points  Pj 
themselves,  but  segments  of  straight  line  between 
points,  for  a  best  precision.  The  solution  is  a  lightly 
more  complex. 


Fig.  6:  Module  trajectory  positioning 


The  first  calculated  point  is  that  corresponding  to  the 
most  behind  module,  then  we  goes  up  towards  the  head. 
Each  Rai  obtained  is  used  to  calculate  the  following 
(Rai  =  Rbi-1).  We  finish  by  calculation  on  the  head 
module  developed  earlier.  Let  us  note  that  the  most 
behind  module  cannot  be  directed  towards  its  aimed 
point  since  0i  and  v[/i  are  on  Rai  and  not  Rbi. 

When  the  head  module  don’t  move,  the  calculations 
is  a  quite  more  complex  because  the  five  modules  must 
move  between  two  fixed  points.  The  forward  and  the 
backward  modules  must  not  move.  Since  the  distance 
between  these  two  points  is  constant,  it  is  impossible 
that  the  sum  (in  3D)  of  the  calculated  elongation  of  the 
five  modules  have  the  good  value.  The  solution  is  to 
modulate  the  value  of  the  maximal  segment  elongation 
A,  the  relations  between  the  lengths  of  the  five  modules 
remain  in  sinus  forms.  This  is  done  by  iterations. 

When  we  have  all  the  origins  of  the  frames,  we  can 
compute  their  orientations  since  their  Z  axis  must  point 
the  origin  of  the  preceding  segment  frame  and  their  Y 
axis  are  in  the  same  plan.  This  gives  the  0i  and  vj/i.  All 
the  pi,  0i  and  \|/i  calculated  are  then  sent  to 
corespondent  modules  by  the  net. 

Note  that  when  the  robot  goes  in  backward 
direction,  the  calculations  are  analogous  but  not  exactly 
the  same  because  of  the  rotations  □  and  □  are  on  the 
frame  Rbi  and  no  more  Rai. 

Action  level 

This  level  corresponds  for  us  to  a  complete  module 
of  the  robot.  Each  module  is  composed  of  a  segment 


with  its  three  actuators,  its  proprioceptive  position  and 
effort  sensors,  the  electronics  of  power  and  a  micro¬ 
controller  managing  the  module  in  his  totality  and 
conversing  with  the  superior  level  by  a  field-bus. 

A  module  possesses  certain  autonomy  and  a  "free 
arbiter".  It  receives  orders  of  displacement  and 
functionment  modes  and  returns  responses  in  terms  of 
state  and  positions  reached.  It  is  itself  that  applied  direct 
and  reverse  geometrical  models  and  adapts  order 
received  to  conditions  of  evolution  that  it  detects  by  its 
effort  sensors.  A  module  has  to  be  seen  functionally  like 
a  vertebra  that  executing  "as  well  as  possible"  the 
orders  received  from  the  superior  level.  It  can  refuse  to 
function  in  case  of  failure,  but  it  has  to  inform  the 
coordination  level  that  will  change  then  the 
functionment  mode. 

Reflex  level 

It  concerns  the  control  of  actuators  and  the  taking  of 
raw  information.  It  is  the  layer  that  pilots  the  equipment 
and  is  situated  in  micro-controllers. 

To  control  speed  of  each  motor  of  a  module,  we  use 
the  command  Pulse  Width  Modulation.  This  for  reasons 
of  cost  and  facility  of  future  integration  in  a  VLSI  chip. 
The  Siemens  micro-controller  used  is  already  provided 
with  the  necessary  circuits. 

Two  supplementary  information  are  necessary  for 
superior  levels.  It  concerns  the  horizontally  of  the  robot 
and  lateral  efforts  applied  by  each  segment  of  the  robot 
on  the  pipe. 

Let  us  specify  that  currently,  the  robot  is  not 
instrumented  with  these  sensors. 

Compliant  command 

The  compliant  command  is  necessary  in  three 
operating  modes: 

♦  In  pipe  forward  mode,  the  robot  must  be  able  to 
adapt  to  the  inequalities  of  the  ground  and  the  curves 
of  the  pipes.  For  doing  this  it  uses  its  own  effort 
traducers  to  calculate  the  applying  forces  on  the  pipe 
walls.  It  drives  its  actuators  in  the  sense  of  the  best 
rapprochement  of  the  objective  but  without  overtaking 
the  maximal  effort  threshold.  In  this  case,  the  aimed 
position,  given  by  the  higher  level,  is  theoretical  and 
cannot  always  be  reached. 

♦  The  pipe  forward  mode  also,  allows  solving  the 

problems  of  robot  sliding  by  jamming  in  the 
canalization.  In  this  case  the  effort  threshold  will  be 
significantly  greater  and  the  top  level  define  a 
sinusoid  trajectory  or  spiral  trajectory  with  the 
diameter  just  a  little  greater  than  the  one  of  the  pipe, 
making  the  robot  to  exercise  an  effort  on  the  pipe 
walls.  1 

♦  The  inert  mode.  This  mode  is  useful  for  the 
dysfunctional  movement  of  one  or  more  modules. 
When  a  module  detects  a  network  fault  (it  can't 
communicate)  or  when  it  receives  the  order  to 
disconnect  itself,  this  mode  allows  the  robot  driving 
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and  ignoring  that  module.  The  disconnected  module 
uses  its  effort  sensors  for  pilot  its  motors  such  a  way 
that  the  module  don't  oppose  any  resistance  (active 
reversibility).  The  module  becomes,  then,  inert. 

The  problem  is  that  we  don't  use  tactile  sensors 
because  technology  does  not  allow  a  rather  high 
resolution.  We  prefer  use  effort  sensors  instead,  so  we 
can’t  try  to  determine  where  are  the  contact  points 
between  the  robot  module  and  the  pipe. 

The  effort  sensors  must  be  established  on  the  axes  of 
the  three  push  rods  dl,  d2,  d3.  When  the  module  meets 
an  obstacle,  it  undergoes  an  effort  F  which  break  up 
into  a  Rz  force  along  Z  axis  and  two  torque  Cx  and  Cy 
around  X  and  Y  axis  (the  other  components  do  not 
interest  us).  This  effort  F  also  breaks  up  into  three 
constraints  FI,  F2,  F3  on  the  three  pushers. 

We  established  the  mathematical  model  allowing  to 
calculate  Rz,  Cx  and  Cy  according  to  FI,  F2  and  F3. 
But  this  model  is  complex  and  does  not  allow  making 
the  difference  between  the  constraints  imposed  by  the 
force  F  and  those  imposed  by  the  adjacent  modules.  But 
these  two  constraints  require  opposite  reactions  of  the 
motors.  A  simpler  (and  worse)  solution  would  consist  to 
impose  a  maximum  threshold  for  the  constraints  FI,  F2 
and  F3  and  to  integrate  it  in  the  control  loops  of  the 
motors. 

Horizontality  sensor 

A  non-negligible  problem  relates  to  the  need  to 
know  the  horizontality  of  the  robot,  because  the  robot 
can  roll  on  itself.  It  then  becomes  impossible  to  direct  it 
without  this  information.  If  we  know  the  inclination 
angle  of  the  robot,  we  can  evaluate  the  real  reference 
frame  of  the  head  and  calculate  a  pseudo  reference 
frame  Rh  so  that  its  axis  Xh  is  always  vertical. 

The  Polish  team  built  a  prototype  of  3D- 
honzontality  sensor.  This  prototype  is  still  too  large  to 
be  integrated  in  the  head  of  the  robot,  but  it  enables  us 
to  work.  Currently,  this  sensor  is  in  phase  of 
characterization,  because  the  vibrations  of  the  robot 
generate  parasitized  signals. 

5.  HARDWARE 

The  Reflex  and  Action  layers,  implemented  on  each 
module,  are  under  the  responsibility  of  local  16-bit 
micro-controller. 

Application,  Cooperation  and  Coordination  layers 
are  implemented  on  a  host  computer  (PC  Pentium). 

The  communication  between  the  host  computer  and 
the  twenty  micro-controllers  is  accomplished  by  a  CAN 
field  bus. 

Controller 

Each  module  is  equipped  with  a  micro-controller  of 
16  bits  (Siemens  C167CR).  This  one  administrates  the 
communication  within  CAN  network  and  local  control 
on  the  basis  of  the  sampled  trajectoiy,  by  the  real-time 


kernel.  Micro-controller  also  implements  the 
geometrical  positioning  algorithm  and  control 
functions.  In  the  same  time,  each  module  informs  the 
central  computer  about  its  own  evolution. 


Real-time  kernel-Multi-tasking 

Specific  to  embedded  applications,  a  real-time 
kernel  is  necessary  to  allow  the  administration  of 
distributed  applications  with  a  short  switching  time  of 
different  tasks. 

The  kernel  divides  the  work  in  several  tasks, 
grouped  according  to  priorities  system.  Some  of  the 
most  important  tasks  are  robot  trajectory,  interpolation 
and  communication  with  the  modules.  Other  tasks  like 
monitoring  of  the  robot  evolution,  corrections  and 
adaptation,  prove  the  user  interfaces  are  have  different 
priority  levels. 

Concerning  the  robot  motion,  the  kernel  surveys  the 
five  parallel  moving  modules.  For  each  n/5  sampling 
period,  this  window  of  five  modules  is  cyclically  shifted 
with  one  module,  beginning  from  the  robot  head 
towards  the  tail 

During  the  development  stage,  a  real-time  system 
(iRMX)  is  used,  which,  will  be  replaced,  in  the  final 
stage,  with  a  real-time  kernel  of  smaller  size  and 
dedicated  to  specific  problems  of  robot 

Inter-modules  communication  network 

CAN  is  a  new  field  bus  with  good  performances  in 
development  of  real-time  embedded  applications;  a 
quick  serial  bus  (maximum  transfer  rate  of  IMbit/s  for  a 
maximum  distance  of  40  m).  The  communication 
protocol  is  of  CSMA/AMP  type  and  the  messages  can 


All  the  nodes,  connected  to  the  CAN  network, 
survey  and  can  receive  the  bus  broadcast  messages, 
labeled  with  a  unique  identifier  (29  bits,  B  norm, 
Extended  CAN).  The  identifiers  also  determine  the 
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priority  degree  of  the  message.  CAN  provide  a  good 
noise  protection  (differential  transmission),  as  well  a 
permanent  system  of  transmission  errors  detection  and 
recovering. 

Each  robot  module  is  see  as  a  network  node  and  is 
use  the  CAN  controller  already  integrated  on  the  16  bit 
Siemens  C167CR  micro-controller. 

Central  computer  has  a  CAN  communication  board, 
which  liberates  the  computer  from  the  CAN  protocol 
management  task.  Data  to  be  transmitted  are  read  or 
written  by  computer,  via  DPRAM  memory  (dual  port). 
Physical  connection,  in  bus  topology,  is  provided  by  a 
differential  line,  opto-electrically  coupled  assuring  a 
good  noise  protection. 

Central  computer  broadcasts,  into  the  network,  the 
operations  to  be  carried  out  by  each  module,  the  robot 
actions  sequence  and  the  necessary  data  (position  co¬ 
ordinates,  limit  values  of  forces,  etc.).  Each  module 
responds  with  its  own  data  and  information.  At  a  certain 
moment,  five  modules  act  in  parallel,  therefore  the 
messages  have  to  be  received  by  all  the  five  modules  in 
action.  So,  a  message  is  not  forced  to  contain 
information  about  receiver,  but  especially  information 
about  the  enclosed  data.  Each  node  monitors  the 
network  and  can  decide  if  the  enclosed  information  is 
interesting  or  not.  This  way,  the  messages  could  be 
organized  in  the  so-called  dictionaries,  depending  on 
the  enclosed  data  (position,  speed,  force,  etc.)  and  on 
the  priority  level. 

6.  CONCLUSION  AND  FUTURE  WORK 

We  exposed  some  techniques  and  methods  used  in 
the  design  of  snake-like  robot  built  in  collaboration  by 
three  laboratories  (French,  Romanian,  and  Polish).  The 
robot  is  dedicated  mainly  for  small  diameter  pipe 
investigations. 

First,  we  presented  the  mechanical  articulated 
structure  based  on  a  parallel  mechanism  with  3  DOF. 
Several  mock-ups  was  built  and  studied  for  finally 
adopting  the  fig.  2  design  and  for  building  a  20 
segments  robot  (scale  2: 1). 

Then  the  distributed  control  system  architecture  is 
exposed.  Methods  developed  here  are  tested  on  the 
robot  structure,  but  some  algorithms  couldn’t  be 
directly  applied  since  some  sensors  are  not  yet  enough 
small  to  fit  on  the  robot’s  modules. 

Also,  some  robot  systems  are  not  studied  in  this 
phase,  for  examples  the  vision  micro-camera  system 
and  the  head  micro-gripper; 

A  lot  of  work  still  rest  to  do  in  integration  and 
miniaturization  and  will  make  the  object  of  further 
communications. 
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Abstract 

In  this  paper,  we  study  the  performance  of  various 
controllers  designed  for  an  Autonomous  Underwater 
Vehicle  (AUV).  A  particular  AUV  is  chosen  as  a  base 
model  for  controller  design  and  comparison.  Three 
different  controllers  are  designed  to  control  the  speed,  the 
direction  heading  and  the  depth  of  the  AUV:  PD 
controller,  feedback  linearization  controller  (FBLN)  and 
sliding  mode  controller  (SMC).  Computer  simulations 
are  carried  out  to  evaluate  the  performance  of  the  various 
controllers.  Disturbances  and  parametric  uncertainties 
are  introduced  to  determine  the  robustness  of  each  of  the 
controllers 

1.  Introduction 

Many  researchers  have  studied  the  modelling  and 
controller  design  for  Autonomous  Underwater  Vehicles 
(AUVs).  In  [1],  [2],  [3]  (and  the  references  therein), 
many  controller  designs  have  been  proposed  for  AUVs 
and  the  performance  and  robustness  of  the  controllers 
have  been  studied.  We  will  not  review  all  the  control 
design  methods  for  the  AUVs  here  and  refer  the  reader  to 
the  reference  [1],  [2],  [3]  and  the  references  in  those 
papers.  In  this  paper,  we  compare  the  performance  and 
robustness  of  three  well  known  controller  design 
methods:  PD  controller,  Feedback  Linearization  controller 
and  Sliding  Mode  controller  for  an  AUV.  The  controllers 
are  designed  to  control  the  speed,  the  heading  angle  and 
the  depth  of  the  AUV.  Computer  simulations  are  carried 
out  to  verify  the  performance  of  these  various  control 
design  methods  for  an  AUV. 

2.  Dynamic  Modelling  of  an  AUV 

The  dynamic  model  of  an  AUV  is  nonlinear  due  to  rigid 
body  coupling  and  the  complex  hydrodynamics  associated 
with  underwater  operation.  The  inclusion  of  terms  such 
as  added  mass  is  an  important  feature  of  an  AUV  model 
since  it  moves  in  water.  The  equations  of  motion  can  be 
derived  using  Newton’s  2nd  law  and  Euler's  law  of  angular 
momentum. 

In  accordance  to  SNAME  nomenclature  [2],  we  define 


where  tj  is  the  linear  and  angular  position  vector,  and  v 
is  the  velocity  vector  expressed  in  a  body  fixed  frame 
respectively  (See  Figure  1).  The  6  DOF  dynamics  can  be 
written  in  compact  matrix  form  as  [2]: 

Mv  +  C(v,  v)  +  D(v)  +  g(fj)  =  Pr ,  (!) 


where  M  e  5R6*6  is  the  symmetric  and  positive  definite 
inertia  matrix  which  includes  the  added  mass,  vectors 

C,  /),£€  5R6*1  represent  centripetal  and  Corolis, 


damping,  gravity  and  buoyancy  respectively,  P  e  9?6xm 
and  t  €  SR"1*'  represents  the  control  inputs  of  the  AUV. 

The  vehicle's  path  relative  to  the  earth-fixed  coordinate 
system  is  given  by  a  velocity  transformation  [2] 

V  =  J(ti2)v,  (2) 

where  J (•)  is  the  Jacobian  matrix. 
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These  12  equations  form  the  dynamic  model  of  an 
AUV.  The  interested  reader  is  referred  to  [2]  for  more 
details  on  the  modelling  aspect  of  an  AUV. 

3.  Simplified  Decoupled  System 
In  our  study,  we  use  the  Naval  Postgraduate  School 
AUV  II  as  an  example  [1]  (See  Fig.2).  The  model  of  the 
NPS  AUV  II  is  the  form  described  by  (1)  and  (2).  The 
value  of  the  model  parameters  of  the  AUV  II  can  be 
obtained  from  [1].  These  equations  of  motion  are 
modified  further  in  [4], 


Figure  2:  Schematics  of  the  NPS  AUV  II. 


As  with  current  practice  in  submarine  control,  the 
dynamic  equations  of  motion  of  an  AUV  can  be 
decoupled  into  3  lightly  or  non-interacting  subsystems 
[1].  This  assumption  holds  true  as  the  shape  of  the  hull  of 
die  AUV  II  is  streamlined.  Three  controllers  are  designed 
for  the  following  3  subsystems: 

1 .  Speed  (surge  velocity)  controller  which  uses  u  as  the 
state 

2.  Direction  heading  control  is  determined  by  the  3 
states  consisting  of  v,r,  y/ 


3.  Depth  controller  makes  use  of  3  states  q909z  . 

The  external  control  inputs  for  each  of  the  3  subsystems 
are  the  propeller  speed  n,  the  angle  of  deflection  of  the 
rudder  Sr ,  and  the  angle  of  deflection  of  stem  planes  Ss , 
respectively.  These  inputs  are  represented  by  the  vector 

r  =  \n  Sr  &]r .  (3) 


The  roll  mode  is  left  to  be  passive. 

The  equations  of  motion  being  used  are  given  next. 
The  interested  reader  is  referred  to  [1]  and  [2]  for  the 
nomenclature  used  and  the  parameter  values  of  the 
variables. 

For  speed  control,  we  make  use  of  the  surge  equation  of 
motion  given  by  [1] 


(m =  ^lVC„o[(0.012-)|(0 

2  2  u 


.012— )| 
u 


-1].(4) 


The  equations  of  motion  used  to  determine  the  control  for 
direction  heading  are  [1] 


m[v  +  xar-yar2]  =  ^LA  [7/r]  + I3[T'v  +  )>0r] 
+^L2[Y;u0v  +  Y^ru02Sr] 


(5) 


I zr  +  m[xa  (v +u0r)~  yG  (w0  -  vr)]  =  ^L5[N’tr] 

+^L4[Nlv  +  N’ru0r]  +  ^Lt  [N'vu0v  +  N’s,u02Sr] 
\j/  -r . 


(6) 

(7) 


The  4  equations  of  motion  that  are  used  for  the  design  for 
the  depth  controller  are  [1] 

«[*- u0q  -  xaq  -  zaq2]  =  ^  LA[Z\q]  + 

+  Z'qu0q ]  +  ^L2[Z'wu0w+u02(Z'SsSs)]  +  (W  -  B)  cos0c(8) 
Iyq  -  m[xG  (w  -  u0q )  -  zG  (u0  +  wq)\  =  ^-L5  [M’.q] 

+ 1 14  [A/>  +  M'uqu0q]  +  ^  I3  [M’mu0w  +  u2  {M'SsSs )] 

-(xgW-xbB)  cosB0 -(zgW-zbB)  sin0o  (9) 

0  =  q  (10) 

z  =  -u0  sin  0O  +wcos0o  .  (11) 

4.  Controller  Design 

For  the  controller  design,  we  assume  that  all  states  are 
observable  and  full  state  feedback  is  possible. 

4.1  PD  Controller 

From  (4),  setting  ii-  0  gives  u2  cc  «2  at  steady-state 
condition.  Therefore,  it  is  found  that  the  propeller  speed, 
n ,  and  the  surge  velocity  are  linearly  related.  In  order  to 
design  a  speed  controller,  we  consider  a  first  order 
linearization  of  (4)  at  operating  points  nd  and  ud  . 

A  speed  controller  consisting  of  a  pure  P-controller 
with  a  feedforward  term  is  proposed.  This  feedforward 
term  makes  use  of  the  knowledge  of  the  linear 
relationship  between  propeller  speed  and  surge  velocity, 
while  the  P-controller  is  used  to  reduce  the  steady  state 
error  to  zero  asymptotically.  Therefore,  the  control  law  is 
n  =  Kxnd  +KlP(u-ud)9  (12) 

where  nd  is  the  desired  propeller  speed  related  to  desired 
surge  velocity  ud ,  Kx  is  the  proportional  constant,  KlP 
is  the  proportional  constant  that  has  to  be  chosen  carefully 
to  exponentially  stabilize  (4)  so  as  to  reduce  the  steady 
state  speed  error  asymptotically  to  zero. 

The  direction  heading  and  depth  control  utilize  the 
linearized  equations  of  motion  based  on  (5)-(7),  linearized 
about  the  desired  surge  velocity  u-ud  =  1.0m/  s  ,  while 
the  other  equilibrium  states  are  at  zero.  We  propose  a 
rudder  controller  which  is  a  simple  PD  controller. 
Treating  the  sway  velocity  v,  as  negligible,  the  controller 
is  given  by 

Sr  =K  2P(if/ci  -<//)  + K2D  {rd  -  r) ,  (13) 

where  y/d  and  rd  are  the  desired  heading  and  angular 
yaw  rate  respectively.  Both  K1P  and  K1D  are  chosen 
for  performance  and  to  ensure  exponential  stability  of  the 
system  (5)-(7). 

Similarly,  the  depth  controller  makes  use  of  the  state 
equations  (8)-(ll).  It  should  be  noted  that  the  heave 
velocity  w,  is  ignored  as  simulations  have  shown  it  to  be 
negligible.  Based  on  the  linearization  of  (9)-(l  1),  we 
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propose  a  depth  controller  with  the  control  law  given  by 
&  =  KiP (6d  ~0)  +  K3D(qd  -q)  +  G3(zd  -z),  (14) 

where  the  constants  K3P , K3D ,G3  are  chosen  to  ensure 
closed-loop  stability  of  the  system  (8)-(ll).  The  first  2 
terms  in  the  controller  (14)  exponentially  control  the  pitch 
angle  to  zero  while  the  last  term  is  used  to  achieve  the 
asymptotic  convergence  to  the  desired  depth  zd . 

4.2  Feedback  Linearization  Controller 
Consider  a  simplified  system  where 

?  =  v  (15) 

mv  +  h  =  r  (16) 

Note  that  in  equation  (15),  v  is  the  velocity  of  the  state 
to  be  controlled.  The  nonlinearities  in  "h"  can  be 
cancelled  out  by  the  choice  of  a  suitable  control  law 
z  =  mav+h,  (17) 

which  leads  to  a  linear  closed-loop  system  given  by 
7  =  v  (18) 

v  =  ar.  (19) 

The  commanded  acceleration  av  can  be  selected  to  be 
of  the  form 

av  =vd  -X,(v-vd)-X2(j]-jjd),  (20) 

where  vd  is  the  desired  velocity,  qd  is  the  desired  state 

(position),  which  gives  us  the  closed-loop  system  given 
by 

e  +  X2e  +  X2e  =  0  (21) 

where  e  =  q-T]d .  Note  that  X / ,  X2  >0  ensures  exponential 
convergence  of  e(t)  to  zero. 

For  speed  control,  we  use  (4),  which  is  a  special  form 
of  (15)-(16).  The  speed  control  is  chosen  similar  to  the 
control  law  (17)  where  the  commanded  acceleration  is 
chosen  as 

au  =«rf  ~K(u~ud),  (22) 

where  ud  is  the  desired  speed  for  the  AUV.  This  results 

in  a  closed-loop  system  given  by 

u-ud  +A„(u-ud)  =  0 .  (23) 

Thus,  if  \  >  0 ,  we  have  exponential  convergence  of  the 
speed  u(t)  to  the  desired  speed  Ud  (/)  . 

For  direction  heading,  we  used  equations  (6)-(7),  which 
are  of  the  same  form  as  (15)-(16).  The  heading  angle 
controller  is  chosen  similar  to  the  control  for  (17),  where 
the  commanded  acceleration  is  chosen  as 
a?  =rd  - Xri(r -rd) - Xr2(y  -\j/d)  (24) 

where  y/d  is  the  desired  heading  angle  and  rd  is  the 
desired  heading  angle  rate. 

This  results  in  a  closed-loop  system  given  by 
(rd  -r)  +  XrI(r-rd)  +  Xr2(y/-yd)  =  0 .  (25) 

Thus  if  XrI ,  Xr2  >  0 ,  we  have  exponential  convergence  of 
the  heading  angle  y/(t)  to  the  desired  heading  angle 


In  order  to  design  the  depth  controller,  we  consider 
equations  (9)-(l  1).  Notice  that  the  depth  controller  is 
designed  in  2  steps.  Firstly,  we  design  a  pitch  controller 
and  secondly,  we  add  an  outer  loop  proportional 
controller  for  depth  controller. 

To  devise  the  pitch  controller,  we  use  (9)-(10),  which  is 
of  the  form  as  (15)-(16).  The  pitch  control  law  is  chosen 
similar  to  the  form  given  by  (17)  where  the  commanded 
acceleration  is  chosen  as 

ae  =Qd -Agi(q-qd)-Xe2(0-0d) ,  (26) 

where  9 d  is  the  desired  pitch  angle  and  qd  is  the  desired 
pitch  angular  rate. 

This  control  law  would  result  in  a  closed-loop  system 
given  by 

(id-q)  +  lai(q-qd)  +  X02(e-Od)  (27) 

Thus  if  Xyi ,  Xq2  >  0 ,  we  have  exponential  convergence 

of  the  pitch  angle  9{t)  to  the  desired  pitch  angle  6d{t). 

Now  to  control  the  depth,  we  have  to  modify  (26)  to 
include  an  outer  loop  proportional  controller  for  depth 
control.  Thus  for  depth  control,  we  modify  (26)  as 

=q<)  -hi(q-qd)-xe2(e-ed)-Ge(z-zd)  (28) 

where  zd  is  the  desired  depth.  Appropriate  values  of 

^■ei’^02  an£f  Gg  are  then  chosen  to  guarantee  good 
performance  of  the  closed-loop  system  with  the  controller 
given  above. 

4.3  Sliding  Mode  Controller 

The  approach  from  [1]  is  used  here  for  the  design  of  the 
SMC.  The  nonlinear  system  described  by  the  equations 
that  corresponds  to  each  sub-system,  i.e.  (4),  (5)-(7)  and 
(8)-(l  1),  can  be  rewritten  as 

x  =  Ax+bu  +  f(x )  (29) 

The  first  2  terms  of  (29)  form  the  linear  part  while  the 
last  term  contains  the  nonlinearities.  The  linearized 
equations  of  motion  at  operating  point  corresponding  to 
ud  =  1.0m/ s,  are  used  to  select  the  sliding  surfaces. 

In  the  design  for  a  SMC,  we  need  to  define  sliding 
surfaces  in  state  error  space.  Let  this  error  be  denoted  by 
x  —  x  —  xd .  The  sliding  surfaces  are  then  given  by 

o  =  sTx.  Global  asymptotic  stability  of  the  sliding 
surface  dynamics  can  be  guaranteed  through  defining  a 
Lyapunov  function  such  that 

V(<r  (t))  =  y2<rTa.  (30) 

Since  V(v)  is  positive  definite  and  unbounded,  global 
asymptotic  stability  of  <r(t)  can  be  assured  if 
V((r)  =  6r<r<0  (31) 

The  control  law  of  each  of  the  sub-systems  (i.e. 
corresponding  to  (4),  (5)-(7)  or  (8>(11»  is  chosen  to  be 
of  the  form 
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u-u-^-u  .  (32) 

The  nominal  control  is  chosen  as 
u  =  -kx  (33) 

and  u  is  the  sliding  control  law  which  is  determine  next. 
If  the  pair  ( A,b )  is  controllable,  substituting  equation  (33) 
into  (29)  gives 

x  =  Acx  +  bu  +  f(x)  (34) 

where  Ac  =  A-bk .  Premultiplying (34) by  sT  and 
subtracting  sT xd  from  both  sides,  we  get 
a  =  sTAcx  +  sTb  u  +sT  f(x)-sT xd .  (35) 

If  sTb  is  invertible,  choosing 

u  =  (sTb)~\sTxd  -  sTf(x )  -  rf  sgn  cr] ,  (36) 

where  f(x)  is  the  best  estimates  for  the  nonlinearities 
and  uncertainties,  yields 

&  -  s  T  Ac  x  -  7]  sgn  cr  +  s  T  Af(x) ,  (37) 


where  Af  (x)  =  f(x)  -  f(x) . 

The  choice  of  a  can  now  be  determined.  If  the  vector 
s  €  913  is  the  left  eigenvector  of  the  closed-loop 
dynamics  Ac  such  that  sr[Ac]  =  0 ,  equation  (37)  can 
be  simplified  as 

&  =  -77  sgn(cr)  +  s  T  A/( x) .  (3  8) 

This  is  possible  if  the  gain  vector  k  is  chosen  to  place  the 
closed-loop  eigenvalues  of  the  matrix  Ac  at  \  =  0  , 
and  A3  are  arbitarily  selected  based  on  performance 
requirements.  Substituting  (38)  into  (31)  yields 

V  -  aa  =  -t]<j  sgn(a-)  +  er  sT Af(x) 

=  -r}\cr\+o-sTAf(x). 

Thus,  if  T]  is  chosen  "large"  enough,  which  implies 

V  <0 ,  we  can  be  assured  of  global  asymtotic  stability 
while  overcoming  the  nonlinearities  and  uncertainties 
represented  by  the  second  term  on  the  RHS  of  (38). 
Details  of  the  above  formulation  can  be  found  in  [2]. 

The  implementation  of  the  discontinous  switching 
control  sgn(<r)  results  in  chattering.  In  practice,  this  is 
undesirable  since  it  involves  a  high  amount  of  control 
activity  and  may  excite  unmodeled  high  frequency 
dynamics.  To  achieve  smoother  switching,  a  suitably 
selected  value  of  (O),  known  as  the  "boundary  layer", 
can  be  added.  The  modified  switching  law  is  given  by  the 
saturation  function  sat(al$>)  such  that 


,,  /rtV.  Jsgn(o-)  //|cr/<t>| : 
sat(aM  =  |ct/(J)  J  J 


(40) 


l» 

otherwise 

This  reduces  tracking  precision,  but  improves  robustness 
by  increasing  the  bandwidth  of  the  system  and  reduces  the 
effects  of  chattering  [2].  In  this  paper,  we  follow  [1]  in 
using  the  tanh(<r/0)  functions  for  the  switching  control 


instead  of  the  saturation  function  sat(cr/ 0)to  reduce 
chattering. 

5.  Simulation  Results 

The  controllers  are  designed  based  on  a  constant  speed 
which  is  at  ud  -1.0m/  s  and  by  making  use  of  the 
parameter  values  of  the  NPS  AUV  II  given  in  [1].  We 
consider  these  values  to  form  the  "nominal"  model  of  the 
AUV.  Parametric  uncertainties  are  introduced  by  50% 
changes  in  the  values  of  the  hydrodynamic  derivatives. 
This  is  because  hydrodynamic  derivatives  are  the  most 
difficult  to  obtain  accurately  and  thus  contribute  to  the 
highest  possibility  of  errors  in  modelling  any  AUV. 

External  disturbances  are  introduced  which  represent 
translational  sinusoidal  waves  with  respect  to  the  global 
fixed  frame  and  transformed  to  the  body  fixed  frame  as  an 
additional  velocity  vector. 

Computer  simulations  are  carried  out  using  Matlab™ 
with  a  sampling  time  of  0.1  seconds.  This  is  sufficient  for 
our  case,  as  the  time  constant  for  the  AUV  dynamics  is 
large.  Source  codes  were  adapted  from  [5],  with 
extensive  modifications  to  suit  our  particular  simulation 
needs. 

The  tracking  reference  input  is  generated  based  on  the 
equation  of  a  cubic  polynomial  [6].  This  is  used  instead  of 
a  step  input  to  prevent  saturation  of  the  input  efforts  and 
also  to  provide  a  reference  input  which  is  continuous  as 
time  derivatives  are  needed  for  the  controllers. 

The  gains  for  all  3  controllers,  that  is  the  PD  controller, 
the  feedback  linearization  controller,  and  sliding  mode 
controller,  have  been  adjusted  such  that  all  the  input 
efforts  would  be  of  the  same  range.  This  is  done  so  as  to 
make  a  fair  comparison  among  the  3  different  schemes. 
Representative  results  from  our  simulations  will  be 
discussed  in  this  section.  Each  figure  shows  the  surge 
velocity  u  in  metres  per  second,  the  yaw  (direction 
heading)  in  degrees  and  the  depth  Z  in  metres 
respectively.  The  bottom  3  graphs  show  the  propeller 
speed  n  in  RPM,  the  rudder  and  stem  planes  deflection 
angles  in  degrees. 

First,  with  surge  velocity  at  1.0 m/s,  the  desired 
direction  heading  is  commanded  from  0°  to  20°  in  30 
seconds  and  the  desired  depth  is  commanded  from  0 m  to 
2m  in  50  seconds.  We  see  from  Figure  4  that  all  3 
controllers  have  similar  performance.  As  expected,  all  3 
controllers  perform  well  at  this  operating  condition. 

Next,  we  examine  the  performance  of  the  controllers 
under  modelling  uncertainties.  The  controller  is  designed 
based  on  the  nominal  model  of  the  AUV.  But  for 
simulation,  the  model  used  has  a  50%  larger  value  for 
hydrodynamic  parameters.  Figure  5  shows  the  simulation 
with  u  =  1.0 mis  ,  the  desired  direction  heading  changing 
from  0°  to  20°  in  30  seconds  and  the  desired  depth 
changing  from  0 m  to  2m  in  50  seconds.  We  see  that  the 
AUV  is  stable  with  all  3  controllers. 
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For  figure  6,  a  sinusoidal  disturbance  of  low  frequency 
and  small  amplitude  is  applied  along  the  Y-  and  Z-axes  to 
the  nominal  AUV  model  so  as  to  simulate  environmental 
disturbances.  The  operating  surge  velocity  is  u  =  2.5m/ s, 
with  the  direction  heading  changing  from  0°  to  20°  in 
30  seconds,  and  the  desired  depth  changing  from  0m  to 
2m  in  50  seconds. 

All  3  controllers,  which  are  designed  based  on  the 
nominal  model,  are  able  to  stabilize  the  AUV  to  the 
desired  set  points.  The  speed  control  used  in  the  PD- 
controller  outperforms  the  other  2  possibly  due  to  the 
presence  of  the  feedforward  term.  Both  the  SMC  and 
FBLN  controllers  display  good  overall  performance  in  the 
other  2  objectives. 

From  the  various  responses  of  the  AUV,  we  can 
observe  some  interesting  results.  The  PD  controller  does 
not  perform  as  required  when  the  operating  surge  velocity 
is  significantly  higher  than  u  =  l.0m/s.  The  linearization 
of  the  nonlinear  model  is  not  robust  enough  to  handle  a 
wide  range  of  operating  conditions.  While  for  the  case  of 
SMC  and  FBLB,  they  are  robust  as  their  designs  are 
based  on  the  nonlinear  model  of  the  AUV,  which  retain 
important  nonlinear  dynamics. 

6.  Conclusion 

The  comparisons  of  the  3  control  schemes  applied  to  an 
AUV  have  reinforced  several  points  we  already  know. 
Using  a  simple  PD  controller  for  a  nonlinear  plant  may 
give  rise  to  stability  problems  if  the  operating  condition  is 
far  away  from  the  linearized  point.  The  use  of  FBLN 
allows  the  full  knowledge  of  the  nonlinear  plant  to  be 


used.  Together  with  SMC,  the  2  schemes  show  improved 
capabilities  over  the  simple  PD.  Although  SMC  tends  to 
be  more  involved  while  implementing  the  control  law,  it 
has  the  added  advantage  of  not  needing  the  exact 
modelling  parameters. 
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Figure  3:  Legend  for  Figures  4  to  6 


Figure  4  Graph  showing  response  of  AUV  model,  with  direction  heading  from 
¥  =  0°  to  y/  =  20°  in  30  sec  and  depth  change  from  z  =  0m  to  z  =  2m  in  50  sec. 
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Abstract 

In  this  paper,  a  novel  speed  control  of  a  single  phase 
induction  motor  is  proposed  using  a  modified 
inverter.  Three  MOSFETs  are  used  for  controlling 
the  flux  linkage  of  a  single  phase  transformer,  its 
secondary  winding  is  connected  in  series  with  the 
motor.  On-line  Bang-Bang  speed  control  of  motor 
scheme  is  implemented  using  a  microprocessor  to 
derive  the  inverter.  A  mathematical  model  for  the 
single-phase  induction  motor  is  presented,  which  is 
used  to  build  up  a  simulation  program  for  a  certain 
desired  speed.  Experimental  results  have  been 
carried  out  to  investigate  the  motor  performance  with 
the  controller.  Good  agreement  has  been  obtained 
between  simulation  and  experimental  results. 

Keywords 

Microprocessor,  Flux  linkage,  MOSFET,  Bang-Bang 
control,  and  single-phase  induction  motor. 

^Introduction 

The  high-performance  inverter  technology  realizes 
many  new  application  fields.  The  dc/ac  static 
inverters  permit  feeding  the  complex  systems, 
providing  a  low  rate  of  harmonic  distortion,  high 
efficiency,  rapid  dynamic  response  and  competitive 
cost.  The  great  inconvenience  found  in  these 
inverters  lies  in  the  great  power  dissipation  during 
turn  on  and  turn  off  of  the  semiconductors,  and  in 
turn  limiting  the  inverter  frequency.  This  problem 
prevents  the  reduction  of  weight,  volume  and  cost 
for  the  system  as  a  whole  [1-4], 

Pulse-Width  Modulation  (PWM)  schemes  for  dc-ac 
power  conversion  have  received  much  attention  in 
the  last  two  decades.  Many  PWM  schemes  have 
been  developed  and  implemented  successfully  for 
different  applications.  Variable-speed  induction 
motor  drives  have  found  widespread  use  of  pulse- 
width  modulation  inverters.  In  many  applications,  a 
further  cost  reduction  for  the  drive  is  an  important 
aspect  and,  thus,  a  reduction  of  the  number  of  power 
semiconductors  in  the  converter  should  be  a  main 


consideration  [5-6].  In  recent  years,  numerous  pulse 
width  modulation  pattern  generation  techniques  have 
been  developed  for  improving  the  performance  [2- 
7],  When  an  induction  motor  fed  by  a  voltage  source 
inverter  ,  the  applied  stator  voltage  wave  forms 
contain  harmonics  generated  by  the  PWM  technique 
and  the  system  stability  will  be  affected  by  these 
harmonics  especially  at  low  frequencies. 

This  paper  is  presents  a  speed  control  of  a  single¬ 
phase  induction  motor  fed  by  the  proposed  inverter. 
Such  a  system  can  be  used  in  the  industrial 
applications,  which  required  smooth  variation  of 
speed  with  high  response,  also  in  controlling  speed 
of  car  vechals,  and  robot  motion.  The  proposed 
systems  requires  only  three  MOSTETs  and  gives  an 
output  ac  voltage  can  be  controlled  in  a  wide  range 
to  give  smooth  variation  of  motor  speed  . 

2.  System  operation  and  analysis 

In  this  section,  the  proposed  system  shown  in  Fig. 
1,  is  described  and  analyzed.  MOSFETs  1  and  2 
transfer  the  input  voltage  to  the  output  load  (motor) 
through  step-up  transformer,  which  having  a  rating 
more  than  the  load  rating,.  MOSFET  3  connected  in 
parallel  across  the  load  (transformer  primary)  to 
prevent  any  opposite  voltage  in  the  output,  either  in 
positive  or  negative  half  cycle,  also  to  reduce  the 
output  spikes  and  improving  the  output  voltage  Vl. 
The  MOSFETs  switching  are  controlled  by  using 
PWM  technique.  The  driver  circuits  (shown  in 
Fig-2)  are  feeding  from  a  microprocessor  to  arrange 
the  switching  process,  according  to  the  reference 
control  voltage  (Ref)  and  feed  back  signal  from  the 
motor  speed  (N)  through  a  tacho-generator. 
MOSFET  3  is  switched  on  if  and  only  if  either 
MOSFET  1  or  MOSFET  2  is  turned  off  to  prevent 
the  opposite  voltage  due  to  the  load  inductance  ( 
transformer  and  motor ) . 

A  square  wave  signal  with  50  Hz  to  the 
microprocessor  is  used  to  achieve  synchronization 
between  the  MOSFETs  pulses  and  the  applied 
voltage  to  the  motor.  This  circuit  is  used  if  an  ac 
supply  exists  to  recognize  the  frequency  of  the 
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waveform  of  the  output  voltage,  but,  in  case  of  the 
absence  of  the  ac  supply,  a  timer  is  used  to  achieve  a 
fixed  load  voltage  frequency.  Comparing  between 
reference  speed  (Ref.)  and  the  motor  speed  (tacho- 
generator  feed  back  signal )  through  summing  circuit 
used  to  give  the  error  signal  e,  A/D  circuit  transfers 
the  error  signal  e  to  the  microprocessor  for 
arranging  the  MOSFETs  switching.  Applying  bang  - 
bang  control  action  [6]  as  follows  ;  if  e  >  zero  hence 
turn  on  MOSFETs  1  or  2,  depending  on  positive 
or  negative  load  voltage,  otherwise  if  e<zero 
hence  turn  off  MOSFETs  1  or  2  . 


Fig.  1  A  schematic  diagram  of  the 
proposed  system 


330  ohm 


Fig.  2  Driver  circuit  for  MOSFET 
3.  Motor  mathematical  model 


For  running  capacitor  single  phase  induction  motor, 
the  motor  equations  in  d-q  axis  (Fig.  3)  can  be 
written  as  follows: [9] 

Vm=  (Rm^m  P)  *m  +M  p  \a 

-  Vc  =  (  Ra  +  La  p  )  ia  +  A*  M  p  i  p 


0-Mpim-  As  M  ia  +  (Rf  +  W  P)  ia  ’  Er  A#  ip 


0=  M  A#im+  AsMp  ia+Lr  A#  ia  +  (Rr  +  Lr  P)  ip 

A\  AX 


where  c 


The  instantaneous  electromagnetic  torque  may  be 
expressed  in  terms  of  substitute  variables  as: 

Te  ~  P  M  (ift  im  -  Ia  ia  As) 

The  equation  of  motion  can  be  written  as  follows: 

=  Te-TL-Kcor 

Al 

The  present  model  is  valid  for  both  steady-state  and 
transient  conditions. 


P  -axis 


Fig.3  Representation  of  single-phase 
induction  motor 


4.  System  performance  and  results 

In  order  to  evaluate  the  performance  of  the 
proposed  system,  dc  source  used  with  suitable  value 
and  must  be  free  of  ripples  and  accepted  recovery 
power  such  as  car  batteries,  where  the  dc  voltage 
supplied  inverter  has  great  influence  of  its  operation. 
An  experimental  circuit  has  been  built  according  to 
Fig  .1  and  tested  at  the  laboratory  to  evaluate  the 
motor  performance  under  different  conditions  of 
operation . 

For  single  phase  running  capacitor  induction  motor 
with  parameters  shown  in  Appendix  (a),  two 
batteries  of  12  volt  each  (Vs/2)  are  used  as  a  dc 
source,  power  devices  are  MOSFETs,  type  IRFP740. 
A  pulse  width  modulation  control  is  achieved  using 
microprocessor  8085,  and  Bang-Bang  control 
technique  Algorithm  is  used,  an  Assembly  program 
is  written  and  stored  in  an  EPROM  interfaced  with 
the  microprocessor  to  achieve  that  technique. 
Figures  4-7,  show  the  experimental  and  simulation 
waveforms  of  the  motor  speed,  terminals  voltage  and 
current  during  startup  period  and  steady  state 
interval  at  reference  speed  =  1040  r.p.m.,  the  spikes 
in  the  experimental  waveform  may  be  due  to  the 


409 


motor  and  transformer  inductance.  The  motor 
performance  under  steady  state  condition  is  shown  in 
Figs. 8-9.  When  a  change  in  the  reference  speed  is 
occurred,  the  motor  response  is  shown  in  Figs. 
10,11.  It  is  noticed  that  the  motor  response  follows 
the  reference  speed.  This  means  that  the  proposed 
controller  is  accurate  and  smooth  on  the  period  of 
change.  Figures  12,13  show  the  motor  response 
when  the  load  torque  is  changed  by  60  %  of  full  load 
torque. 

5.  conclusion 

This  paper  describes  a  novel  method  of  speed 
control  of  an  ac  motor  (single-phase  running 
capacitor  induction  motor  )  by  adjusting  the  flux 
linkage  between  two  magnetic  coils  using  a  modified 
inverter  and  Bang-Bang  control.  This  strategy 
influences  the  effective  equivalent  inductance  of  two 
coils  connected  in  series  with  the  motor, 
consequently  control  the  motor  applied  voltage. 
Simulation  and  experimental  study  have  shown  that, 
by  this  method  of  control  the  speed  of  the  motor  can 
be  changed  smoothly  from  zero  value  to  the  rated 
speed  with  high  response. 
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7.  Appendix  (a) 

The  single  phase  running  capacitor  induction  motor 
has  : 

110  volt,  rated  voltage.,  0.5  amp  rated  current  aiul 
2770  r.p.m,  rated  speed,  Ls=Lr=  0.25  H,Rr=50 
0.,Rc-69.76  O,  M-0.6  H,  C=3  p  F,  and  As  »  I 
Transformer  rated  current:  3  amp 

Symbols 

As:  Auxiliary  to  main  winding  turns  ratio, 
ia,  im  :  Auxiliary  and  main  winding  current. 
i(x>  iR  •  rotor  current.in  a  and  p  axis; 

J  :  Moment  of  inertia  in  Kg.m2 

Lp  Auxiliary,  main  and  rotor  winding  self 
inductance. 

M:  Main  to  rotor  mutual  inductance. 

K:  Friction  constant. 

P:  number  of  pair  poles, 
p:  d/dt 

Rm»  Rp  Auxiliary,  main  and  rotor  winding 
resistance. 

Te:  Electromagnetic  developed  torque. 

Ty  :Load  torque. 

V,  :  Supply  voltage 

Vc:  capacitor  terminal  voltage  . 

Vm:  Amplitude  of  a.c.  Input  voltage 
0  :  Electrical  angle  between  stator  and  rotor. 
cor:  Motor  speed  in  rad,/sec  =  d©  /  dt  . 
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Figure  6  Start  up  of  motor  terminal  voltage  and 
current  (Experimental  results) 


Figure  8  Steady  state  of  motor  wave  forms 
(Experimental  results) 
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Figure  7  Start  up  of  motor  terminal  voltage  and 

current  (Simulation  results) 


Time  (sec.) 

Figure  9  Steady  state  of  motor  wave  forms 
(Simulation  results) 


Figure  10  Experimental  results  (variation  of 
reference  speed) 

Chi.  Reference  speed 
Ch2.  Motor  speed 


Figure  12  Experimental  result  of  motor  speed 
(variation  of  load  torque) 
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Abstract 

This  paper  is  study’  on  the  design  of  IP  servo  controller 
for  cold  tandem  mill,  using  LMI  (Linear  Matrix 
lnequality)-based  techniques.  The  controller  improves  the 
performance  of  AGC(Automatic  Gauge  Control)  system 
based  on  Taylor  linearized  model  of  cold  tandem  mill  and 
satisfies  robust  stability,  the  restraint  of  disturbance  and 
robust  tracking  property.  The  IP  servo  control  problem  is 
modified  as  an  usual  IP  control  problem,  and  the 
solvability  condition  of  fP  servo  control  problem  depends 
on  the  solvability  of  modified  TP  control  problem.  But 
this  modified  problem  dose  not  satisfied  standard 
assumptions  for  TP  control  problem,  it  can  be  solve  by 
using  LMI  techniques.  Consequently,  the  comparison  of 
the  performance  between  our  IP  servo  controller  and  the 
existing  PID/FF(FeedForward)  controller  shows  the 
usefulness  of  this  study. 


1.  Introduction 

For  the  improvement  of  mill  product  quality  in  cold 
rolling  process,  there  has  been  much  research  related  to 
the  development  of  AGC  system  to  improve  the 
performance  of  the  existing  gauge  control  system.  In 
especial,  the  new  mathematical  models  for  cold  tandem 
mill  and  control  systems,  which  are  applied  with  modem 
control  theories,  such  as  optimal  control,  robust  control, 
etc,  have  been  developed.  Recently,  there  have  been  many 
studies  on  the  design  of  robust  hC  control  system 
considering  interstand  interference  in  cold  tandem  mill1 'l 

An  H00  control  is  the  method  of  control  design  which 
has  a  performance  index  containing  uncertainties,  such  as 
disturbances,  modeling  errors  occurred  from  the  variation 
of  system  parameters  or  the  model  approximation.  An 
LMI-based  H°°  control  particularly  has  been  researched 
lately.  If  the  uncertainties  are  considered  as  the 
unstructured  form,  these  are  specified  to  the  form  of 
weighting  function  in  frequency  domain.  The  importance 


of  determining  uncertainties  has  been  perceived  in  the  FL 
control,  so  the  research  on  identifying  mathematical 
model  with  input  and  output  data  has  been  achieved 
briskly. 

At  first,  we  analyze  the  dynamic  characteristics  of  cold 
tandem  mill  as  a  basic  procedure  of  our  study.  The 
interrelation  between  input  and  output  data  for  model 
identification  and  the  effect  of  uncertainties  to  our  system 
can  be  determined  from  the  results  of  these  analyses.  And 
then,  we  determine  a  mathematical  model  of  cold  tandem 
mill  using  model  identification,  based  on  these  dynamic 
analyses.  In  our  paper,  MOESP  (MIMO  output-error  state 
space  model  identification)  algorithm  which  is  useful  to 
MIMO  system  identification  is  applied  to  construct  the 
mathematical  model[1].  First,  a  nominal  model  is  identified 
from  input  and  output  data  without  uncertainties,  and  then, 
an  uncertain  model  is  identified  from  input  and  output 
data  containing  uncertainties.  The  modeling  error  can  be 
determined  by  calculating  the  error  between  the  nominal 
and  the  uncertain  model.  And  the  maximum  singular  value 
plots  of  both  the  modeling  error  and  the  disturbance  are 
shown  in  this  paper.  Finally,  stable  weighting  functions  for 
designing  \P  servo  controller  are  determined  by  these 
maximum  singular  value  plots. 

In  order  to  design  the  servo  controller  which 
satisfies  robust  stability,  restrain  disturbance  and  has 
robust  tracking  property,  the  FI00  servo  control  problem  is 
modified  as  an  usual  YP  control  problem  with  internal 
model  which  contains  the  mode  of  reference  input 
model[3].  But  the  modified  generalized  plant  in  this  FT 
control  problem  does  not  satisfy  the  assumptions  of 
DGKFI31[6].  Hence,  the  LMI-based  solution  to  the  \P  control 
problem  is  applied  to  solve  our  YP  control  problem^ 

This  paper  consists  of  four  chapters.  Chapter  1,  an 
introduction,  gives  brief  reviews  of  previous  study  and  the 
purpose  of  this  paper.  The  dynamics  of  cold  tandem  mill 
and  the  simulation  results  are  introduced  in  the  chapter  2. 
Chapter  3  presents  the  model  identification  for 
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2.  The  dynamic  characteristic  of  TCM 

In  this  chapter,  as  a  basic  process  of  designing  the  H00 
servo  controller,  we  analyze  the  dynamic  characteristics 
of  cold  rolling  process  using  a  computer  simulation  with 
some  empirical  formulas  of  the  TCM.  Through  this 
simulation,  we  grasp  the  motion  of  the  cold  rolling 
process  and  determine  the  relation  of  input  and  output 
signal  for  model  identification  of  the  TCM.  The  three- 
stand  the  TCM  considered  in  this  study  is  shown 
schematically  in  Fig.  1 . 


1  STAND  2  STAND  5  STAND 


Figure  1.  Schematic  of  three-stand  the  TCM 


Some  typical  empirical  formulas  of  the  TCM  used  in  the 
simulation  for  the  dynamic  analyses  are  given  by 

•  Rolling  force  equation(p( ) :  Hill’s  approximation171 

p,  =  kpiQ nK ib,y[R^,  (# ,  -h,)  (1) 

•  Delivery  strip  thickness(  h, ),  Back  unit  tension(  rbi ) 

h>  =  s,  ® 

=  f  (|r (1  +  f,)VK  ~  d  +  /„.  )VRi_, )  (3) 

•  Forward  slip  coefficient  ft) :  Bland  &  Ford  equation1*1 

/(  =  *J>L<D2  (4) 

b, 

m  i  n, —  ^  “  7r“)0  ~  r. ) 

O  =  ~ -tan(  0.5  sin  _1  +  -  / — ^ —  In( - bJ- - ) 

S/ 

The  simulation  which  is  composed  the  empirical 
formulas  of  the  TCM  is  essential  to  the  analysis  of  the 
dynamic  characteristics,  such  as  nonlinearities  in  rolling 
process,  interstand  transport  delays,  interaction  between 
delivery  strip  thickness  and  back-tension.  Using  this 
simulation,  the  interrelation  between  input  and  output 
data  for  model  identification  and  the  effect  of 
uncertainties  to  TCM  can  be  determined.  The  set-up 
values  for  the  TCM  for  the  simulation  are  shown  in 
Table  1.  Input  variables  are  roll  gap  and  roll  speed, 
output  variables  are  delivery  strip  thickness  and  back  unit 


tension.  As  a  result  of  the  simulation,  we  show  the 
simulation  results  of  the  behavior  of  delivery  strip 
thickness  resulted  from  the  variation  of  roll  gap  and  the 
occurrence  of  roll  eccentricity.  The  simulation  was 
carried  out  in  MATLAB. 


Table  1.  The  set-up  values  for  the  TCM 


1  STD 

2  STD 

3  STD 

Entry  strip  thicknessfm/w) 

2.6 

2.12 

1.57 

Delivery  strip  thickness(mw) 

2.12 

1.57 

1.17 

Back  unit  tension (kzf/mm2) 

2 

12.7 

17.1 

Forward  unit  tension(/r^7ww2) 

12.7 

17.1 

10 

Work  roll  radius(mm) 

273 

273 

273 

The  variations  of  delivery  strip  thickness  resulted 
from  increase  of  roll  gap  to  +0.1mm(5%  from  set-up 
value)  at  0 sec  and  resulted  from  sinusoidal  roll 
eccentricity  of  work-roll  in  the  first-stand  are  shown  in 
Fig.  2  and  Fig.  3. 


Figure  2.  Time  response  of  delivery  strip  thickness 
in  the  each  stand 


Figure  3.  Time  response  of  delivery  strip  thickness 
in  the  each  stand 


As  a  result  of  this  simulation,  we  concluded  that  the 
variations  of  the  roll  gap,  the  roll  speed  and  the 
disturbances(such  as  the  roll  eccentricity,  the  variation  of 
the  entry  strip  thickness,  etc)  in  the  first  stand  affect 
greatly  the  delivery  strip  thickness  and  back  tension  in 
each  stand  of  the  TCM. 

3.  Model  identification 
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In  order  to  construct  the  mathematical  model  to 
design  the  IT  servo  controller,  we  identify  the  model  of 
the  TCM  using  the  MOESP  algorithm121.  In  especial,  to 
obtain  the  independent  model  of  each  stand  which  has 
natural  properties  of  the  TCM,  the  model  is  identified 
with  the  input  and  output  data  which  determined  from 
the  dynamic  analyses  in  the  chapter  2.  The  identified 
model  is  discrete-time  linear  time  invariant  state-space 
model  by 

x(k  + 1)  =  Ax{k)  +  Bu  ( k )  ^ 

y(k)=Cx(k)  +  Du(k) 


The  MOESP  algorithm  is  based  on  subspace  model 
identification.  From  input  and  output  data,  block  Hankel 
matrices  are  constructed,  state-space  model  A,  B,  C  and 
D  are  calculated  through  the  RQ  factorization  and 
SVD(Singular  Value  Decomposition)191. 

An  implementation  of  the  MOESP  algorithm  is  given  by 


Step  1  :  Construct  the  Hankel  matrices  Yk ,  Uk  from  the 


input  and  output  data. 


y{k) 

X*  +  v-i) 

y(k  + 1)  - 

y(k  +  N) 

X*  +  t- 0  - 

y(k  +  i  +  N-  2)_ 

u(k) 

u(k+N  - 1) 

uk  = 

u(k  + 1) 

u(k  +  N ) 

w(£  +  z'-l) 

u(k  +  i  +  N-2) 

(6) 

(7) 


matrices  Yk ,  Uk 


Step  2  :  Achieve  the  RQ  factorization  of  the  block  Hankel 

(8) 


(9) 


X' 

IX 

j_I  a" 

k, 

^22  _  0-2  _ 

Rn  g  SR{mtxmi\R2l  g  W  ’ ,R22  e 

Step  3  :  Compute  the  SVD  of  the  matrix  R22e  9t(/,x//) 


v2=[v  [v,1 


0  ’ 

i - 

0 

VJ 

3 

t- 

i _ 

Un  :  ( /;  x  n),U nl  :  (//  x  (//-«)),  I „  :  (n  x  n) 

Step  4  :  The  system  matrices  denoted  by  A,  B,  C  and  D 
can  be  computed  from  the  following  set  of  equations 

\D 
B 

L(:,l :  m) 

L(:,m  + 1 :  2m) 


C=U„Q..l-\U"’A  =  Unw,Xy=Xu®X® 


(10) 


L(:,m(z-1)  +  1 :  mi) 


UnL(\:l,  :)T 
£//(l  +  /:2/,:)r 


U„L  (l(i  - 1) + 1 :  :)T 
0 


7, 

0 


0 


where  una)  is  the  submatrix  composed  of  the  first  (/-!)/ 


rows  of  the  matrix  Un  and  u™  is  the  submatrix  of  the 
/+1  rows  to  li  rows  of  the  matrix  u  . 

n 

The  delivery  strip'  thickness  of  the  first  stand  model 
identified  from  the  input  data  (the  roll  gap  and  roll  speed 
of  the  first  stand)  and  output  data  (the  delivery  strip 
thickness  and  back  unit  tension  of  the  first  stand)  appears 
in  Fig.  4.  A  solid  line  is  the  delivery  strip  thickness  used 
to  identify  model  and  a  dotted  line  is  the  delivery  strip 
thickness  comes  of  identified  the  first  stand  model.  The 
error  between  two  thicknesses  in  Fig.  4  is  shown  in  Fig. 


5. 


Figure  4.  The  delivery  strip  thickness  of  the  identified  model. 


VV  W">M4i'1r  V  i‘  VrV 


Time(sec) 


Figure  5.  The  error  of  two  delivery  strip  thicknesses. 


The  identified  model(the  nominal  model)  of  the  first 
stand  is  given  by 


0.9959 
4.745  x!0“3 


0.10  ' 
0.8839 


3.98x10  : 
-4.61  xlO'2 


0.465  -0.59 
0.534  0.462 


-7.46x1  O'5 
-3.42  x  10‘7 


1.19xl0'5  ' 
-1.38xl0'5 
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-2.3x1  O'10 


The  identification  of  the  uncertain  model  achieves  by 
using  the  input  and  output  data  containing  uncertainties, 
and  the  modeling  error  can  be  determined  by  calculating 
the  error  between  the  nominal  model  and  the  uncertain 
model.  The  maximum  singular  value  plot  of  the 
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modeling  errors(such  as  the  variation  of  coefficient  of 
friction  between  roll  and  strip,  the  variation  of 
deformation  resistance  of  strip)  and  the  plot  of  the 
disturbance(such  as  the  variation  of  entry  strip  thickness, 
roll  eccentricity)  are  shown  in  Fig.  6  and  Fig.  7.  In  next 
chapter,  stable  weighting  functions  for  designing  H00  servo 
controller  can  be  determined  by  these  maximum  singular 
value  plots. 


Figure  7.  The  maximum  singular  value  of  the  disturbance. 


4.  H  servo  controller  for  TCM 

The  H00  servo  problem  for  designing  the  controller 
which  achieves  the  robust  tracking  property(the  delivery 
strip  thickness  robustly  tracks  the  command  input,  i.e. 
the  given  set-up  value  without  steady-state  error)  is 
defined  as  follows131. . 
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0 

Figure  8.  The  H00  servo  problem. 


(11) 


[  H“  servo  problem  J 

Consider  a  two  degree-of-freedom  control  system 
depicted  in  Fig.  8.  For  a  given  generalized  plant  Gd(z)  in 

(1 1)  and  a  reference  model,  find  a  controller  Kd(z) 


u  =  Kd 


r 

y 


(si)  Kd(z )  internally  stabilizes  Gd(z) 


(12) 


(s2)  II  TJz)  IU  <  y ,  7^(z) :  transfer  function  from  w  to  z. 
(s3)  Kd(z)  achieves  the  robust  tracking  property  for  the 


reference  model 


Kd(z)  denotes  the  controller  to  be  designed,  w(t)  e 

^  1  denotes  the  disturbance  input,  u(t)  eRm2  denotes  the 
control  input,  z(t)  e  RPl  denotes  the  controlled  output, 
y(t)  e  RPl  denotes  the  measured  output,  and  r(t)  e  RPl 
denotes  the  reference  input,  respectively. 

The  H00  servo  problem  is  modified  as  an  usual  H00  control 
problem  with  internal  model  E(z)  which  contains  the  mode 

of  reference  input  model.  In  this  paper,  we  consider  the 
case  of  step  reference  input,  hence  we  can  set  i(z)  and 
the  modified  generalized  plant  Qd{z)  by 


(13) 

(14) 


Figure  9.  The  H00  control  problem  equivalent  to  the  TT 
servo  problem. 


Consequently,  we  will  solve  the  following  IF  control 
problem  whose  solvability  is  equivalent  to  the  YF  servo 
problem; 

[  H  control  problem  ] 

For  a  given  generalized  plant  Gd(z )  in  (11)  and  a 

reference  input  r,  construct  the  modified  generalized 
plant  Gd(z)  in  (14).  Find  a  controller  gd(z)  satisfying 
the  following  two  specifications 

(pl)  Kd{z)  internally  stabilizes  Gd(z) 

(P2)  ||  Tzw(z)\U<j 

The  modified  generalized  plant  in  (14)  does  not 
satisfy  the  assumptions  of  DGKF16^  ,  e.g.  G12(z)  has  an 
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imaginary  axis  zero  and  Z)21  has  not  full  row  rank. 
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Hence,  the  LMI-based  solution  to  the  H00  control  problem 
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AR 

is  applied  to  solve  our  H00  control  problem  in  this 
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[  LMI-based  H00  control  ] 

The  set  of  H00  controllers  with  closed-loop 
performance  y  can  be  implicitly  parameterized  by  the 
solutions  of  the  system  of  LML  These  solutions  of  LMI 
play  a  role  analogous  to  that  of  the  Riccati  solutions  in 
ARE-based  H00  control.  For  a  given  discrete-time 
generalized  plant  Gd(z)  in  (14)  and  the  performance  y  , 

the  discrete-time  FT  control  problem  is  solvable  if  and 
only  if  there  exist  two  symmetric  matrices  R,  S  e  Rnxn 
satisfying  the  following  system  of  LMIs. 


Step  3  :  Factorize  I  -RS  as  MNT ,  with  M  and 
N  invertible,  and  compute  AK,BK, CK  by  solving 

NBk  =  -SB2Dk  +@/  (21) 

CkM  t  =  ~DkC2R  +  0C  (22) 

-NAkMt  =  SB2Sc  +  S£tC2R  +  S (A  - B 2D kC 2)R 
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(17) 


where  Nu  and  N2\  denote  orthonormal  bases  of  the 
null  spaces  of  (b2t  ,  D12r)  and  (c2 ,  Dn\ 

Given  any  (R,  S)  satisfying  (15)-(17),  a  full-order  y- 
suboptimal  controller  K(z)  =Dk  +Ck(sI  -  AK)~ 1 BK  is 
obtained  as  follows. 

Step  1  :  Compute  any  solution  D l  of  X,  >  0,  with  X  given  by 


A  =  Ar  :=  - 


0 


0  A  +  B2DjrC2 
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Step  2  :  Compute  the  least-norm  solutions  f  ,  (®: 
of  the  linear  equtions 
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[  Simulation  results  ] 

Now,  we  design  the  H00  servo  controller  which 
satisfies  robust  stability  of  modeling  errors,  restraints 
disturbances  and  achieves  the  robust  tracking  property  of 
the  delivery  strip  thickness.  In  this  paper,  we  consider 
only  the  first  stand  of  the  TCM  to  design  the  thickness 
controller.  The  generalized  plant  is  shown  in  Fig.  10. 


Figure  10.  The  generalized  plant  for  designing  the  H00 
servo  controller  of  the  TCM 
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where  G0{z)  is  the  identified  model  of  the  first  stand  of 
the  TCM,  Wx(z),  W2{z)  are  the  weighting  functions, 
and  Kd(z)  is  the  H00  servo  controller  to  be  designed. 

The  Wy(z),  W2(z )  are  considered  in  the  form  of 
stable  weighting  function  in  frequency  domain  and 
determined  from  Fig.  6  and  Fig.  7. 
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The  comparison  of  delivery  strip  thickness  controlled 
the  IT  servo  controller  and  the  PID/FF  controller  based 
on  the  Taylor  linearized  model  is  appeared  in  Fig.  1 1 . 


0.DD0 

k  ;  . 

_ 

controller 

PID/PT  rentrollrr 

0.000 

!;  J 

;  f 

J 

^  j 

0.004 

iii  •;!  ii! 

« i 

;  j 

4  ;!  i 

^  0.00} 

s 

0 

< 

•0.00} 

i  ji<  i  -  ii  ii ,  $ ii , 

Igilii 

fii 
!■  •  !  •  • !!  ■ 
i*  *  * » ♦  i*  i 
<  *  i 

i(i  S|r:;::i 

figlli 

-0.004 

fi:  ’  jiij  ’ 

; !  u  t  ' 

::  ij  ’  ii '  j 

;!  '  •  *  ji  “  i  •  i !  If . 

•D.ODB 

•  ’  ii  ’  ij  '  ’ 

V 

!  i  :j  f  ; 

1  ;;  ¥  J 

-0.00« 

-0.01  . 

►  !  '  V 

\  1 

?  s 

0 

i  }  a 

4  B 

B  7 

*  «  10 

Time(sec) 

Figure  11.  The  variation  of  delivery  strip  thickness  of  H®  servo 
controller  and  PID/FF  controller 

A  dotted  and  a  solid  line  is  respectively  the  delivery 
strip  thickness  resulted  from  the  PID/FF  controller  and 
the  H®  servo  controller  when  the  disturbance,  the  roll 
eccentricity,  occurred  in  the  work-roll  of  the  first  stand. 
The  variation  of  the  delivery  thickness  of  H00  servo 
controller  is  within  ±1  jam,  it  shows  that  the  good 
performance  of  our  IT0  servo  controller  in  the  restraint  of 
disturbance. 


Time(sec') 


Figure  12.  The  variation  of  delivery  strip  thickness  of  FT  servo 
controller  and  PID/FF  controller. 

Specially,  we  show  the  comparison  of  the  tracking 
performance  between  the  H00  servo  controller  and  the 
PID/FF  controller  in  Fig.  12.  For  the  step  reference  input 
of  the  delivery  strip  thickness,  a  dotted  and  a  solid  line  is 
respectively  the  delivery  strip  thickness  of  the  PID/FF 
controller  and  the  H00  servo  controller.  As  shown  in  Fig. 
12,  the  steady  state  error  of  the  output  in  PID/FF 
controller  does  not  reduce  within  ±20/jm ,  but  the  case  of 
the  H00  servo  controller,  the  steady  state  error  of  the 


output  is  ±8.3 jum  and  the  settling  time  of  the  response  is 
0. 12sec. 

5.  Conclusion 

In  this  paper,  we  designed  the  H00  servo  controller 
which  satisfy  the  robust  stability  under  the  modeling 
errors,  restrain  the  disturbances  and  achieve  the  robust 
tracking  property.  The  generalized  plant  in  the  IT  servo 
controller  was  modified  by  the  internal  model  principle, 
the  LMI-based  H00  control  technique  was  applied  to 
design  the  H00  controller  for  the  modified  generalized 
plant.  Consequently,  it  was  shown  that  the  delivery  strip 
thickness  robustly  tracked  the  reference  input  although 
the  modeling  errors  and  disturbances  exist. 
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Abstract 

In  this  paper ,  we  propose  a  decentralized  motion 
control  algorithm  of  multiple  mobile  robots  handling 
an  object  in  coordination  like  humans.  First,  we  illus¬ 
trate  the  principle  of  the  control  algorithm  using  two 
mobile  robots  in  coordination,  and  propose  a  leader- 
followers  type  control  algorithm  for  multiple  robots  us¬ 
ing  the  concept  of  the  ’’Virtual  Leader”.  Experimen¬ 
tal  results  with  omni-directional  mobile  robots  will  il¬ 
lustrate  the  validity  of  the  proposed  control  algorithm. 
We  then  extend  the  algorithm  to  nonholonomic  mobile 
robots  driven  by  two  wheels  by  introducing  dual  caster 
action  to  followers.  Experimental  results  using  two 
tracked  mobile  robots  will  illustrate  how  the  algorithm 
works. 

1  Introduction 

When  we  would  like  to  transport  a  large  and  heavy 
object,  we  carry  it  in  cooperation  with  other  people. 
To  utilize  multiple  robots  in  coordination  is  a  natu¬ 
ral  extension  of  such  human  behavior  to  the  robots. 
In  this  paper,  we  propose  an  intelligent  decentralized 
control  algorithm  of  multiple  mobile  robots  handling 
an  object  in  coordination  like  humans. 

Much  research  has  been  already  done  for  the  motion 
control  of  multiple  manipulators  handling  an  object  in 
coordination  [l]-[6]e£c  .  However,  most  of  the  control 
algorithms  proposed  so  far  have  been  designed  for  the 
centralized  control  system,  that  is,  a  single  controller 
is  supposed  to  control  all  of  the  robots  in  a  centralized 
way.  The  centralized  control  system  could  handle  two 
or  three  manipulators  in  coordination,  but  it  is  not 
easy  to  control  many  mobile  robots  in  coordination 
because  of  its  implementation  problem  and  computa¬ 
tional  burden. 

In  this  paper,  we  develop  a  decentralized  control  al¬ 
gorithm  for  handling  a  single  object  by  multiple  robots 
in  coordination  like  humans.  The  algorithm  could  be 
applied  to  multiple  mobile  robots  in  coordination.  In 
the  proposed  control  system,  the  motion  command  of 
the  object  is  given  to  one  of  the  robots,  referred  to 
as  a  leader.  The  other  robots  are  referred  to  as  fol¬ 
lowers.  Each  follower,  which  is  controlled  by  its  own 
controller,  estimates  the  motion  of  the  object  com¬ 


manded  to  the  leader  to  transport  the  object  in  co¬ 
ordination  with  the  leader.  The  proposed  control  al¬ 
gorithm  is  experimentally  applied  to  omni-directional 
mobile  robots  first,  and  the  results  illustrate  the  valid¬ 
ity  of  the  proposed  control  algorithm.  The  algorithm 
is  also  extended  to  two  nonholonomic  mobile  robots 
driven  by  two  wheels  by  introducing  dual  caster  action 
to  the  follower.  Experimental  results  using  tracked  ve¬ 
hicle  will  illustrate  it’s  validity. 

2  Principle  of  Coordination 

2.1  Two  Robots  in  Coordination 

In  this  section,  we  explain  the  decentralized  control 
algorithm.  For  the  simplicity  of  explanation,  we  first 
consider  the  case  of  two  autonomous  omni-directional 
mobile  robots,  each  of  which  has  three  degrees  of  free¬ 
dom  of  motion.  We  assume  that  each  robot  is  con¬ 
trolled  by  its  own  controller  in  a  decentralized  way. 
The  desired  trajectory  of  the  object  is  given  to  the 
leader  and  the  follower  estimates  the  desired  motion 
of  the  object  commanded  to  the  leader. 

We  assume  that  each  robot  has  the  following  dy¬ 
namics  by  an  appropriate  controller  as  shown  in  Fig¬ 
ured. 

DAxlVKAxl  =  fl-flin  (1) 

DAxi  +  KAxi  =  fi  —  fin  (2) 

where  D  G  R3xS  and  K  G  R3x3  are  positive  definite 
matrices,  fi,fi  G  R3  be  a  forces  applied  to  the  robots, 
fin  ^  jin  £  fF  be  a  specified  internal  forces  applied 
to  the  object  by  the  robots,  and  Axi,Ax±  6  R3  be 
the  trajectory  deviations  of  the  robots  according  to 
the  forces  applied  to  the  robots.  The  subscripts  l  and 
1  indicate  that  the  variables  with  the  subscripts  are 
related  to  the  leader  and  the  follower  respectively. 

Let  Xd,xe i  G  R3  be  the  desired  trajectories  of  the 
leader  and  the  follower  respectively  and  x  G  R3  be 
the  real  trajectory  of  the  object.  Under  the  assump¬ 
tion  that  each  robot  holds  the  object  firmly  and  no 
relative  motion  between  the  object  and  each  robot  oc¬ 
curs,  these  deviations,  Axi  and  Ax\,  are  expressed  as 
follows; 

Axi  —  x  Xd  (3) 
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leader  1st  follower 


Figure  1:  Compliant  Motion  of  Robot 

Axx  =  x  -  xel  (4) 

Under  the  assumption  that  the  external  force  applied 
to  the  object  is  negligible,  the  relationship  between 
Ax i  and  Axi  is  expressed  as  follows  from  Eq.fl)  and 
(2); 


A  xi  +  Axi  =  0  (5) 

Eliminating  x  from  Eq.(3)  and  (4),  we  obtain 

Ax!  -  Ax;  =  xd  -  xei  (6) 

Let  Axdi  be  the  trajectory  estimation  error  of  the  fol¬ 
lower,  then,  Axdi  is  expressed  by 

A xdi  =xd-  xel  (7) 

From  Eq.(5),  (6)  and  (7),  Ax^i  can  be  written  as  fol¬ 
lows; 


Axdi  =  2Axi  (8) 

It  should  be  noted  that  the  follower  can  calculate  Axdi 
using  the  observable  variable  Ax2 . 

Let  us  consider  how  xd  is  estimated  using  Axrfl. 
Let  Gi  be  the  transfer  function,  which  estimates 
Xd,  as  xei,  based  on  A  Xdi  as  shown  in  Figure.2(a). 
From  Eq.(7),  Figure.2(a)  can  be  rewritten  as  a  feed¬ 
back  system  shown  in  Figure. 2(6).  To  eliminate  the 
steady-state  position  and  velocity  estimation  errors, 
the  transfer  function  Gi  is  designed  as  follows; 


Gi 


(Zis  +  bi 


(9) 


where  J3  €  R3xS  is  the  identity  matrix  and  a  and  b 
are  positive  real  numbers  to  keep  the  stability  of  this 
feedback  system. 

2.2  n+1  Robots  in  Coordination 

2.2.1  Forces  and  Trajectory  Deviations 

We  extend  the  result  in  the  previous  section  to  a  gen¬ 
eral  case.  First,  we  consider  the  relationship  among 
trajectory  deviations.  Under  the  assumption  that  ex¬ 
ternal  force  applied  to  the  object  is  negligible,  we  have 


^dl 

ri 

X£1 

ul 

(a) 


xd 
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(b) 


Figure  2:  Estimator 

the  following  relations  with  respect  to  the  forces  ap¬ 
plied  to  the  object; 


fi  +  E/»=0  (io) 

j= i 

n 

nn  +  J2fin=0  (ii) 

j= i 

As  time  tends  to  infinity,  we  obtain  the  following  rela¬ 
tionship  from  (1),  (2),  (10)  and  (11),  even  if  the  initial 
value  of  Ax i  +  A xj  is  not  zero. 


A  XI  +  Axj  —  0  (12) 

J  =  1 

2.2.2  Dynamics  of  Virtual  Leader 

It  is  impossible  for  the  i-th  follower  to  estimate  the 
desired  trajectory  of  the  leader  because  the  trajec¬ 
tory  deviation  of  the  z-th  follower,  which  was  used  for 
the  estimation  of  the  desired  trajectory  of  the  leader 
in  previous  case,  is  affected  by  motions  of  all  of  the 
robots.  Therefore,  for  the  z-th  follower,  the  robots 
are  classified  into  two  groups;  one  is  the  z-th  follower 
itself  and  the  other  is  the  rest  of  the  robots  includ¬ 
ing  the  leader.  In  this  paper,  we  refer  to  the  rest  of 
the  robots  as  the  z-th  virtual  leader.  The  z-th  vir¬ 
tual  leader  consists  of  the  leader,  and  j- th  followers 


Figure  3:  Virtual  Leader 
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( j  =  1, . . .  ,  z  —  1,2  +  1, . . .  ,  ra)  as  shown  in  Figure. 3. 
For  the  z-th  follower,  the  z-th  virtual  leader  behaves 
as  if  it  were  a  real  leader.  Using  the  concept  of  the 
virtual  leader,  the  z-th  follower  estimates  the  desired 
trajectory  of  the  z-th  virtual  leader  based  on  the  es¬ 
timation  algorithm  when  two  robots  handle  a  single 
object  in  coordination. 

We  can  derive  the  dynamics  of  the  z-th  virtual 
leader  as  follows; 

n  n 

D(Axi  +  ^2  Axj)  +  K(Axi  4-  ^2  Axj) 

=  (fi+  £  /j) -(/«*"+  £  in  (is) 

j=l(j^i) 

where 

n  i— 1  n 

£  ci  =  £ci+  £  ci  (i4) 

j= 1  j=i+ 1 

The  trajectory  deviation  of  the  z-th  virtual  leader  Axu 
is  expressed  as  follows; 


n 

Ax  u  =  Axi  +  ^2  Axj  (15) 


As  time  tends  to  infinity,  we  obtain  the  following 
relationship,  regardless  of  the  desired  trajectory  given 
to  the  leader  x&. 


A  xn  +  Axi  =  0 


(16) 


The  force  applied  to  the  z-th  virtual  leader  fu  and  the 
internal  force  applied  to  the  object  by  the  z-th  virtual 
leader,  //*,  are  expressed  as  follows; 
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fi+  £  fj 

(17) 

j=i(j^i) 

fin+  £  ST 

(18) 

j  =  l(jy£i) 


From  (10)  and  (11),  we  also  have 

fu  +  fi  =  0  (19) 

fir  +  ft  =  o  (20) 


From  (15),  (17)  and  (18),  (13)  is  rewritten  as 

DAxu  +  KAxh  -  fH  -  f^  (21) 


It  should  be  noted  that  (21)  expresses  the  behavior  of 
the  z-th  virtual  leader. 

Let  Xdi  be  the  desired  trajectory  of  the  z-th  virtual 
leader.  Then,  Axu  is  expressed  as 


Ax  a  =  x  -  xdi 


(22) 


Figure  4:  Free  joint 


Figure  5:  Coordinate  system 
From  (15)  and  (22),  x<u  is  written  as 

n 

Xdi  **“*  Xd  ^  ^  Axj  (25) 

j=i(jV0 


2.2.3  Estimation 

Using  the  concept  of  virtual  leader,  the  z-th  follower 
can  estimate  the  desired  trajectory  of  the  z-th  virtual 
leader  Xdi  based  on  the  estimation  algorithm  which 
we  discussed  in  the  previous  section,  because  (21)  and 
(22)  have  the  form  as  (1)  and  (3). 

We  define  Axdi  as  the  estimation  error  of  the  z- 
th  virtual  leader  estimated  by  the  follower.  Axdi  is 
expressed  as 


^^X  di  —  Xdi  Xqi 

=  2A  Xi  (24) 

It  should  be  noted  that  Axdi  is  calculated  by  each  fol¬ 
lower  based  on  the  observable  state  of  each  follower 
Axi .  Using  this  Axdi  and  the  transfer  function  ma¬ 
trix  Gi  which  we  design  in  previous  section,  the  z-th 
follower  can  estimate  the  desired  trajectory  of  the  z-th 
virtual  leader  as  follows; 

Xei  -  GiAXdi  (25) 

We  design  the  transfer  function  matrix  G{  similar  to 
the  case  of  previous  section.  The  transfer  function 
matrix  Gi  is  expressed  as  follows; 


Gi  = 


CLi$  “h 


(26) 


3  Extension  to  Mobile  Robots  Driven 
by  Two  Wheels 

The  decentralized  control  algorithm  discussed  in 
the  previous  sections  has  been  proposed  for  robots  un¬ 
der  holonomic  constraints.  This  decentralized  control 
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algorithm  cannot  be  applied  to  the  mobile  robots  un¬ 
der  nonholonomic  constraints  because  nonholonomic 
robots  cannot  generate  velocity  in  all  directions.  In 
this  section,  we  extend  this  decentralized  control  al¬ 
gorithm  to  the  nonholonomic  mobile  robot.  Consider 
a  mobile  robot  driven  by  two  wheels  as  shown  in  Fig¬ 
ured.  Define  the  robot  coordinate  system  as  shown  in 
Figure. 5.  The  motion  of  the  robot  is  characterized  by 
the  velocity  in  its  heading  direction  x  and  the  orien¬ 
tation  of  the  robot  6.  By  appropriately  specifying  the 
velocity  of  each  wheel,  we  can  specify  the  translational 
motion  velocity  and  the  rotational  motion  velocity  6 . 

3.1  Outline  of  System 

The  control  algorithm  in  the  previous  section  was 
designed  under  the  assumption  that  each  robots  holds 
the  transported  object  firmly.  In  case  of  the  mo¬ 
bile  robots  under  nonholonomic  constraints,  the  robot 
could  not  move  on  the  ground  freely  if  each  robot  holds 
the  transported  object  firmly.  To  solve  the  problem, 
we  assume  that  each  robot  holds  the  object  through  a 
free  rotational  joint,  which  is  located  in  the  middle  of 
both  wheels,  as  shown  in  Figure. 4. 

The  mobile  robots  driven  by  two  wheels  cannot  be 
controlled  so  as  to  have  the  dynamics  given  by  Eq.(l) 
and  (2),  because  it  cannot  generate  velocity  in  all  di¬ 
rections.  We  consider  to  use  the  algorithm  in  the  pre¬ 
vious  section  only  along  the  translational  motion  di¬ 
rection.  The  motion  command  is  given  to  the  leader 
and  the  follower  estimates  the  desired  trajectory  given 
to  the  leader  along  the  translational  motion  direction 
of  the  follower.  For  the  rotational  motion  of  the  robot, 
we  use  the  idea  of  the  caster  action  proposed  by  D.  J. 
Stillwell  et  al.  We  extend  the  caster  dynamics  and 
propose  the  dual  caster  action. 


(a) 


follower 


(b) 


Figure  7:  Compliance  of  each  robot 


In  our  system,  each  robot  holds  the  object  through 
a  free  rotational  joint,  which  is  located  in  the  middle  of 
both  wheels,  as  shown  in  Figure.4.  The  caster  action 
is  realized  by  a  control  software.  To  mimic  the  motion 
of  the  real  caster,  the  follower  is  controlled  so  as  to 
have  the  following  dynamics. 


3.2  Dual  Caster  Action 

The  use  of  the  caster  dynamics  for  the  tracked  vehi¬ 
cle  coordination  was  proposed  by  D.  J.  Stillwell  et  al. 
He  proposed  to  control  a  vehicle  as  if  it  has  a  caster 
dynamics.  The  real  caster  has  an  offset  00ff  between 
the  axis  of  the  wheel  and  the  free  joint  as  shown  in  Fig- 
ure.6.  The  caster  turns  to  the  direction  of  the  force 
applied  to  the  caster  by  this  offset. 


Dcast^i  —  Coff(—fix  sin#!  +  fiy  cos 9 1 )  (27) 

This  caster  dynamics  is  realized  by  specifying  the 
angular  velocity  of  the  follower  6X  so  as  to  satisfy  the 
Eq.(27).  Where  fXx  and  fXy  are  the  forces  applied 
to  the  follower  with  reference  to  the  base  frame  Yjb 
as  shown  in  Figure. 7,  0X  is  the  follower  orientation 
with  reference  to  the  base  frame  and  Dcast  is  the 
parameter  expressing  the  characteristic  of  the  caster. 


(b)  (c) 


Figure  6:  Real  caster  action 


Figure  8:  Dual  caster  action 
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Figure  9:  Experimental  Result 


Dcast  and  CQff  are  positive  real  numbers. 

As  long  as  flx  cos  9X  +  fiy  sin  6X  >  0  as  shown 
in  Figure. 6 (6),  the  follower  adjusts  its  motion  direc¬ 
tion  with  the  minimum  rotational  motion,  but  when 
fix  cos  9 1  +  fiy  sin^i  <  0  as  shown  in  Figure. 6(c),  the 
follower  rotates  more  than  90  degrees.  The  rotational 
motion  more  than  90  degrees  brings  the  follower  into 
a  singular  configuration  of  this  transportation  system 
in  which  the  direction  of  the  follower  is  perpendicular 
to  that  of  the  leader.  The  follower  could  not  generate 
any  velocity  along  the  motion  direction  of  the  leader. 

To  avoid  this  problem,  we  consider  to  control  the 
follower  so  as  to  have  two  different  caster  actions. 
That  is,  we  change  the  position  of  the  virtual  caster 
wheel  according  to  the  sign  of  flx  cos  9i  -f-  fly  sin  9X  as 
follows; 


DCast@ i  —  G(  fix  sin 9\  T  f\yCos9i)  (28) 

where 


f  C0f  f ,  for  flx  cos  6X  +  fiy  sin  0i  >  0 
1  ~Coff ,  for  flx  cos  9i  +  fly  sin  9X  <  0 


(29) 


Since  the  follower  is  controlled  by  two  different  caster 
actions,  the  action  of  the  follower  is  referred  to  as  the 
dual  caster  action.  Using  the  dual  caster  action,  the 
follower  can  align  its  orientation  with  the  direction 
of  the  force  applied  to  the  robot  without  unnecessary 
over  action.  The  concept  of  dual  caster  action  is  illus¬ 
trated  in  Figure. 8. 


4  Experiment 

4.1  Experimental  Results  using  Holo- 
nomic  Mobile  Robots 

We  did  experiments  using  three  autonomous  omni¬ 
directional  mobile  robots,  ZEN,  developed  by  RIKEN 
[9].  Each  mobile  robot  has  three  degrees  of  freedom  of 
motion  and  equip  with  the  Body  Force  Sensor  which 
is  the  force/torque  sensor  located  between  the  drive 
mechanism  and  the  body  of  the  mobile  robot  [10].  The 
leader  was  given  the  desired  trajectory  and  each  fol¬ 
lower  estimated  the  desired  trajectory  of  its  own  vir¬ 
tual  leader  using  the  algorithm  proposed  in  the  previ¬ 
ous  section.  The  control  algorithm  was  implemented 
using  VxWorks.  The  sampling  rate  was  1024Hz.  One 
of  the  results  is  shown  in  Figure. 9 

4.2  Experimental  Results  Using  Nonholo- 
nomic  Mobile  Robots 

The  extended  decentralized  control  algorithm  was 
implemented  in  the  experimental  system,  which  con¬ 
sists  of  two  tracked  mobile  robots.  Each  mobile  robot 
held  the  object  through  a  free  joint.  Given  the  desired 
trajectory  to  the  leader,  the  follower  was  controlled 
based  on  the  dual  caster  action  which  we  proposed 
in  Eq.(28),  and  estimated  the  desired  trajectory  given 
to  the  leader  along  the  translational  motion  direction 
using  the  estimator  illustrated  in  Figure. 2.  The  con¬ 
trol  algorithm  was  implemented  using  VxWorks.  The 
sampling  rate  was  1024Hz.  One  of  the  results  is  shown 
in  Figure.  10 
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(a)  0  [sec]  (b)  3  [sec]  (c)  6  [sec] 


(d)  9  [sec] 


(h)  21  [sec] 


(k)  30  [sec]  (1)  33  [sec] 


Figure  10:  Experimental  Result 


5  Conclusions 

In  this  paper,  we  proposed  a  decentralized  motion 
control  algorithm  of  multiple  mobile  robots  in  coor¬ 
dination.  The  algorithm  realizes  a  natural  robot  be¬ 
havior  for  handling  a  single  object  in  coordination. 
The  robots  handle  an  object  in  coordination  although 
only  the  leader  knows  how  to  move.  The  collective  be¬ 
havior  was  realized  through  the  estimation  of  leader’s 
motion  by  each  follower  based  on  information  obtained 
at  each  follower.  The  algorithm  was  extended  to  non- 
holonomic  mobile  robots  by  introducing  the  concept 
of  the  dual  caster  action.  Experimental  results  illus¬ 
trated  the  proposed  algorithms. 
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Abstract 

ROBOPAINT,  the  first  multi-robot  glass  brick  painting 
system  in  the  world  has  been  designed  and  developed  by 
Altinay  Robotics  &  Automation  Inc.  The  system  was 
planted  and  is  currently  functioning  successfully  in  the 
Mersin  Factory  of  $i§ecam  A.S.  ROBOPAINT  automates 
the  whole  production  line  starting  from  the  cooling 
system  to  the  drying  oven .  This  system  takes  in  raw  glass 
bricks,  passes  them  through  certain  processes  and 
produces  a  final  usable  product  at  a  rate  of  10  pieces  per 
minute  which  is  the  production  speed  of  the  raw  glass 
bricks.  This  robotic  system  consists  of  four  subsystems 
including  2  SCARA  robots  and  1  cartesian  robot. 

1 •  Introduction 

The  development  of  robots  and  automation  systems  for 
the  industry  is  due  to  two  main  reasons.  The  first  one  is  to 
keep  people  out  of  unhealthy  and  hazardous  environments 
and  free  them  from  performing  “3-d”  tasks  (dirty,  dull, 
dangerous).  The  second  reason  is  to  be  able  to  produce 
standard  products  with  high  quality.  Painting  application 
represents  one  of  the  branches  which  have  rapidly 
switched  to  robotized  and  automated  systems.  The 
objective  of  this  project  has  been  to  design  a  painting 
system  where  the  final  product  will  be  packed  without 
being  touched  by  a  human,  therefore  having  no  fingertips 
on.  ROBOPAINT  is  the  only  robotized  glass  brick 
painting  system  in  the  world  and  has  the  ability  to  handle 
glass  bricks  of  different  dimensions. 

As  a  multi-robot  system  ROBOPAINT  possesses  the 
advantages  of  such  systems  namely  if  one  of  the  robots 
fails,  the  others  can  compensate  for  it,  and  thus  the  system 
is  reliable  and  flexible. 

2.  Painting  Applications 

The  challenge  of  building  a  spay  painting  booth  in 
conformance  with  all  safety  and  health  codes  makes  spray 
painting  a  natural  application  for  industrial  robots.  [1]  In 
addition,  robots  are  able  to  achieve  a  level  of  consistency 
that  is  difficult  to  expect  human  spray  painters  to 
duplicate.  Although  a  skilled  spray  painter  is  required  the 
first  time  to  teach  the  robot  a  painting  task,  once  taught, 


the  robot  will  do  the  spray  painting  operation  repeatedly, 
with  a  consistency  unattainable  by  the  same  expert  who 
taught  the  robot  the  first  time  through. 

According  to  the  robot  statistics  given  by  UN  and  IFR 
(International  Federation  of  Robotics)  [2]  dispensing 
applications  represent  3%  of  all  robotic  applications  and 
painting  applications  represent  66%  of  ail  dispensing 
applications. 

Spray  painting  methods  fall  mainly  into  three 
categories:  manual,  hard  automation  and  robotized 
painting  systems.  The  last  one  is  the  hardest  to  implement 
but  the  most  useful  and  flexible. 

3.  System  Structure  of  ROBOPAINT 


<D  :  Glass  Brick  Rotating  System 

<D  :  Glass  Brick  Donating  System 

.  @  :  Robotized  Painting  System 

Figure  1  CAD  View  of  the  Whole  ROBOPAINT 
System 

The  design  of  the  system  components  shown  in  figure  I 
was  done  using  I-DEAS  CAD  software.  During  this 
project  the  first  SCARA  robot  used  for  industrial  purposes 
in  Turkey  ASC-40  was  designed  and  manufactured. 

ROBOPAINT  consists  of  autonomously  functioning 
units.  The  electromechanical  subsystems  can  be  classified 
as  follows: 
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•  Glass  Brick  Transferring  System 

•  Glass  Brick  Rotating  System 

•  Glass  Brick  Donating  System 

•  Robotized  Painting  System 

3.1  Glass  Brick  Transferring  System 

The  Glass  Brick  Transferring  System  consists  of  a 
cartesian  robot  (AKR-40)  having  maximum  capacity  of  40 
kg,  1  servo  and  4  pneumatically  controlled  axes  and  a 
variator  controlled  L-shaped  transferring  conveyor  as 
shown  in  figure  2.  The  robot  hand  is  controlled  by  the 
PLC  and  can  hold  the  glass  bricks  firmly  without 
damaging  them. 

The  glass  bricks  are  being  produced  at  rate  of  10 
pieces  per  minute.  The  robot  transfers  the  glass  bricks 
coming  from  the  cooling  system  in  groups  of  fifteen  onto 
the  transferring  conveyor  one  by  one.  The  motion  of  the 
conveyor  is  continuous  and  the  robot  leaves  the  glass 
bricks  onto  it  while  it  is  moving. 


Picture  and  Real  Photo 


3.2  Glass  Brick  Rotating  System 

The  Glass  Brick  Rotating  System  is  a  4-DOF 
pneumatically  controlled  system  placed  at  the  end  of  the 
Glass  Brick  Transferring  System.  The  glass  bricks  are 
rotated  and  transferred  to  the  glass  brick  donating 
conveyor  in  a  position  suitable  for  the  robots  to  handle 
them. 

When  the  glass  brick  reaches  the  stopper  at  the  end  of 
the  transferring  conveyor,  the  glass  bricks  are  stopped, 
when  the  sensor  placed  on  the  stopper  identifies  the  glass 
brick  the  pneumatic  piston  approaches  the  glass  brick, 
grabs  it  and  after  rotating  it  90°  leaves  it  onto  the  up-and- 
down  moving  plate  which  moves  it  down  to  the  conveyor 
height. 

There  are  4  different  movements  in  the  glass  brick 
rotating  system. 


Figure  3  Glass  Brick  Rotating  System  CAD  Picture 
and  Real  Photo 


3.3  Glass  Brick  Donating  System 

The  Glass  Brick  Donating  System  consists  of  a 
conveyor  and  indexing  units.  There  are  4  pneumatically 
controlled  donating  systems-2  for  each  robot.  The  glass 
bricks  are  indexed  with  respect  to  4  indexing  points  and 
are  fed  to  the  robot  hands.  The  glass  bricks  are  released 
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just  when  it  has  been  ensured  that  the  robot  hand  can  hold 
them  safely. 


Although  the  indexing  systems  are  working 
independently,  there  is  no  difference  between  their 
movements. 


Figure  4  Glass  Brick  Donating  System-CAD  Picture 
and  Real  Photo 


3.4  Robotized  Painting  System 

The  Robotized  Painting  System  consists  of  two  5-DOF 
SCARA  type  industrial  robots  (Altinay  ASC-40)  and  their 
special  hands.  When  the  robot  hands  are  closed  just  the 
side  surfaces  of  the  glass  bricks  are  uncovered  and  can  be 
painted.  The  robot  hands  have  been  specially  designed  for 
this  application  and  besides  the  control  axes  of  the  robots, 
they  have  two  pneumatically  controlled  axes. 

The  1st,  2nd  and  3rd  axes  are  performing  rotary  motion 
and  span  a  circular  area  in  the  coordinate  system.  The  4th 
axis  rotates  the  fist  part  of  the  robot  hand  and  the  fifth  axis 
rotates  the  second  part  of  the  robot  hand. 

The  up-down  movement  of  the  robot  hand  covers  is 
pneumatically  controlled  and  the  movements  of  the  two 
robots  are  symmetric. 

The  structure  and  properties  of  the  ASC-40  robot  will 
be  given  in  detail  in  the  following  sections. 

Although  this  robot  was  designed  for  the  application 
mentioned  above,  ASC-40  can  be  used  in  various  other 
industrial  applications  such  as  material  handling, 
assembly,  material  transfer,  etc. 


Figure  5  Robotized  Painting  System 


4.  The  Industrial  Robots  Used  in  the  Project 

Two  types  of  industrial  robots  have  been  used  in  the 
ROBOPAINT  project,  namely  the  cartesian  robot  used  in 
the  Glass  Brick  Transferring  System  (AKR-40)  and  the 
SCARA  type  robots  used  in  the  Robotized  Painting 
System  (ASC-40). 

4.1  Altinay  AKR-40  Robot 

The  AKR-40  robot  takes  the  glass  bricks  from  the 
cooling  conveyor  and  leaves  them  onto  the  transferring 
conveyor.  The  properties  of  the  robot  and  its  working 
range  are  as  given  in  Figure  6  and  Figure  7. 


Figure  6  Working  Range  of  the  Altinay  AKR-40 
Robot 
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Model  Name 

AKR-40 

Architecture 

Cartesian 

Degree  of  Freedom 

1 +4 

Motor  Driving  System 

Brushless  AC  Servo 

Motor  Driving  Capacity 

2000  W 

Working 

Range 

Arm 

1.  axis 

3500  mm 

2.  axis 

700  mm 

3.  axis 

120  mm 

4.  axis 

±  185  deg 

5.  axis 

±  65  deg 

Speed 

Arm 

/.  axis 

3000  mm/sec 

Maximum  Payload 

40  kg 

Repeatability 

±0.5  mm 

Robot  Arm  Working  Range 

9.4  m3 

Ambient  Temperature  and  Hum. 

0-40%: 

Weight 

500  kg 

Figure  7  Properties  of  AKR-40  Robot 


4.2  Altinay  ASC-40  Robot 

The  ASC-40  scara  robot  is  active  through  the  painting 
process.  The  robot  hands  cover  the  glass  bricks  while  the 
robots  are  in  the  painting  cabin  leaving  just  the  surfaces  to 
be  painted  uncovered.  While  the  glass  bricks  are  being 
painted  the  robot  hands  turn  360°,  4  times  90°  so  that  the 
complete  circumference  of  the  glass  bricks  is  completely 
painted. 

The  properties  of  the  ASC-40  robot  are  as  given  in 
Figure  8  and  the  working  range  of  the  robot  is  given  in 
Figure  9. 


Model  Name 

ASC-40  \ 

Architecture 

Scara 

Degree  of  Freedom 

5 

Motor  Driving  System 

Brushless  AC  Servo 

Motor  Driving  Capacity 

7000  W 

Working 

Range 

Arml 

1.  axis 

200  deg 

2.  axis 

240  deg 

Wrist 

3.  axis 

210  deg 

4.  axis 

Unlimited 

5.  axis 

Unlimited 

Operation 

Speed 

Arm 

1.  axis 

166  deg/sec 

2.  axis 

166  deg/sec 

Wrist 

3.  axis 

166  deg/ sec 

4.  axis 

270  deg/sec 

5.  axis 

270  deg/sec 

Maximum  Payload 

25-40  kg 

Repeatability 

±0.5  mm 

Robot  Arm  Working  Range 

9.4  m 3 

Ambient  Temperature  and  Hum. 

0-40  %!,20~80%RH 

Weight 

-800  kg 

Figure  8  Properties  of  ASC-40  Robot 


Figure  9  Working  Range  of  the  Altinay  ASC-40  Robot 

At  the  design  stage  of  the  robots  the  properties  have 
been  defined  by  the  requirements  of  the  application,  that 
is  why  the  motors  and  the  reduction  gears  have  been 
selected  by  calculating  the  essential  limit  values 
accordingly. 

5.  The  Mathematical  Model  of  the  Painting 
Operation 

In  order  to  be  able  to  develop  a  mathematical  model 
for  the  painting  process  a  lot  of  strategies  have  been  tried. 
The  aim  is  to  develop  a  tool  that  can  calculate  the  optimal 
painting  trajectory.  [3].  The  geometry  of  the  paint 
particles  is  as  shown  in  Figure  10. 


Figure  10  The  Geometry  of  the  Paint  Distribution 


The  ortonormal  unit  vectors  Ui...u3  fix  the  paint  gun 
orientation.  The  figure  shown  above  is  formed  of  aerosol 
and  when  the  aerosol  reaches  the  surface  to  be  painted  the 
paint  particles  stick  onto  the  surface.  If  the  paint 
distribution  area  j(r,  u!...u3)  can  be  determined  a 
prediction  using  multivariable  differential  theorems  can  be 
made. 

j(r,  u1...u3)=Q0V  /  (4Dz)exp(-VR2  /  (4Dz))u3 

Q0:  constant, 

V :  the  speed  of  aerosol, 

R:  Distance  from  u3 

D:  Diffusion  coefficient 
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6.  Conclusion 


7.  References 


ROBOPAINT  is  an  innovative  system  in  several 
aspects: 

The  idea  is  new:  this  is  the  world’s  first  multi-robot 
glass  brick  painting  system. 

The  product  developed  for  this  system  is  new:  ASC- 
40  -  the  first  SCARA  robot  of  Turkey  was  designed  and 
manufactured  by  Altinay  for  this  project. 

The  final  product  of  the  system  is  new:  the 
production  of  glass  bricks  in  Turkey  started  with  the 
completion  of  ROBOPAINT. 


[1]  Asfahl,  C.  R.,  Robots  and  Manufacturing  Automation,  John 
Wiley  and  Sons  Inc.,  Singapore,  1992 

[2]  World  Industrial  Robots  1997 ,  United  Nations,  Geneva, 
1997 

[3]  Tahrali,  E.,  “A  Simulation  Method  in  Robotized  Painting'\ 
Istanbul  Technical  University,  Ms.  Thesis.  Istanbul,  June  1998 
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Abstract 

An  active  worktable  for  force  control  of  robot 
manipulators  is  designed  and  fabricated.  The  active 
worktable  has  several  degrees  of  freedom  and 
accommodates  its  position/force  in  accordance  with  the 
motion  of  a  robot.  The  worktable  is  controlled,  based  on 
impedance  control  methods,  to  achieve  compliant 
motions.  Several  experiments  are  carried  out  to  confirm 
basic  effectiveness  of  the  active  worktable. 


1.  Introduction 

Compliant  motion  control  is  considered  to  be  a  key 
technique  for  intelligent  motion  control  of  robot 
manipulators.  Numerous  control  schemes  have  been 
proposed  for  force  control  of  robot  manipulators  [1], 
However,  applications  to  practical  tasks  have  not  readily 
progressed.  One  of  the  reasons  is  that  most  industrial 
robots  to  date  have  been  developed  on  the  basis  of 
position  control  schemes  and  it  is  difficult  to  change  the 
controller  for  force  control. 

An  approach  of  adding  extra  degrees  of  freedom  to  a 
robot  manipulator  is  one  of  the  practical  solutions.  Based 
on  the  idea,  active  devices  to  provide  the  compliance  to 
an  end  effector  have  been  developed  [2,3],  The  active 
end  effectors  accommodate  the  position  and  force, 
according  as  the  change  of  environment.  However,  the 
weight  of  an  end  effector  becomes  a  load  to  the 
manipulator  and  the  design  is  restricted  by  the  payload 
limitation  of  robot  manipulators. 

This  article  discusses  an  approach  to  give  the 
compliance  to  a  workpiece,  instead  of  giving  the 
compliance  to  manipulators.  To  make  practical  the 
approach,  an  active  worktable,  which  provides  active 
compliance  to  a  workpiece,  is  proposed  here.  The 
worktable  has  several  degrees  of  freedom  and  can 
perform  compliant  motion  tasks  independently  to  the 
robot  motion. 

Developed  active  worktable  has  architecture  based  on 
a  parallel  link  mechanism  powered  by  DD  (direct  drive) 
motors.  Since  DD  motors  have  many  advantages  such  as 


large  torque,  low  friction  and  low  torque  ripple,  the 
motors  are  suitable  for  force  control  tasks.  Also,  the 
parallel  link  mechanism  can  solve  problems  of 
supporting  heavy  DD  motors. 

In  section  2,  a  design  concept  of  the  active  worktable 
is  presented.  Then,  the  control  systems  design  for 
achieving  the  active  compliance  is  discussed  in  section  3. 
To  confirm  the  effectiveness  of  the  system,  several 
experimental  results  using  prototype  active  worktable  are 
shown  in  section  4. 

2.  Active  worktable 

Fig.l  illustrates  the  concept  of  an  active  worktable. 
The  worktable  holds  a  workpiece  and  accommodates  its 
horizontal  and  vertical  positions.  The  table  is  driven 
actively  to  give  the  compliant  motion  to  a  workpiece. 

Consider  the  grinding  or  polishing  a  surface  by  a 
robot  manipulator,  for  example,  it  is  necessary  to  design 
a  system  with  large  compliance  (small  stiffness)  in  the 
normal  direction,  and  small  compliance  (large  stiffness) 
in  the  tangential  direction.  The  active  worktable  can 
change  the  compliant  direction,  depending  on  the  surface 
profile  of  a  workpiece. 

Using  the  active  worktable  together  with  an  industrial 
robot,  we  can  achieve  the  compliant  motion  tasks  without 
any  change  of  the  robot  controller. 

Fig-2  shows  the  mechanism  of  the  active  worktable. 


Fig.  1  Concept  of  an  active  worktable. 
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L 


turntable  (planned) 


Fig.  2  Two  degrees  of  freedom  parallel 
link  mechanism. 

The  table  has  a  2  d.o.f.  parallel  link  mechanism  driven 
by  two  DD  motors.  The  mechanism  converts  rotary 
motion  of  two  motors  into  translational  movement  of  the 
table.  The  motion  is  restricted  within  a  vertical  plane. 
However,  for  3D  applications,  a  turntable  on  the  table 
enables  the  adjustment  of  horizontal  orientation  of  a 
workpiece. 

Based  on  the  concept,  a  prototype  active  worktable  is 
designed  and  fabricated  as  shown  in  Fig.  3.  Two  DD 
motors  are  placed  in  both  side  of  the  parallel  link 
mechanism.  The  length  of  links  is  176mm  and  work  area 
of  the  table  is  approximately  8cm  square. 

The  specifications  of  the  DD  motor  are  listed  in  Table 
1.  As  the  gravity  load  is  fairly  balanced  by 
counterweights  attached  at  motor  links,  the  maximum 
power  of  the  active  worktable  is  240  N. 


Maximum  torque 

30  Nm 

Friction  torque 

0.29  Nm 

Maximum  speed 

1.25  rps 

Angular  resolution 

491520  P/rev 

Rotor  inertia 

0.018  Kgm2 

Table  1  Specifications  of  DD  motor  (NSK  SSB030). 


3.  Control  of  the  active  worktable 
3.1.  Stiffness  control 

This  section  discusses  a  simple  stiffness  controller 
based  on  the  kinematics  of  an  active  worktable. 

Refer  to  Fig.4.  Kinematics  equations  of  the  link 
mechanism  are 


Fig.  3  Prototype  active  worktable. 


In  order  to  realize  the  active  compliance,  a  stiffness 
control  scheme  [4]  is  applied  to  the  mechanism  as 
follows. 

Give  the  relation  between  the  applied  force  Fs  and  table 
deflection  X/  in  the  task  coordinate  frame  Xs-Ys  as 

Fs  =KX  +DX  ,  (3) 

where  Ks=diag(Ksx,Ksy)  is  a  stiffness  matrix  and 
Ds=diag(Dsx,Dsy)  is  a  damping  matrix  to  ensure  the 
stability  of  the  control  system,  respectively. 

Assuming  that  dynamic  coupled  nonlinearities 
(coriori’s  force,  centrifugal  force  etc.)  are  neglegibly 
small  and  gravity  forces  can  be  mechanically 
compensated  by  counterweights,  torques  to  realize  the 
target  stiffness  (3)  is  determined  as  follows. 

t = -jt  c rksrtx*  +  R£>sRTX'  l  (4) 

task  coordinate  frame 


Ys  Xs 


Px=t  cos  0]  cos  02  (1) 

Py  - 1  sin  6]  +  lsm02  .  (2) 


Fig.  4  Kinematics  of  the  parallel  link 
mechanism. 
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where  J  is  Jacobian  matrix  of  kinematics  relation  and  R 
is  a  rotation  matrix  between  the  base  coordinate  frame  to 
task  coordinate  frame.  Xe  is  table  deflection  in  the  base 
coordinate  frame,  which  is  given  by  reference  position  Xq 
and  motor  angles  as 

Xe=X-X0=m-X0.  (5) 

where  L(  B )  is  the  kinematics  relationship  of  (1)  and  (2). 

The  table  velocity  is  determined  as 

X'=X-X0=J9-X0.  (6) 

3.2  Impedance  control 

The  controller  (4)  is  a  position  servo  in  the  task 
coordinate  frame  and  stiffness  is  determined  by  position 
feedback  gains.  Thus,  the  controller  can  achieve  arbitrary 
stiffness  without  using  force  feedback.  However,  the 
controller  has  not  freedoms  to  specify  the  dynamic 
characteristics.  For  example,  once  we  choose  a  small 
position  gain  for  compliant  motion,  the  transient 
response  of  the  control  system  becomes  slow. 

An  impedance  control  method  [5]  can  be  applied  when 
the  force  information  is  available  in  a  control  loop. 

By  adding  a  force  feedback  to  equation  (4)  as 

T  =  -Jt{RKsRtX,  +  RDSRTX, 

- R(Kr-1)RtF} ,  (7) 

we  can  realize  the  following  mechanical  impedance. 

MRX*+DsX:+KsXe=KRF\  (8) 

where  MR  is  a  mass  matrix  of  the  mechanism  described 
in  the  task  coordinate  frame. 

Although  Mr  is  a  function  of  joint  angles  and  varies  in 
accordance  with  the  table  configuration,  the  simplified 
impedance  control  (7)  cannot  specify  the  mass  matrix. 
However,  motor  inertias  and  counterweights  determine 
dominant  components  of  the  mass  matrix  and  they  do  not 
depend  on  the  table  configuration.  Consequently,  in 
relatively  small  work  area,  the  matrix  is  assumed  to  be 
constant.  Thus,  we  can  design  the  controller  as  follows. 

First,  we  design  Ks  and  Ds  at  the  nominal  table 
configuration  to  satisfy  the  dynamic  characteristics,  then 
design  KR  to  realize  the  necessary  stiffness 
Both  the  stiffness  controller  and  simplified  impedance 
controller  need  information  on  the  normal  direction  at 
contact  point.  This  information  is  determined  by  the 
orientation  of  an  end  effector,  which  is  decided  in  the 
robot  programming  process. 


applied  force 


Fig.  5  Two  axis  force  sensor. 

Since  reference  positions  of  the  table  are 
programmable,  we  can  adjust  the  contact  force  by 
accommodating  the  reference  position  along  the 
compliant  direction.  The  accommodation  also  enables  the 
compensation  of  positioning  errors  introduced  by  the 
uncertainty  of  fixturing. 

4.  Experiments 

To  confirm  the  basic  effectiveness  of  the  system, 
stiffness  of  the  table  was  examined  using  the  prototype 
active  worktable. 

As  shown  in  Fig.  5,  simple  force  sensor  using  strain 
gauges  is  installed  in  the  table  to  measure  the  forces  in 
two  directions  in  the  Cartesian  coordinate  frame.  The 
force  information  is  necessary  for  the  force  feedback  in 
the  impedance  control  (7). 

The  control  software  was  written  in  C++  and 
implemented  on  200MHz  Pentium  processor.  Sampling 
frequency  of  the  control  system  is  200Hz. 

4.1.  Stiffness  control 

First,  stiffness  control  (4)  was  implemented  and 
tested.  Stiffness  was  designed  as  5N/mm  for  compliant 
direction  and  50N/mm  for  rigid  direction.  Table 
deflections  caused  by  the  applied  force  were  examined  in 
three  different  compliant  directions,  O',  45  °  ,  90  ° 
respectively. 

The  results  are  shown  in  Fig.  6.  (a)  is  plots  of  the 
deflections  in  the  compliant  direction  and  (b)  is  those  in 
the  rigid  direction.  Although  the  deflections  are  slightly 
larger  than  expected,  a  slope  of  the  plots  agrees  with  the 
designed  stiffness.  The  results  show  that  we  can  achieve 
the  arbitrary  stiffness  in  steady  state  without  using  the 
force  information 
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4.2.  Simplified  impedance  control 

Next,  the  simplified  impedance  control  (7)  was  tested. 
Fig.  7  shows  the  same  results  as  Fig.  6.  The  control 
parameters  are  chosen  as  listed  in  Table  2. 

Accuracy  of  the  stiffness  is  remarkably  improved  in 
both  directions.  The  controller  not  only  increases  the 
accuracy  of  the  stiffness  control,  but  also  improves  the 
transient  characteristics  of  the  system. 

Fig.  8  shows  step  responses  of  the  active  worktable. 
The  results  show  that  stable  and  quick  responses  are 
realized  by  choosing  the  appropriate  Ks  and  Ds.  The  rise 
time  of  the  step  responses  is  less  than  40ms  for  both 


Applied  force  Fsy  [N] 
(b)  rigid  direction 


Fig.  6  Experimental  results  of  the  stiffness 
control. 


Direction 

Ks 

[N/mm] 

Ds 

fNs/imnl 

Kr 

Stiffness 

[N/mm] 

Compliant 

40 

0.8 

8 

5 

Rigid 

40 

0.8 

0.8 

50 

Table  2  Control  parameters. 

compliant  and  rigid  direction.  The  positioning  accuracy 
is  0.1mm  for  horizontal  direction  and  0.5mm  for  vertical 
direction  respectively. 

Although  the  nonlinear  terms  of  the  table  dynamics 
are  neglected  in  the  control  systems  design,  dynamic 
interactions  are  sufficiently  small. 


Applied  force  Fsy  [N] 
(b)  rigid  direction 


Fig.  7  Experimental  results  of  the  simplified 
impedance  control. 


5.  Conclusion 


An  active  worktable  for  force  control  of  robot 
manipulators  was  designed,  fabricated  and  tested.  The 
active  worktable  uses  a  parallel  link  mechanism  driven 
by  DD  motors.  The  table  holds  a  workpiece  and 
accommodates  its  position/force  in  accordance  with  the 
motion  of  a  robot. 

In  order  to  provide  a  compliant  motion  to  the  table,  a 
force  control  system  was  designed,  based  on  a  stifiness 
control  method.  The  control  system  realizes  the  necessary 
stifiness  in  arbitrary  task  coordinate  frame. 

The  dynamic  characteristics  of  the  table  were  also 
considered  by  implementing  a  simplified  impedance 
control.  The  controller  can  realize  the  arbitrary  stifiness 
without  decreasing  the  dynamics  performance. 

Several  experiments  were  carried  out  using  a 
prototype  active  worktable.  Experimental  results  showed 
the  usefulness  of  the  developed  worktable  for  force 
control  of  robot  manipulators. 
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Fig.  8  Step  response  of  the  active  worktable. 
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Abstract 

A  manipulation  of  fragile  and  shapeless  objects  requires 
an  actuator  with  enough  flexibility  and  safety  not  to 
injure  manipulated  objects.  To  cope  with  such 
requirements  some  kinds  of  soft  actuator  have  been 
already  developed,  almost  of  which  utilize  the  elastic 
deformation  of  rubber  tube  or  balloon  caused  by 
compressed  air  pressure.  Such  a  pneumatic  rubber 
actuator  can  be  expected  to  be  effectively  used  as  a 
flexible  and  human  friendly  soft  actuator  in  various  fields. 
In  this  study ,  in  order  to  realize  a  flexible  pneumatic 
carrier  system ,  a  soft  planar  actuator  using  rubber  balls 
has  been  developed  assuming  that  the  actuator  directly 
contacts  with  the  carried  objects.  This  paper  describes  a 
fundamental  principle  of  operation,  a  control  method  and 
experimental  results.  The  results  sho\ v  the  effectiveness 
of  the  proposed  actuator  mechanism. 

1.  Introduction 

In  various  electronic,  mechanic  and  food  factories  or 
medical  welfare  fields,  robots  which  can  manipulate 
fragile  and  shapeless  objects  have  been  required,  which 
should  be  driven  with  an  actuator  provided  with  enough 
flexibility  and  safety  not  to  injure  the  manipulated  objects 
or  human.  To  cope  with  such  requirements  some  kinds  of 
soft  actuator  have  been  already  developed,  almost  of 
which  utilize  the  elastic  deformation  of  rubber  tube  or 
balloon  caused  by  compressed  air  pressure  [1,  2,  3].  Such 
a  pneumatic  rubber  actuator  can  be  expected  to  be 
effectively  used  as  a  flexible  and  human  friendly  soft 
actuator  in  various  fields. 

In  this  study,  in  order  to  realize  a  flexible  pneumatic 
carrier  system,  a  soft  planar  actuator  using  pneumatic 
rubber  balls  has  been  developed  assuming  that  the 
actuator  directly  contacts  with  carried  object.  The 
actuator  consists  of  a  two-layer  building  pneumatic 
rubber  balls.  These  bails  are  divided  into  five  groups, 
each  group  repeats  expansion  and  contraction 
independently  controlled  with  a  pressure  control  valve. 
By  controlling  inner  pressure  of  balls,  the  actuator 
produces  a  traveling  wave  to  carry  the  object  contacting 


with  the  actuator  owing  to  the  friction  force  between  balls 
and  object. 

In  this  paper,  first,  a  control  method  of  inner  pressure 
of  balls  to  realize  the  desired  operation  is  described.  Next, 
an  open  loop  position  control  is  constructed,  which 
depends  on  only  the  pressure  control.  Also,  a  closed  loop 
position  control  system  is  constructed  by  using  a  camera 
as  a  position  feedback  sensor.  The  control  performances 
are  experimentally  investigated. 

2.  Principle  of  operation 

Figure  1(a)  shows  a  basic  component  of  the  actuator. 
The  actuator  consists  of  a  two-layer  building  pneumatic 
rubber  balls.  In  the  lower  layer,  three  balls  are  placed  at 
an  angle  120°  apart.  An  upper  ball  is  attached  to  the 
lower  three  balls.  There  is  no  air  flow  among  the  balls. 
The  inner  pressure  of  each  ball  is  independently 
controlled. 

Figure  1(b)  shows  a  fundamental  construction  of  the 
actuator,  which  is  composed  of  a  point  symmetric 
connection  of  the  basic  component  in  Figure  1  (a)  with  a 
ball  1,  comprising  seven  balls.  The  lower  balls  1,  2  and  3 
determine  the  direction  of  travelling  wave  by  adjusting 
the  combination  of  expansion  and  contraction  of  each 
ball.  The  upper  balls  4  and  5  contact  with  the  object  and 
select  the  carrier  direction  by  adjusting  the  timing  of 
expansion  and  contraction  for  the  motion  of  lower  balls. 


(a)  Basic  component  (b)Fundamental  construction 
Figure  1.  Fundamental  structure  of  proposed  actuator 


Figure  2  shows  an  example  of  fundamental  operation 
of  the  actuator,  where  the  object  is  carried  toward  the 
right  direction.  The  left  and  right  figures  are  top  and  side 
views,  respectively.  Dotted  lines  indicate  the  center 
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Figure  2.  Operation  of  soft  planar  actuator 


positions  of  each  ball  in  the  state  of  (a).  The  principle  of 
operation  is  described  as  follows: 

(a)  Balls  1,  2  and  3  are  in  a  middle  pressure.  Balls  4 
and  5  are  in  an  expansion  and  a  contraction, 
respectively.  Right  after,  by  contracting  ball  1  and 
expanding  balls  2  and  3,  balls  4  and  5  are  moved 
toward  the  center  position.  The  ball  4  cairies  the 
object  to  the  direction  indicated  by  an  arrow,  that 
is,  the  right  lower  direction.  At  the  same  time, 
balls  4  and  5  start  contracting  and  expanding, 
respectively. 

(b)  Balls  4  and  5  are  in  the  middle  pressure,  both  balls 
are  contacting  with  the  object.  By  continuing  the 
contraction  of  ball  4  and  expansion  of  ball  5,  the 
object  is  handed  to  ball  5  from  ball  4.  At  the  same 
time,  balls  4  and  5  start  returning  to  the  original 
position  in  the  state  of  (a)  by  bringing  balls  1,  2 
and  3  back  to  the  middle  pressure.  The  object  can 
be  carried  to  the  right  upper  direction  of  the  arrow. 

(c)  Balls  1,  2  and  3  are  in  the  middle  pressure.  Balls  4 
and  5  are  in  a  contraction  and  an  expansion.  Right 
after,  by  expanding  ball  1  and  contracting  balls  2 
and  3,  balls  4  and  5  are  moved  apart  from  the 
center  position.  The  object  contacting  with  ball  5  is 
carried  to  the  right  lower  direction  indicated  by  the 
arrow.  At  the  same  time,  bails  4  and  5  start 
expanding  and  contracting,  respectively. 

(d)  Balls  4  and  5  are  in  the  middle  pressure,  both  balls 
are  contacting  with  the  object.  By  continuing  the 
expansion  of  ball  4  and  contraction  of  ball  5,  the 
object  is  handed  to  ball  4  from  ball  5.  At  the  same 
time,  balls  4  and  5  start  returning  to  the  original 
position  in  the  state  of  (a)  by  bringing  balls  1,  2 
and  3  back  to  the  middle  pressure.  The  object  can 
be  carried  to  the  right  upper  direction  of  the  arrow. 

By  repeating  the  process  from  (a) -to  (d),  the  object  can  be 
continuously  carried  to  the  right  direction. 


Figure  3  shows  an  operation  to  carry  to  the  other 
direction,  which  can  be  achieved  by  changing  the 
combination  of  expansion  and  contraction  of  lower  balls. 
In  this  example,  if  balls  4  and  5  are  contracted  and 
expanded  ,  respectively,  in  the  left  figure,  and  in  the  right 
figure  balls  4  and  5  are  expanded  and  contracted, 
respectively,  the  movement  in  the  right  lower  direction 
of  the  arrow  can  be  achieved  by  periodically  switching  of 
left  and  right  configurations.  The  combination  of 
expansion  and  contraction  can  be  adjusted  by  the 
pressure  control  of  balls.  Both  the  desired  carrier 
direction  and  speed  can  be  achieved  by  the  pressure 
control  of  lower  balls. 


Figure  3.  Operation  for  two  dimensional  direction 

By  combining  the  fundamental  structure  shown  in 
Figure  1(b),  a  soft  planar  actuator  comprising  48  balls  is 
constructed  as  shown  in  Figure  4.  which  can  carry  the 
object  in  the  arbitrary  two  dimensional  directions  on  the 
actuator.  This  actuator  is  composed  of  lots  of  balls, 
however,  it  can  be  driven  with  only  five  pressure  control 
valves  because  balls  with  same  number  are  driven  with  a 
common  pressure. 


436 


(1) 


l SFj  =  AjSPj 

Figure  5(a)  shows  the  force  from  the  lower  ball  to  the 
upper  ball.  The  angle  c p  j  is  assumed  not  to  vary  and  0  1 
=  (j)  2-  (j)  3=  (j) .  From  the  configuration  of  balls  shown  in 
Figure  5(b),  the  generated  forces  in  x,  y  and  z  directions 
are  composed  as  follows: 

/n  /n 

5FX  =  ' — —  SF2cos  0  +  —  5F3cos0  &) 

5Fy  =  —SFi  cos0  +  ~5F2cos0  +  ^F3cos0  ^ 
5FZ  =  5Fi  sin  0  -f  SF2  sin  0  -f  5^3  sin  0  (4) 


Figure  4.  Soft  planar  actuator  comprising  48  balls 

3.  Pressure  control 

In  order  to  derive  a  pressure  control  strategy,  the 
fundamental  construction  shown  in  Figure  fra)  is 


analyzed. 

Some  variables  are  defined  as  follows: 

P  0 

:  equilibrium  pressure 

Pj 

:  inner  pressure  of  ball 

Fj 

:  generated  force  of  ball 

A  j 

:  pressure  acting  area  of  ball 

4>  j 

:  angle  between  lower  and  upper  balls 

Px ,  Fv,  Pz  :  virtual  acting  pressures  in  x,  y  and  z 
directions 

Ps  :  composition  of  virtual  pressures 

0  :  angle  of  Ps  from  positive  x  direction  in  x-y 

plane 

where,  a  subscript  j  (j=  1,2,3)  specifies  the  amounts  of 
balls  1.2,  and  3.  8  denotes  a  change  of  each  variable 
from  the  equilibrium  value. 

When  the  inner  pressure  of  ball  changes,  its 
generated  force  (  expansive  force)  also  changes.  The 
change  of  expansive  force  8  Fj  induced  by  the  inner 
pressure  change  8  Pj  from  the  equilibrium  pressure  Po  is 
given  by 


Assuming  ^4 1  =^4 2=^4 3=^4 ,  and  virtual  acting  pressures, 
8  Px=  6  Fx-A,  8  Py=  8  FyA  and  8  Pz=  8  FiA  are 
defined.  It  is  supposed  that  the  upper  ball  is  moved 
toward  the  composite  direction  of  virtual  acting 
pressures  and  of  which  displacement  is  proportional  to 
the  magnitude  of  composite  pressure. 

From  equations  from  (1)  to  (4),  the  relation  between 
the  inner  pressure  change  vector  8  P=[  5  Pi,  8  Pi,  8  P3]T 
and  the  virtual  acting  pressure  vector  5  Ptjr=[  8  Px,  8  Py, 
5  Pz ]T  can  be  expressed  as 


6  Pa  =  NfiP 


where,  the  matrix  TV  is  given  by- 


TV  = 


0 

—  cos  0 
simp 


— Y  COS  0  ^  COS  0 
\  COS  0  |  COS  0 

sin  0  sin  0 


(5) 


(6) 


In  order  to  cany  the  object  toward  the  desired  6 
direction,  the  upper  balls  4  and  5  have  to  perform  a 
reciprocating  motions  in  the  specified  direction  of  6  . 
This  study  deals  with  only  the  motion  in  the  horizontal  x- 
y  plane,  nof^ considering  the  motion  in  the  z  direction. 


(a)  Relation  between  lower  and  upper  ball 


(b)  Horizontal  force 


Figure  5.  Force  from  lower  balls 
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The  magnitude  and  direction  from  the  positive  x  direction 
are  represented  by  8  Ps  and  8  as  shown  in  Figure  6. 
The  relation  between  d  Pa  and  6  Ps,  6  is  given  by 

5Pn  =  SP,L  (7) 


cos  0 


L  — 


sin  0 


0 


(8) 


Figure  6.  Driving  pressure  in  8  direction 

Based  on  the  equations  (5)  and  (7),  a  pressure  control 
system  for  lower  balls  1,  2  and  3  can  be  constructed  as 
shown  in  Figure  7.  uQ  denotes  the  control  input  to 
generate  the  equilibrium  pressure  Po.  P  are  detected 
pressures,  u  are  control  inputs  corresponding  to  6  P.  C 
expresses  a  controller.  8  Ps  is  given  by  a  triangular  wave 
with  an  amplitude  of  8  Psmax  as  a  top  diagram  in  Figure 
8.  The  inner  pressures  of  balls  4  and  5  are  also  controlled 
by  triangular  waves  synchronizing  with  the  desired 
pressure  of  lower  balls  as  shown  in  Figure  8. 

4,  Position  control  system 


Figure  9  shows  a  position  control  system.  X  is  a 
position  of  carried  object  in  the  x-y  plane,  of  which 
desired  value  is  Xr  .  The  pressure  control  system  same  as 
Figure  7  is  included  as  an  inner  loop.  The  position 
control  error  Xr-X=[  8  x,  8  y]T  is  transformed  to  the 
virtual  acting  pressure  8  Ps  and  the  angle  8  as  shown  in 


Figure  10.  8  Psmax  is  given  by 

SPs  max  =  SSr  (9) 

where,  S  is  treated  as  a  proportional  control  gain. 

5.  Experimental  system 

Figure  11  shows  an  experimental  system.  An 
experimental  planar  actuator  is  constructed  by  using  48 
soft  tennis  balls,  comprising  8  balls  as  each  ball  group  of 
1,4,  and  5,  12  balls  as  each  ball  group  of  2  and  3, 

respectively. 

The  inner  pressure  of  each  ball  group  is  controlled  by 
a  pneumatic  servo  valve.  A  supply  pressure  Ps  is 
200[kPa].  The  detected  pressures  are  fed  back  to  the 
computer  through  A/D  converter.  The  position  of  object 
is  measured  with  a  camera. 

6,  Experimental  results  and  discussion 

In  the  experimental  system,  the  equilibrium  pressure 
Po  is  15[kPa]  and  the  angle  <i>  is  30°  .  The  controller  is 
PI  controller  with  a  proportional  gain  %>=0.03[m2/N]  and 


Figure  7.  Pressure  control  system 
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Figure  9.  Position  control  system 


an  integral  gain  ki=0.5[m2/Ns].  The  amplitude  8  Plmax 
and  8  Psmax  of  desired  triangular  pressure  waves  of  bails 
4  and  5  are  25[kPa],  An  acrylic  board  of  350[mm]  X 
350 [mm]  X  2 [mm]  is  used  as  a  carried  object. 


Figure  10.  Relation  between  8  x,  8  y  and  8  Ps,  6 


6.2.  Open-loop  operation  with  pressure  control 

Figure  13(a)  shows  the  carrier  distance  r  of  the  object 
at  intervals  of  20  seconds  for  various  driving  frequency, 
when  only  the  pressure  control  shown  in  Figure  7  is 
executed  for  the  triangle  desired  pressure  wave  with  an 
amplitude  of  8  Psmax  =32[kPa]  and  for  five  desired 
carrier  directions.  The  experimental  results  shown  in 
Figure  13  are  the  average  of  five  measurements.  From 
the  results,  the  driving  frequency  of  0.4 [Hz]  and  the 
amplitude  8  Psmax  of  32[kPa]  are  selected  as  a  best 
driving  condition  of  the  experimental  actuator  where  the 
carrier  speed  is  relatively  large  and  the  influence  of  the 
carrier  direction  on  the  earner  speed  is  relatively  small. 


6.1.  Pressure  response  of  each  ball  group 

Figure  12  shows  the  frequency  response  of  each  ball 
group  measured  for  a  sinusoidal  desired  input  with  an 
equilibrium  pressure  of  Po=15[kPa]  and  an  amplitude  of 
8  /^[kPa].  Each  group  of  ball  1,4  and  5  includes  eight 
balls,  respectively.  Groups  of  ball  2  and  3  comprise 
twelve  balls.  Balls  included  in  each  ball  group  are 
simultaneously  controlled  with  a  common  one  servo 
valve.  A  satisfactory  control  performance  can  be 
obtained  in  the  frequency  range  below  about  1  [Hz] 
regardless  of  number  of  driven  balls. 


Figure  12.  Pressure  response  of  each  ball  group 


Driving  frequency  [Hz] 


Figure  13.  Open-loop  characteristics 
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Figure  14  shows  the  movement  of  object  during  20[s] 
for  every  30  desired  carrier  directions  0  r  between  0° 
and  330  under  the  above  driving  condition  of  0.4[Hz] 
and  8  Psmax=32  [kPa].  The  respective  desired  and 
measured  carrier  directions  are  represented  by  dotted  and 
solid  lines.  The  measured  values  indicate  the  center  of 
object.  An  origin  of  coordinate  is  the  starting  point.  The 
results  show  that  the  movement  in  the  arbitrary 
directions  can  be  obtained,  but  a  certain  difference 
remains  between  command  directions  and  actual 
directions  probably  caused  by  the  assembly  error,  the 
individual  difference  of  balls  and  so  on. 


X  [mm] 


Figure  14.  Carrying  in  various  0  direction 

6.3.  Closed-loop  operation  with  position  feedback 

The  experimental  results  obtained  from  the  position 
feedback  control  system  shown  in  Figure  9,  where  the 
driving  frequency  is  0.4[Hz]  and  8  Psmax=32\yPa].  Figure 
15  shows  the  step  response  when  the  movement  of 
100[mm]  in  the  x  direction  and  50[mm]  in  they  direction. 
5^4xl03[Pa/mm]  in  the  equation  (9).  The  object  can  be 
accurately  carried  to  the  desired  position. 

Figure  16  shows  the  response  for  the  desired  circular 
trajectory  with  a  radius  of  50[mm]  and  a  period  of  70[s] 
Where,  5=5  x  103[Pa/mm]  . 

7.  Conclusions 

A  soft  planar  actuator  to  carry  the  object  in  the  two 
dimensional  directions  has  been  developed  as  a 
pneumatic  carrier  actuator  with  the  flexibility.  The 
experimental  actuator  comprises  48  soft  tennis  rubber 
balls.  A  control  method  of  the  inner  pressure  of  balls  to 


Figure  15.  Step  response  on  closed-loop  system 


Figure  16.  Tracking  control  for  desired  circle 

carry  thfe  object  successfully  has  been  proposed.  Also  a 
position  feedback  control  system  has  been  constructed 
with  a  camera.  The  proposed  actuator  can  carry  the 
object  continuously  in  the  arbitrary  two  dimensional 
directions.  The  experimental  results  have  confirmed  the 
effectiveness  of  the  proposed  actuator. 

This  actuator  is  suitable  for  carrying  soft  and  fragile 
object.  The  small  sized  actuator  is  available,  for  example, 
in  a  semiconductor  manufacturing,  food  industry  and  so 
on.  Also,  the  large  sized  one  may  be  effective  in  the 
welfare  or  medical  field,  for  example  ,  as  welfare  bed. 
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Abstract 

In  this  paper  the  implementation  of  a  simple  mechatronic 
system  design  of  a  new  robotic  underwater  tactile  sensor 
is  described \  We  examine  the  design  and  the  definition  of 
a  tactile  sensor  for  manipulation  in  underwater  robotic 
applications.  Different  kinds  of  technology  concerning 
force  transducers  are  reported,  in  order  to  better 
understand  the  choice  of  strain  gauge  technology  for  this 
application.  The  most  important  features  that  the  sensor 
has  to  respect  are  individuated:  the  choice  of  the  adapted 
transducers  for  this  kind  of  application,  the  design  of  the 
mechanical  structure,  in  terms  of  the  sensitive  force 
surface,  the  circuitry  to  conditioning  the  signals  coming 
from  the  transducers;  the  compensation  part,  in  order  to 
work  at  the  different  water  pressure  the  water  -  proof 
support  and  the  way  to  better  mount  it  on  the  gripper  of 
the  manipulator.  In  fact,  the  final  product  should  be 
mounted  on  the  fingertip  of  an  underwater  manipulator, 
developed  inside  an  EU-  project.  After  the  Force  versus 
Voltage  feature  of  the  sensor,  some  experimental  data 
performed  in  air  are  showed:  a  qualitative  shape 
recognise  and  a  detecting  force  response  versus  xy 
displacement  on  a  single  sensor  cell 

L  Introduction 

In  robotic  field,  it  is  becoming  more  and  more 
important  to  support  mechanical  robot  of  several  sensing 
devices  in  order  to  get  satisfying  information  of  the  main 
physical  quantities.  This  allows  to  better  identify  the  robot 
and  to  develop  the  right  control  algorithms.  For  this 
reason,  it  is  necessary  an  accurate  study  of  the  devices 
(component,  struments)  technology  (mechanical,  electro¬ 
mechanical,  solid  state,  optical)  and  techniques  (data 
acquisition,  data  processing).  Robotic  arms  and 
mechanical  hands  have  a  larger  and  larger  field  of 
applications,  such  as  assembling  parts,  edge  following, 


fine  manipulation,  space  and  underwater  application  and 
so  on  [1]  [2]  [3].  The  purposes  of  a  tactile  sensor  are 
described  in  the  following  points.  The  information  coming 
from  a  tactile  sensor  [4]  [5]  can  be  used  for  application 
like  object  recognition,  where  it  is  necessary  to  determine 
the  location  and  orientation  of  object  edges  and  surfaces 
in  order  to  design  a  real  time  closed  loop  force  control. 
Slip  detection  [6]  is  another  field  where  data  coming  from 
tactile  sensors  are  processed  with  frequency  analysis 
based  techniques;  the  position  of  the  centre  of  the  force 
distribution  and  the  fluctuation  of  the  contact  point  normal 
are  good  parameters  to  define  slipping  between  robot  and 
object.  In  this  work,  we  consider  the  information  coming 
from  a  tactile  sensor,  a  device  that  is  able  to  give  physical 
information  about  interaction  forces  during  the  contact. 
The  use  of  strain  gauges  transducers  for  underwater 
application  has  been  chosen  because  it  is  easy  to  carry  on 
a  mechanical  structure,  which  compensate  the  water 
pressure.  Moreover,  they  are  more  robust  compared  with 
other  kinds.  Talking  about  the  underwater  tactile  sensor 
prototype,  we  will  use  the  same  technology  for  the  aspect 
described  before,  about  making  a  rialable  underwater 
device.  Recent  research  [7]  describes  the  design  of  a  new 
underwater  slip  detecting  and  force  sensor  mounted  on  a 
fingertip,  using  strain  gauges  as  sensor  elements. 

The  following  points  describe  the  development  and 
design  of  this  device.  The  shape  of  the  sensor  is  given  by 
the  arrangement  of  13  cells  mounted  in  a  rhomboidal 
configuration,  in  order  to  better  acquire  a  good  tactile 
information  about  the  manipulated  object  (Figure  2). 
Transducers  were  selected  in  a  way  such  that  they  have 
good  performances  in  according  to  the  desired 
application,  to  be  able  to  operate  correctly  in  the  marine 
environments.  The  choice  of  strain  gauges  (piezoresistive 
transducer  technology)  was  preferred  to  other  materials 
(such  as  conductive  rubber,  piezoelectric,  capacitive  and 
so  on)  because  it  is  easier  with  them  to  develop  a 
compensation  system  for  underwater  application.  Any 
transducers  has  been  mounted  in  every  cells  in  figure  1  .An 
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electronic  circuit  is  necessary  to  manage  the  signals 
coming  from  the  strain  gauges  and  send  them  to  the  signal 
processing  devices  using  a  single  wire  (weather  it  is 
possible).  This  is  accomplished  by  a  16  channels  analog 
multiplexer.  In  fact,  the  arm  allows  the  passage  of  only 
eight  wires  through  it,  due  to  the  complexity  of  the 
internal  devices  such  as  motors,  reducers,  resolvers  and  so 
on.  Three  of  them  are  necessary  for  the  power  supply, 
while  one  is  devoted  to  the  clock.  Hence  only  four  wires 
are  available.  The  idea  is  to  use  two  of  them  for  the  data 
transmission  and  the  other  two  for  the  communication 
protocol,  in  order  to  recognise  which  channel  is  being 
acquired.  Two  possible  approaches  are  currently  under 
evaluation  about  transmitting  the  signals.  The  first  one  is 
simply  the  posting  of  the  analog  signals  from  the 
multiplexer  output.  The  second  consists  of  using  a  digital 
circuit  able  to  convert  analog  data  into  digital  ones  to  be 
sent  through  a  serial  line. 


Figure  1.  The  dual-arm  working  cell 


In  the  second  case,  two  possibilities  are  considered: 
the  use  of  an  RS232  protocol  with  a  serial  link  and  the  use 
of  a  parallel  port  adopting  a  custom  protocol.  In  the  first 
case,  a  custom  circuitry  is  needed  in  order  to  demultiplex 
and  synchronize  sensor  data  to  make  them  available  as 
input  of  the  ADC  board  (IP -330)  on  the  VME  bus,  which 
is  used  for  the  control  hardware.  Anyway,  it  seems  that  the 
RS232  protocol  solution  is  the  best  one,  moreover  by 
using,  a  microcontroller  to  manage  the  signals  both  during 
the  A/D  conversion  and  the  data  transmission  phase.  The 
mechanical  support  must  be  able  to  make  waterproof  the 
sensor  and  electronic  parts.  For  this  way  a  metallic 
support  to  guest  the  electronic  part  has  been  developed.  In 
the  other  hand,  different  kind  of  rubber  are  under  study,  in 
order  to  choose  the  one  which  is  able  to  better  acquire  and 
transmit  the  normal  force  on  the  strain  gauges  and,  in  the 
same  time,  to  make  waterproof  the  system. 


Figure  2  -  The  tactile  sensor  surface  and  support 

The  AMADEUS  Project  [8]  (Advanced  MAnipulation 
for  DEep  Underwater  Sampling)  is  an  UE-fiinded  project 
within  the  MAST  (MArine  Science  and  Technology) 
framework.  Its  aim  is  to  study  and  develop  on-the-edge 
technology  for  underwater  manipulation  of  objects.  The 
project  is  currently  in  its  second  phase,  aimed  at  devising, 
producing  and  testing  two  different  hardware  setups:  a 
hydraulic-actuated  manipulator  mounted  on  a  Slingsby 
arm  and  a  ‘dual-arm  workcell’  [9],  a  couple  of 
mechanical-actuated  arms  with  smaller  jaws  on  top 
(Figure  1). 

The  tactile  sensor  presented  in  this  paper  (figure  2)  is 
going  to  be  mounted  on  the  gripper  of  each  arm  developed 
in  Genoa,  to  perform  underwater  operations  in  the  robotic 
manipulation  field.  The  ‘technical  side’  of  the 
AMADEUS  Consortium  is  composed  of  the  Heriot  Watt 
University,  Edinburgh,  responsible  for  the  hydraulic 
gripper,  the  DIST  Department,  University  of  Genoa  and 
the  Naval  Automation  Institute,  Genoa,  who  share 
responsibility  for  the  dual  arm  workcell  and  the  Human 
Computer  Interface  [10].  The  ‘scientific  side’  is  composed 
of  the  Institute  of  Marine  Biology  of  Crete  and  the 
University  of  Barcelona:  these  latter  teams  consist  of 
biologists  who  have  established  the  global  features  the 
developed  systems  should  have.  In  section  II  we  examine 
the  main  transducers  technology  for  tactile  and  force 
sensors. 

In  section  III  we  consider  the  main  sensor  feature 
(sensor  shape,  transducers,  conditioning  and  sampling 
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circuit,  data  transmission,  mechanical  part  and  underwater 
features,  protective  covering  rubber  and  compensator).  In 
section  IV,  the  performance  of  the  sensor  is  reported,  and 
some  experimental  results  are  reported.  In  section  V  the 
conclusion  and  the  consideration  about  future 
developments  are  made. 

2.  Transducers  for  tactile  and  force  sensor 

The  transducers  able  to  be  tactile  sensor  elements  [11] 
are  classified  by  the  physical  effects  by  means  they 
convert  a  pressure/force  in  a  different  easy-to-get  quantity. 

We  can  consider: 

-  piezoresistive  transducers,  where  the  pressure  is 
converted  into  a  resistance  difference,  due  to  the 
expansion  (contraction)  of  the  sensor  element  (strain 
gauges,  conductive  rubber),  used  in  many  kind  of 
applications  [3]  [12]  [13]; 

-  piezoelectric  transducers,  where  a  contact  force  is 
converted  into  a  voltage  difference  (quartz  based  sensor); 
they  are  also  used  in  micropositioning  system  for 
automatic  precision  assembly  [14]; 

-  capacitive  transducers,  [4]  often  used  in  biomedic 
and  robotic  manipulation  fields,  where  a  capacitive 
variable  array  of  elements  gives  the  tactile  information; 

-  optoelectronic  transducers,  where  a  change  of  the 
light  magnitude  is  performed  in  presence  of  external 
pressure  [15]  [16]; 

-  inductive  transducers,  which  are  better  used  as 
position  sensor  rather  then  tactile  sensor,  due  to  their  large 
dimensions.  In  this  case,  a  pressure  difference  is  detected 
by  inductive  effect. 


Figure  3  -  The  underwater  tactile  sensor 


The  use  of  strain  gauges  transducers  for  underwater 
application  has  been  chosen  because  it  is  easy  to  carry  on 
a  mechanical  structure,  which  compensate  the  water 
pressure.  Moreover,  they  are  more  robust  compared  with 
other  kinds.  Talking  about  the  underwater  tactile  sensor 


prototype,  we  will  use  the  same  technology  for  the  aspect 
described  before,  about  making  a  rialable  underwater 
device.  Sensor  features 

3.  The  underwater  tactile  sensor 

We  are  going  to  design  and  assemble  four  underwater 
tactile  sensors  based  on  strain  gauges:  the  sensors  will  be 
mounted  on  the  fingertips  of  the  grippers  of  a  dual  arm 
cell  for  underwater  manipulation  applications.  They  will 
be  used  to  control  manipulation  and  in  shape  recognition 
tasks.  The  following  points  describe  the  development  of 
this  device. 

3.1  Sensor  shape 

The  shape  of  the  sensors  is  given  by  the  arrangement 
of  13  cells  mounted  in  a  rhomboidal  configuration,  in 
order  to  better  acquire  a  good  tactile  information  about  the 
manipulated  object  (Figure  2).  The  strain  gauges  are 
placed  on  each  cells  of  the  surface  A.  This  is  a  kind  of 
steels,  0.5  mm  thick.  The  three  parts  are  fixed  together. 
The  part  B  (Alluminium,  1  mm  thick)  is  necessary  to 
avoid  the  dependence  between  the  force  applied  on  each 
cantilever.  The  part  C  (Aluminum,  3  mm  thick)  ensures 
the  joint  holding,  and  works  as  mechanical  lock. 

3.2  Transducers 

Transducers  were  selected  in  a  way  such  that  they  have 
good  performances  in  according  to  the  desired 
application,  and  they  are  able  to  operate  correctly  in  the 
marine  environments. 

The  choice  of  strain  gauges  (piezoresistive  sensor 
technology)  was  preferred  to  other  materials  (such  as 
conductive  rubber,  piezoelectric  sensors,  and  capacitive 
sensors)  because  it  is  easier  with  them  to  develop  a 
compensation  system  for  underwater  application.  Any 
transducer  has  been  mounted  in  every  cell  in  figure  4. 

3.3  Conditioning  and  sampling  circuits 

An  electronic  circuit  is  necessary  to  manage  the 
signals  coming  from  the  strain  gauges  and  send  them  to 
the  signal  processing  devices  using  a  single  wire  (weather 
it  is  possible).  This  is  accomplished  by  a  16-channel 
analog  multiplexer.  In  fact,  the  arm  allows  the  passage  of 
only  eight  wires  through  it,  due  to  the  complexity  of  the 
internal  devices  such  as  motors,  reducers,  resolvers  and  so 
on.  Three  of  them  are  necessary  for  the  power  supply. 
Hence  only  five  wires  are  available.  The  idea  is  to  use  two 
of  them  for  the  data  transmission  and  the  other  two  for  the 
communication  protocol,  in  order  to  recognise  which 
channel  is  being  acquired.  The  last  one  is  used  as  external 
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clock,  even  if  the  analog  board  provides  an  own  internal 
clock. 

3.4  Data  transmission 

Two  possible  approaches  are  currently  under 
evaluation  about  transmitting  the  signals.  The  first  one  is 
simply  the  posting  of  the  analog  signals  from  the 
multiplexer  output.  The  second  consists  of  using  a  digital 
circuit  able  to  convert  analog  data  into  digital  ones  to  be 
sent  through  a  serial  line. 

In  the  second  case,  two  possibilities  are  considered: 
the  use  of  an  RS232  protocol  with  a  serial  link  and  the  use 
of  a  parallel  port  adopting  a  custom  protocol. 

In  the  analog  acquisition  case,  a  custom  circuitry  is 
needed  in  order  to  demultiplex  and  synchronize  sensor 
data  to  make  them  available  as  input  of  the  ADC  board 
(IP-330)  on  the  VME  bus,  which  is  used  for  the  control 
hardware.  Anyway,  it  seems  that  the  RS232  protocol 
solution  is  the  best  one,  moreover  by  using  a 
microcontroller  to  manage  the  signals  both  during  the  A/D 
converting  and  the  data  transmission  phase.  At  the 
present,  the  first  tests  on  this  prototype  have  been 
performed  with  the  only  analog  data  acquisition  system. 

3.5  Mechanical  parts  and  underwater  features 

The  mechanical  support  must  be  able  to  make 
waterproof  the  sensor  and  electronic  parts.  For  this  way  a 
metallic  support  to  guest  the  electronic  part  has  been 
developed  (figure  4).  In  the  other  hand,  different  kind  of 
rubber  are  under  study,  in  order  to  choose  the  best  in  one, 
or  the  one  which  is  able  to  better  acquire  and  transmit  the 
normal  force  on  the  strain  gauges  and,  in  the  same  time,  to 
make  waterproof  the  system. 


Figure  4.  -  Main  components  of  the  tactile  sensor 


3.6  Mould 


Figure  5  —  The  mould  and  the  covering  rubber 


In  order  to  protect  the  sensor  surface,  it  is  necessary  to 
cover  it  with  an  elastic  material,  able  to  transmit  the 
surface  force  applied  to  the  transducers,  and  able  to  work 
at  the  high  water  pressures.  For  this  reason,  a  bi¬ 
component  silicone  rubber  has  been  used.  The  mould  in 
figure  5  has  been  developed  in  order  to  obtained  a 
perfectly  compliant  shape  with  the  sensor  surface.  The 
water  retain  is  obtained  with  the  mechanical  support. 

3.7  Compensation 

WATEP 


PROTECTIVE  RUBBER 
SINGLE  CANTILEVER 
MECHANICAL  SUPPORT 
EMBEDDED  OIL 
COMPENSATOR  PUBEEP 

Figure  6  -  The  tactile  sensor  section 
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To  get  the  right  performance  at  different  water 
pressures,  it  is  necessary  to  completely  fill  of  oil  the 
mechanical  support, 

or  generally,  with  a  not  compressible  liquid,  which 
provides  the  right  performance  of  the  transducers  (they  are 
however  covered  by  a  special  not  invasive  rubber)  and  the 
embedded  electronic.  Nevertheless,  it  must  be  add  a 
compensator  on  the  base  of  the  mechanical  support,  as 
showed  in  figure  6.  It  is  made  of  a  silicone  rubber,  more 
elastic  then  the  protective  covering  rubber.  In  this  way, 
the  presence  of  the  compensator  allows  to  maintain  the 
cantilever  of  each  cell  in  the  idle  position,  if  the  pressure 
arises. 

4.  Performance  of  the  tactile  sensor 

In  the  graphic  in  figure  7  it  is  showed  the 
characteristics  measured  on  a  strain  gauges  mounted  in  a 
single  cell.  The  signals  have  been  conditioned  by  a  quarter 
Whetstone  bridge  (resistance  of  the  strain  gauge  and  of 
the  other  resistance:  350  ohm;  power  supply:  5  V),  and 
then  acquired  through  a  differential  amplifier. 


Figure  7  -  Voltage  versus  Force 

Gain  of  the  amplifier:  500.  The  A  characteristic  has 
been  acquired  without  any  covering  rubber  on  the  sensor 
surface.  The  B  characteristic  has  been  acquired  with  the 
silicone  rubber  covering  showed  in  figure  2.  The 
decreasing  of  the  Voltage  magnitude  depends  from  the 
distribution  property  of  the  chosen  rubber. 

2:1  scale  of  each  cantilever 


It  has  a  good  linearity.  The  accuracy  is  around  50  g, 
and  it  could  be  improve  by  improving  the  amplifier  and 
increasing  the  signal-to-noise  ratio. 

Two  kinds  of  experiments  have  been  performed  on  the 
tactile  sensor  in  air. 

The  first  ones  have  been  oriented  on  the  objects  shape 
recognises.  In  figure  9  it  is  showed  the  response  acquired 
in  the  presence  of  a  cylindrical  shape  object  on  the 
surface. 

The  second  ones  have  been  conduced  in  order  to  study 
the  effect  of  mechanical  filtering  in  presence  of  elastic 
cover  [17].  In  figure  10  and  1 1  the  horizontal  and  vertical 
characteristics  are  showed.  The  data  have  been  acquired 
with  a  one  kilo  punctual  force,  moving  on  the  surface 
covered  with  the  silicone  rubber.  The  direction  of  the 
displacement  is  showed  in  figure  8. As  showed  in  figure 
10,  the  characteristic  is  symmetric  with  the  center,  where 
the  strain  gage  is  mounted;  the  other  one  (figure  1 1)  is  not 
perfectly  symmetric,  that’s  why  the  force  resultant 
depends  on  the  versus  of  the  displacement 


Figure  9  -  Local  shape  recognise 


Other  tests  of  the  tactile  sensor  are  going  to  be  done, 
in  order  to  evaluate  the  performances  at  different  pressure 
of  water.  A  first  test  in  water  has  already  been  performed 
at  the  12  AT  pressure  in  the  idle  condition  (without 
external  contact  force  on  the  sensor  surface)  with  good 
results. 

5.  Conclusion 


1 


> 


X 


Figure  8  -  The  cantilever 


The  design  and  the  development  of  a  tactile  sensor 
prototype,  able  to  perform  robotic  application  in  the 
underwater  manipulation  field  have  been  described. 

The  transducer  chosen,  the  sensitive  surface  and  the 
mechanical  waterproof  support  have  been  proposed  as  a 
kind  of  solution  concerning  a  device  able  to  work  in 
hostile  environment.  Some  experimental  data  have  been 
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reported,  among  them  there  are  the  impulsive  force 
response  and  the  local  shapes  recognize. 

Detected  Force 


-10  -5  0  5  10 

Y(nrr) 


Figure  10  -  Force  versus  Y  displacement 
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Figure  1 1  -  Force  Versus  X  displacement 


This  system  must  be  still  considered  as  a  prototype.  In 
fact,  the  sensor  needs  to  be  improved  in  terms  of 
dimension  and  sensitivity.  In  fact,  the  mechanical  support 
should  be  reduced  and  the  sensitivity  of  the  sensor  surface 
must  be  increased.  So,  it  is  required  a  detailed  study  on  a 
more  complex  mechanical  structure,  on  a  more 
sophisticated  electronic  technology  in  order  to  allow  the 
use  of  a  greater  number  of  cells. 
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Abstract 

A  moving  coil-type  actuator  in  drive  motors  was 
designed  and  developed  to  control  a  drumstick  in 
automatic  drum  playing.  The  first  purpose  of  this  study 
was  to  develop  an  actuator  capable  of  the  fast  response. 
To  obtain  high  magnetic  flux  in  the  actuator,  a  rare  earth 
cobalt  magnet  is  used.  The  servo  system  functions  by 
means  of  position  feedback,  and  it  uses  a  follow-up 
control  that  allows  the  drumstick  to  follow  any  input 
waveform.  Further,  the  optimum  position  for  securing 
the  drumstick  is  determined  by  analyzing  the  base  of  the 
model  of  the  rigid  body  in  a  plane,  additionally  the 
coupling  between  the  drumstick  and  the  rotor  is 
designed. 

1.  Introduction 


an  automatic  drum-playing  device  which  is  able  to  play 
the  snare  drum  in  the  same  manner  as  a  human;  that  is,  to 
perform  the  snare  drum  by  using  a  drumstick. 

A  simple  technique  of  drum  playing,  involves  the 
drum  head  being  struck  near  its  center  with  a  drumstick, 
and  after  this  drumstick  returning  rapidly.  In  the  case  of 
human  players,  the  gripping  force  of  the  drumstick  is 
changed  from  a  strong  to  a  soft  movement  when  the 
drumstick  rebounds  from  the  drum  head.  The  drumstick 
does  not  keep  the  drum  head  from  vibrating.  In  case  of  an 
actuator,  a  fast  response  and  gain  control  instead  of 
gripping  force  is  needed.  First,  the  purpose  of  this  study 
is  aimed  at  developing  an  actuator  capable  of  fast 
response. 

2 .  Actuator 


In  recent  years,  rapid  progress  has  been  made  in 
computing  speed  and  memory  in  electronic  devices. 
Electronic  musical  instruments  have  improved  remarkably 
in  their  performance,  especially  regarding  their  ability  to 
sample  the  data  of  various  waveforms.  However,  the 
sound  they  produce  is  not  satisfying  when  compared  with 
the  actual  sounds  of  acoustic  musical  instruments,  due  to 
their  use  of  the  audio  devices.  This  study  aims  to  develop 


The  actuator  was  designed  and  developed  to  control  the 
drumstick  in  automatic  drum  playing.  Actuators  marketed 
in  the  past  had  utilized  a  DC  motor.  However,  there  were 
some  problems  involved  in  repeatedly  and  quickly  moving 
the  rotor  of  the  actuator  within  a  narrow  rotational  angel. 
Voltage  drop  was  caused  by  the  abrasion  of  the 
commutator,  and  heat  was  caused  by  the  flow  of  the 
current  through  the  rotor  in  the  motor.  As  a  result,  the 
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Figure  2.  Coil  form 


character  of  the  actuator  remarkably  diminished  the  control 
of  the  drumstick.  Therefore,  further  development  of  the 
actuator  was  necessary  in  order  to  solve  the  problems 
described  above.  A  moving  coil-type  drive  motor  [1][2] 
has  been  designed  for  this  purpose,  as  shown  in  Fig.1 . 

2.1  Magnetic  circuit 

The  rotor  in  the  actuator  is  designed  as  show  in  Fig. 2. 
The  rotor's  core  is  made  of  an  epoxy  resin  as  in  an 
enameled  wire.  The  rotor  core  is  built  in  the  form  of  a 
cylinder  in  order  to  be  as  light  as  possible.  The  enameled 
wire  is  wound  in  the  direction  shown  in  Fig.2  in  order  to 
obtain  optimum  magnetic  flow.  The  coil  resistance  of  the 
rotor  is  12.6  Q. 


Figure  3.  Magnetic  circuit 


2.1.1  Magnetic  flux 


High  magnetic  flux  has  been  obtained  in  the  actuator 
through  the  use  of  a  rare  earth  cobalt  magnet.  As  shown 
in  Fig. 3,  two  magnets  were  secured  in  the  inside  of  the 


outer  yoke.  The  gap  between  the  magnet  and  the  inner 
yoke  is  4  mm,  and  the  magnetic  flow  in  the  gap  can  be 
expressed  by  the  following  equations; 


The  cross-sectional  are  of  the  magnet  A  is 


The  cross-sectional  are  of  the  gap  A 
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Using  the  magnetic  flow  at  the  maximum  level  of  the 


magnetic  energy  B ^  from  reading  the  demagnetic  curve, 
the  magnetic  flux  in  the  gap  can  be  expressed  by  the 
equation 


1 
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where  a  and  /are  leakage  factors.  The  conditions  of  the 
leakage  factors,  a  =  1  and/ =  1,  were  used,  and  Eq.(4) 
was  calculated.  As  a  result,  in  case  of  the  present 


magnetic  circuit  the  magnetic  flux  in  the  gap  B  was  of 
the  approximate  value  0.44  T.  The  magnetic  flux  in  the 
gap  was  measured,  and  its  obtained  value  was 
approximately  0.45  T.  The  error  became  approximately  2 
%  by  comparing  the  measured  and  the  calculated  data.  As 
a  result,  the  analysis  of  the  magnetic  flux  was  indicated  to 
be  applicable  under  the  assumptions  regarding  the 
developed  magnetic  circuit. 

2.2  Coupling 

Generally,  drum  players  don’t  perform  by  gripping  the 
edge  of  the  drumstick  by  choice.  Therefore,  the  gripping 
position  and  the  coupling  of  the  drumstick  was  designed 
in  order  to  connect  the  drumstick  to  the  rotor  base  in 
consideration  of  the  actual  human  grip. 

The  analysis  used  the  model  of  a  rigid  body  in  a  plane. 
The  analytical  model  of  the  drumstick  is  shown  in  Fig.4. 
In  this  model,  the  rebounding  motion  of  the  drumstick 
striking  the  drumhead  demonstrated  that  the  drumhead  is 
struck  by  the  tip  of  the  drumstick.  And  the  drumstick  is 
secured  at  the  center  of  percussion  if  its  center  of 
percussion  could  be  derived  from  the  analysis.  Because 
the  drumstick  rotates  round  the  center  of  percussion  on 
which  the  impulsive  force  acts.  And  then,  the  drumstick 
after  it  was  struck  the  drumhead  should  be  able  to  rebound 
naturally  as  possible.  In  what  follows,  the  analysis  of 
the  center  of  percussion  is  discussed. 

The  drumstick  is  pivoted  at  a  point  other  than  its 
center  of  gravity  as  shown  in  Fig.4.  The  motion  can  be 
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Table  1 .  Specifications  of  stick 


Item 

Unit 

Value 

a 

m 

59.0 

b 

m 

225 

*G 

Kg  •  m2 

5.86X10"4 

i 

Kg  •  m2 

7.1 6X1  O'3 

given  by  the  equation 

li?=Mg  a  sin#  (5) 

where  1  is  the  inertia,  g  is  gravitational  acceleration,  M  is 
the  mass  of  the  drumstick,  and  a  is  the  distance  from  the 
rotational  center  to  the  center  of  the  gravity.  And  by 
assuming  the  motion  within  sin  6  #  6 ,  the  period  of  the 
natural  oscillation  r  can  be  expressed  by 

r  =  2  ^\jVM^g-a  (6) 

Using  Steiner's  theorem,  the  inertia  of  the  drumstick  IG 
secured  at  the  center  of  the  gravity  is 

Ig-I-M  a2  (7) 

When  the  drumstick  strikes  the  drumhead,  the 
impulsive  force  acts  at  the  drumstick's  tip.  At  that 
moment  the  motion's  change  is  exhibited  by  the  change 
of  angular  velocity  at  any  point  on  the  drumstick.  But 


Cam 


the  position  of  one  point  on  the  drumstick  remains 
unchanged;  there  is  only  unchange  point  in  the  drumstick, 
this  is  the  center  of  percussion.  Its  point  can  be  given  by 
the  equation 

ab  =  Ic/M  (8) 

The  schematic  view  of  the  coupling  and  the  Character 
of  the  drumstick,  indicated  by  the  symbols  of  a  or  b,  is 
shown  in  Fig.5  andTable.1,  respectively.  As  a  result,  the 
center  of  percussion  for  the  drumstick  can  be  obtained. 
The  drumstick  is  secured  by  coupling  at  the  position  of 
the  center  of  percussion.  The  drumstick  can  rebound 
naturally  from  the  drumhead  in  this  coupling  design.  And 
also,  the  drumstick  is  controlled  so  that  it  does  not 
prevent  the  drumhead  from  vibrating  as  freely  possible. 

2.3  Cam  for  measuring  the  position  of  the  rotor 

Measuring  the  position  of  the  rotor  was  achieved  using 
the  mechanism  of  the  cam  with  an  eddy  current-type 
sensor,  instead  of  changing  the  circular  motion  of  the 
rotor  as  the  rotor  returned.  The  projecting  part  of  the  cam 
is  designed  so  that  the  gap  between  the  sensor  and  the 
projecting  part,  as  the  rotor  returnes,  can  obtain  +0.5 
mm  to  the  rotational  angle  of  ±0.39  rad  (see  Fig.6),  and 
the  output  of  the  sensor  is  adjusted  ±1.2  V.  This 
mechanism  is  positioned  on  the  end  edge  of  the  rotor. 

3.  Synsesis  of  the  control  system 

As  shown  in  Fig. 7,  the  servo  system  functions  by 


Center  of  Percussion 
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Gi  :  Controller  gain  Am:  Amplifier 

Gp  :  Position  feedback  gain  Kt :  Torque  constant 

Gm:  Manipulation  gain  Ja :  Actuator  inertia 

Gv :  Velocity  gain  Hk :  Position  voltage  constant 

Figure  7.  Diagram  of  servo  system 


means  of  position  feedback  and  velocity  feedback, 
measuring  the  position  of  the  rotor  with  a  eddy  current- 
type  sensor  described  in  prior  the  section,  and  uses  a 
follow-up  control  that  allows  the  drumstick  to  follow  any 
input  waveform.  The  velocity  feedback  system  is 
achieved  by  using  a  differentiator  to  differentiate  the 
position  signal  from  the  sensor.  The  servo  amplifier  for 
the  touch  actuator  is  a  pulse-width  modulated  type,  with  a 
maximum  current  of  9  A.  As  a  result  of  this 
development,  the  actuator  produces  a  torque  of  0.768 
N-m/A,  a  thrust  of  2.86  N/A  at  the  drumstick's  tip,  and  a 

frequency  response  of  47  Hz,  including  the  drumstick  as 
shown  in  Fig.8. 

3.1  Drum  playing  system 

The  system  in  Fig. 9  is  comprised  of  two  actuators  with 
a  servo  mechanism  for  each  actuator.  In  the  present 
system,  two  actuators  for  the  drumsticks  are  needed  to 
control  its  motion  separately. 

In  order  to  control  this  network  of  actuators,  a  two- 
level,  hierarchically  structured  computer  system  was 
introduced.  At  the  slave  level,  4  micro-processors  ate 
employed  to  efficiently  control  the  two  actuators.  The 
interface  between  the  top  level  and  the  bottom  is  built  by 
utilizing  a  dual-ported  memory  in  integrated  circuit 
memories,  and  it  is  possible  to  transfer  the  data  quickly 


Figure  10.  Relationship  between  stick  velocity  and 
sound 

from  the  top  to  the  bottom  level.  The  CPU  system  in 
the  bottom  level  is  designed  and  developed  by  using  an  8 
bit  mode  and  a  D/A  convertor  of  12  bits. 

The  top  level  has  to  edit  musical  data  and  translate  it 
into  the  actuator's  input  waveform  parameters  based  on 
the  musical  data.  At  the  same  time  that  the  individual 
CPU  system  in  the  bottom  level  translates  the  input 
waveform  based  on  the  input  waveform  parameters,  it 
output  the  waveform  to  the  servo  system  for  the  actuator. 

4.Basic  experiment  for  drum  playing 

A  constant  speed  for  the  input  wave  form  was  used  so 
that  the  drumstick's  drumhead-striking  velocity  can  be 
controlled  by  its  speed.  The  relationship  between  the 
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5.  Conclusion 


Figure  11.  Experimental  Model 

velocity  of  the  drumstick's  tip  and  the  resulting  sound 
pressure  was  measured  using  a  sound  level  meter,  as 
shown  in  Fig. 10.  The  sound  level  meter  was  set  1  m 
above  the  drumhead.  The  drumstick  speed  at  its  tip 
indicates  it  to  be  in  proportion  to  the  sound  pressure, 
though  the  produced  sound  is  slightly  different  compared 
to  the  sound  that  a  person  produces  by  playing  a  drum. 
This  phenomenon  was  particularly  exhibited  in  regard  to 
the  strong  and  soft  tones.  Therefore,  the  experimental 
coupling,  which  secures  the  springs,  is  designed 
as  shown  in  Fig. 11.  This  mechanism  allows  the 

drumstick  to  act  in  an  up  -  and  down  motion  by  stretching 
the  springs  to  the  right  and  left.  And  also,  the  gripping 
force  cannot  be  changed  as  in  the  case  of  a  human  player 
because  of  the  consistent  stiffness  of  the  springs.  The 
servo  system  of  this  mechanism  was  designed  in  the 
manner  shown  in  Fig. 7.  The  actuator  of  this  mechanism 
produced  a  frequency  response  of  an  approximately  30  Hz 
dynamic  range.  When  the  ex-coupling  and  this 
mechanism  are  compared  in  regard  to  frequency  response, 
the  frequency  response  of  this  mechanism  is  lower  by 
approximately  17  Hz.  Yetdynamic  range  has  increased  by 
+5  dB  and  -5dB  in  the  realms  of  loudness  and  softness, 
respectively.  Additionally,  the  produced  sound  has  a  tone 
bearing  a  grearer  similarity  to  human  playing. 

Finally,  the  necessary  grip  of  the  drumstick  is  spring¬ 
like,  and  by  utilizing  a  spring  we  ensured  that  the 
drumstick  can  be  returned  naturally  and  rapidly  from  the 
drumhead. 


(1)  The  actuator  developed  could  produce  a  torque  of 
0.768  N'm/A,  a  thrust  of  2.86  N/A  at  the  drumstick's  tip, 
and  a  frequency  response  of  47  Hz,  including  the 
drumstick. 

(2)  When  the  measured  and  the  calculated  data  regarding 
the  magnetic  flux  between  the  magnet  and  the  inner  yoke 
is  compared  ,  we  see  an  approximately  2  %  error.  Thus, 
the  theoretical  equations  regarding  the  magnetic  flux  apply 
to  the  assumptions  governing  the  developed  magnetic 
circuit. 

(3)  The  drumstick  needs  to  be  gripped  at  the  axis  of  the 
rotation  if  it  is  mounted  on  a  fixed  horizontal  axis.  And 
the  position  of  the  center  of  percussion  could  be 
determined  by  utilizing  a  model  of  a  pendulum. 

(4)  The  actuator  could  produce  stable  tones  ranging  from 
loud  to  soft  by  using  a  constant  speed  for  the  input  wave 
from,  and  the  dynamic  range  obtained  was  approximately 
29  dB. 

(5)  An  experimental  coupling  was  developed  that  added 
two  springs  to  the  mechanism.  As  a  result,  loud  and  soft 
dynamic  range  increased  by  +5dB  and  -5dB,  respectively. 
And  the  sound  produced  could  have  tones  resembling 
human  playing. 
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Abstract 

A  preliminary  proof  of  concept  is  given  for  a  novel, 
all  electric,  linear  to  rotary  motion  precision  drive 
which  is  based  on  electro  or  magneto  structured 
fluid  clutches  and  electro  strictive  driver  stacks.  In 
the  embodiment  utilised  for  the  present  exercise  the 
drive  is  entirely  electrostatic. 

1.  Introduction 

Many  attempts  have  been  made  to  design  an 
electronically  controllable  producer  of  fine 
precision  rotary  motion.  Perhaps  the  handiest 
example  of  this  lies  in  the  stepper  motor  in  which 
alas,  the  regulation  of  torque  at  high  speeds  is 
severe.  This  problem  is  caused  principally  by  the 
inductive  nature  of  the  force  producing  element:  the 
greater  the  number  of  teeth  the  cogging  mechanism 
has  the  more  complicated  the  winding  structure  and, 
at  higher  speeds,  the  more  severe  the  degeneration 
of  the  wave  form  of  current.  Fine  steps  and  high 
stepping  speed  do  not  come  together.  In  order  to 
overcome  these  and  stiffness  limitations  a  new 
concept  mechanism  was  investigated. 

In  theory  magneto  strictors  and  a  powerful 
magneto  rheological  fluid  clutch  could  be  used  in 
the  drive.  For  the  present  purpose  on  all 
electrostatic  apparatus  is  used. 

2.  Machine  Concept 

The  mechanism  comprises  what  are  essentially 
matched,  variable  stroke,  positive  displacement 
piezo  drivers  which  rotate  twin  cylindrical  clutch 
driver  plates  (via  levers)  which  are  coupled  to  the 
load,  and  operate  in  tandem  by  alternate  excitation 
of  intervening  electro  rheological  fluid  -  Figure  1. 
This  shows  two  piezo  drivers  that  work  in 
synchronism  with  the  clutch  in  ratchet  fashion:  as 
each  PZT  expands  the  corresponding  clutch  is 
simultaneously  excited  into  an  unyielding  situation 
(for  a  matched  load)  and  the  output  cylinder  rotates 
by  a  small  amount.  As  the  piezo  starts  to  reverse  its 
direction  (because  the  charge  is  removed)  the  clutch 
excitation  is  also  removed,  thus  the  lever  drags  back 
the  driver  drum  to  its  ‘bottom  dead  centre’  Figure  2. 
In  sequence,  the  twinned  driving  system  takes  up 


the  other  barrel  thereby  advancing  the  load  in 
continuous  fashion. 

Speed  control  can  be  obtained  in  two  ways,  by 
adjusting  the  magnitude  of  the  voltage  pulse  (which 
gives  more  or  less  piezo  displacement)  and/or 
changing  the  frequency  of  its  supply.  Reversal  of 
the  motion  can  be  obtained  by  repeating  or  skipping 
a  pulse.  Given  the  risks  inherent  in  piezo  operation 
in  tension  the  inertial  load  of  the  barrel  has  to  be 
fixed  accordingly,  the  load  and  the  fluid  mass  being 
considered  to  be  added  to  the  output  barrel  mass  for 
this  purpose  [1],  Nevertheless,  given  this,  the  all 
electronic  dual  control  of  the  piezo  output  and 
perhaps  the  potential  advantage  of  independent 
clutch  control  seem  to  provide  the  basis  for  a 
precision,  flexible  and  potentially  electronically 
variable  configuration  of  rotary  motion  device, 
provided  compliance  and  backlash  are  restricted. 

The  main  outstanding  question  at  this  stage  is 
will  such  a  machine  actually  work?  In  the  present 
paper  a  positive  outcome  of  this  question  is  shown 
to  be  forthcoming.  The  concept  is  brought  before 
the  meeting  in  order  to  procure  industrial  interest 
towards  answering  the  second  most  important 
question  -  is  there  a  use  for  such  a  device?  Is  it 
worth  developing  and  if  so,  at  what  scale?  To  aid 
this  advance  simple  quantitative  assessments 
formulae  are  given  [2]  and  justified  by  comparison 
with  preliminary  test  results. 

3.  Calculation  Procedure 

The  output  displacement  from  each  piezo  stack 
is  amplified  by  the  nominal  lever  advantage  L/ 1  to 
give  a  linear  displacement  at  the  radius  of  lever 
connection  to  the  rotating  barrel  (r).  The  resulting 
“ideal”  performance  formulae  are  given  on  Figure 
3. 

The  input  values  to  these  formulae  come  from 
step  pulse  signals  obtained  through  a  parallel 
interface  card,  which  are  transmitted  to  a  power 
supply  (Kepco)  which  in  turn  outputs  a  zero  or  high 
tension  step  voltage  to  the  ER  clutches,  so  as  to 
perform  grip  or  slip  action.  Sawtooth  signals 
derived  from  a  12  bit  digital  to  analogue  converter, 
are  transmitted  to  a  proprietary  amplifier  which 
outputs  a  pre  set  voltage  to  matched  PZT’s  (Physik 
Instrumente),  thus  fixing  the  magnitude  and 
frequency  of  the  electrostrictor  displacement. 
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Without  the  amplifying  lever  the  PZT’s  would 
have  to  run  at  high  frequencies  to  produce  even  a 
modest  rotational  output  speed.  The  present 
arrangement  is  designed  to  run  at  1+  radians  per 
second  when  excited  at  200  Hz  or  less,  by  virtue  of 
an  approximately  10:1  lever.  The  fluid  used  was  a 
proprietary  mixture  of  about  10  kPa  max  yield  shear 
stress,  which  operated  in  the  0.5  mm  inter  electrode 
channels.  The  force  available  from  the  piezo  is 
about  10,000  N  which  translates  to  some  1000  N  at 
the  clapper/driver.  This  corresponds  with 
approximately  300  cm2  ER  shear  plane  area  (within 
a  factor  of  safety)  at  a  conservative  2  kV  clutch 
excitation.  This  arrangement  gives  approximately 
0.5  mm/piezo  stroke  movement,  at  the  barrel 
surface. 

4.  Results  and  Conclusions 

Initial  tests  at  zero  output  load  showed  a 
displacement  efficiency  (measured/  theoretical 
output  displacement)  of  up  to  83%.  The  loaded 
output  shaft  shows  some  slippage  -  Figure  4.  No 
doubt  this  is  due  to  a  combination  of  solid  part  and 
ERF  compliance,  imperfect  timing  and  the 
limitations  imposed  by  the  constant  power  driver 
amplifier  [2]. 

A  novel  rotary  stepper  motor  is  thus  in 
prospect.  The  precision  linear  actuator  is 
programmable  with  speed  and  stepwidth  being 
adjustable.  Additional  ERF  control  offers 
possibilities  in  position/velocity  working  which 
needs  further  study  and  the  production  of  more 
comprehensive  relevant  test  results  -  in  the  intended 
speed/torque  domain. 


A  magneto  rheological  suspension  could  be 
expected  to  provide  up  to  ten  times  more  low  shear 
rate  yield  shear  stress  (and  specific  torque)  at  the 
same  order  of  viscosity  (100/200  mPas),  as  a  typical 
ERF.  It  would  need  a  somewhat  bigger 
construction  of  clutch  to  enable  placement  of  a 
magnetic  circuit  which  would  probably  be  limited 
to  an  unforced  switching  response  time  of 
approximately  5  milli  seconds.  The  increased 
performance  could  presumably  be  offset  by  higher 
weight  and  cost  and  possibly  lower  speed. 
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Figure  1.  General  arrangement  of  the  digital  electro  static  power  train:  the  piezo  stacks  reciprocate  to  drive  the 
output  drum  continuously  via  ER  clutches.  The  speed  of  motion  can  be  varied  by  adjustment  of  the 
piezo  excitation  magnitude  and  frequency;  at  low  speeds  steps  down  to  0.01°  should  be  possible. 
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Figure  2.  Synchronised  motion  of  piezo’s  and  electro  rheological  fluid  clutches. 


DISPLACEMENT  AMPLIFICATION  -  L/I  =  10 
FORCE  ATTENUATION  I/L  =  1/10 
AVAILABLE  TORQUE  =  [F]  x  [I/L]  x  [r] 

TRANSMISSIBLE  TORQUE  =  [ERF  AREA]  x  [ERF  STATIC  YIELD  STRENGTH]  x  R 
OUTPUT  SPEED  =  [FREQUENCY]  x  [10Y]  x  [2  BARRELS]  x  [1/V]  RAD/SEC 


Figure  3.  ‘Unloaded’  formulae  for  estimation  of  ideal  (no  slip/regulation)  performance. 
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Figure  4.  Sample  characteristic  of  the  prototype  power  train  on  load 


The  2nd  International  Conference  on  Recent  Advances 
in  Mechatronics,  Istanbul,  Turkey,  May  24-26,  1999 


©  UNESCO  Chair  on  Mechatronics 
Bogazisi  University,  Istanbul,  Turkey 


Design  a  Capacitive  and  Perseptible  Tactile  Sensor  for  Robot  Hand 


Francisco  Flavio  Cordova  Quiroz 
Mauro  Francisco  Espinosa  Coss 

Universidad  del  Valle  de  Mexico,  Campus  Tlalpan,  Departamento  de  Cienciay  Tecnologia 
San  Jan  de  Dios  #6,  Col.  Exhacienda  de  San  Juan,  C.P  14370,  Mexico  D.F.,  Mexico 
Tel:  (525)  2  38  53  00,  Ext:  04202,  04216,  02245,  Fax:  (525)  6  71  25  91 
Email:  jfcq@dfl .  tehnex.  net  mx,  fcordovamredipn.  ipn.  mx 


Abstract 

This  work  take  the  sensibility  hand  construction,  it  is 
based  in  two  systems,  the  sensitive  system  (skin),  and 
the  nervous  system  for  the  signal  transmission,  the 
prototype  design  satisfy  all  requirements  to  sellect  from 
one  exited  point  to  other,  such  as  the  pressure  on  this 
point  to  send  a  computer,  comparing  [1]  that  doesn't 
define  the  precise  exited  point. 


1.  Introduction. 

Studying  the  human  arm  and  hand  composition,  it's 
possible  observe  that  is  composed  of  four  basic  systems, 
which  are  the  most  important  to  an  appropriate  and  a 
proper  way  of  working,  those  systems  are  : 

•  Osseous  System. 

•  Muscular  System. 

•  Sensitive  System. 

•  Local  and  Peripherical  Nervous  System. 

Where  each  system  has  special  and  particular  functions, 
all  this  systems  are  syncroniced  to  work  toogether,  and 
the  resolt  is  a  big  and  complex  system.  Now  it's 
describing  all  systems  emphasizing  the  most  important 
in  our  work. 

1.1.  Osseous  System 

This  is  in  charge  of  sustaining  all  our  body,  it  gives 
consistance  and  firmness,  over  the  rest  of  all  other 
systems,  comparing  this  versus  a  robot  then  we  can  say 
that  this  is  it's  structure,  see  figuire  #1. 


1.2.  Muscular  System 

With  the  use  of  this  system  we  obtain  all  the  movements 
of  tlie  human  body  and  we  can  move  and  displace  any 
object  or  body,  only  if  the  maximum  weight  of  the 
object  is  drawn  by  force  and  it  is  produced  by  the 
muscles,  wheras  in  a  robot  or  mechanic  arm,  we  are 
talking  about  file  force  that  is  produced  by  file  motors  of 
the  unit  and  its  structure  . 

1.3.  Sensitive  System 

The  sensitive  system  is  associated  with  the  skin  and 
throug  this  it  is  possible  to  detect  the  next  three 
parameters  of  main  importance  to  evaluate  the  different 
stimulus  of  the  environment,  for  a  Robot  its  the  sensor 
and  tranducers  to  acquire  this  stimulus,  these  parameters 
are: 

•  Pressure. 

•  Temperature. 

•  Rugosity  of  the  Surface. 
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1.4  Nervous  System 

This  system  is  in  charge  of  transmiting  the  diffemt 
impulses  that  receive  and  send  any  part  of  the  human 
body  to  the  brain,  this  system  is  formed  for  an 
entireness  organs  whose  functions  are:  relationship  life, 
and  organic  function  automatic  control. 

2.  Analysis 

The  sensitive  system  or  skin  has  two  parts;  an  external 
called  epidermis  and  an  internal  called  dermis. 

Under  the  epidermis  we  can  find  the  dermis  that  is  the 
most  internal  layer  of  the  skin,  it  is  formed  by  living 
cells,  that  adopt  the  form  of  the  closest  one  for  the 
exterior,  they  form  a  surface  with  entrances  and  exits 
called  papilas,  in  the  same  way  we  can  find  nervous 
terminals  in  charge  of  registering  the  tact  stimules, 
temperature  and  pressure. 

The  skin  has  little  nervous  terminals  that  monit  all 
actions  that  we  have  done  on  it.  once  we  have  a  contact, 
it  transmits  a  little  electrical  impulse  that  goes  to  the 
peripherical  nervous  system,  but  if  the  impulse  is  bigger 
than  a  different  one,  it  will  go  to  the  nervous  central 
system  and  then  its  reflection  will  activate  itself  to  avoid 
damages  into  the  contact  point,  see  figure  #2. 


Figure  #2,  Nervous  Terminals  in  Charge  of 
Registering  the  Tact  Sti mules 


The  function  of  these  nervous  terminals  is  to  protect, 
and  it  forms  one  of  the  main  special  senses  of  the 
human  body,  the  tact. 


Now  if  we  examine  the  terminals  that  we  have  in  the 
skin  they  act  as  little  switches  that  indicates  the  contact 
point  and  evaluate  the  pressure  that  we  are  putting  on  it. 

If  we  separate  this  effect  like  electrical  fenomeno,  we 
have  two  separate  subsystems,  the  firts  one  is  the  tactile 
subsystem  in  on/off  sensibility  form,  the  second  one  its 
the  evaluated  pressure  subsystem,  so  we  will  watch  that 
the  circuit  has  been  closed,  but  we  haven't  quantified  the 
pressure  that  it's  putting  on  it. 

To  solve  this  problem,  we  can  use  a  pressure  membrane 
as  an  evaluating  element,  where  the  deformation  of  the 
membrane  sends  an  electric  impulse  that  is  proportional 
to  the  total  pressure  that  it's  putting  on  it. 

We  can  also  solve  this  problem  using  variable 
capacitors,  because  if  we  alter  the  length  that  separates 
the  plates  of  the  capacitor,  we  will  vary  it's  capacitive 
value,  and  we  also  have  a  proportional  variation  to  the 
pressure  that  it's  putting  on  it  but,  this  will  be  considered 
in  the  second  part  of  the  work. 


The  nervous  system  as  we  see  is  divided  anatomicaly  in 
two  parts:  central  nervous  system,  and  peripherical 
nervous  system,  like  shown  in  the  figure  #3. 


Figure  #3,  Peripherical  Terminal  Nervous  System 

The  organs  of  the  central  nervous  system  are: 
encephalon  and  spinal  marrow,  and  the  organs  of  the 
peripherical  nervous  system  are:  the  nervous  properly 
said,  (organs  are  not  mentioned  in  the  study). 

The  anatomic  functional  unit  of  the  nervous  system  is 
the  neuron,  which  is  constituted  by  a  cell  body  that  has 
several  prolongations;  the  largest  is  called  cylinder  axis, 
and  the  shortest  is  called  dendrite,  see  figure  #4. 
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Figure  #4,  Neural  Squematic  Diagram 


The  neural  body  is  the  place  where  functional 
information  ocuits,  it  generates  a  stimulus  or  registers  it 
and  it  is  the  cylinder  axis  which  works  like  a  conduct 
way  to  the  stimulus,  through  the  synapsis  the  stimulus 
crosses  each  neuron.  The  stimulus  transmition  system  is 
reflected  by  the  cables  and  the  footprints  of  the  physical 
system. 


As  well  as  the  quantity  of  these  nervous  terminals  by 
square  centimeter  is  too  big,  due  to  its  short  gap 
between  each  other,  it  involves  the  development  of  a 
system  that  detects  each  terminal  in  an  independent 
way. 


In  our  work  we  realized  an  arbitrary,  and  not  so  detailed 
division  of  the  human  hand  surface,  that  it  could  have 
124  contact  points  and  one  evaluation  of  the  pressure 
that  we  are  putting  on  it,  like  shown  in  the  figure  #5. 


Figure  #5,  Contact  and  evaluation  Points  on  the  Hand 


3.  Development 


Now  it's  known  the  sensitive  system,  is  possible  to 
design  a  system  that  realize  the  same  functions,  but  with 
on/off  sensibility. 


Then  the  sample  and  control  systems  for  this  points  it's 
a  keyboard  controller,  because  through  this  devices  it  is 
possible  detect  the  activated  points  in  diferent  time.  In 
the  study  of  this  devices  we  find  some  controller  that 
fulfill  own  necessity,  like  the  intel  controllers  [2]  which 
as  the  8279, 83C51KB  and  the  8XC51SL,  all  this  devices 
are  keyboard  controllers  but  the  83C51KB  was  selected 
because  thus  characteristics  and  ease  use  like  shown  the 
diagram  block  in  the  figure  #6. 


Figure  #6,  Diagram  Block  of  83c51  kb  Keyboard 
Controller 


For  the  first  stages  it's  necessary  detect  the  on/off 
pressure  in  the  exitated  point,  so  the  electrical  system  of 
the  robot  hand  is  the  same  such  a  matrix  keyboard 
configuration,  shown  in  the  figure  #7. 


Figure  #7f  Electrical  Diagram  to  On/Off  Pressure  in 
The  Robot  Hand 
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Well,  the  matrix  is  distributed  in  all  sensitive  hand 
surface,  to  obtain  al  the  contact  point  we  most  to  use  the 
8  scaner  input  lines  “SIL”  1  line  for  each  finger  and  the 
rest  for  the  palm,  the  rest  16  scaner  output  lines  “SOL” 
it's  to  made  the  contact  point  of  each  division  zone. 

Due  the  83C51KB  works  in  a  frecuency  of  4  to  6  MHz, 
then  the  sample  time  is  in  a  range  of: 

TmEnc=4  ps  and  1^  =  2 .6ps 

This  time  is  better  than  the  response  time  of  the  hand, 
the  sample  frecuency  depends  of  the  resistance  value 
conected  between  Vcc  and  RCin  terminal,  and  is 
characterized  by  the  graphic  #  1 . 


nCW  PKv^4cticy  y*. 


Graphic  #1 ,  RCin  Frecuency  (5vdc  At  Romm 
Temperature) 

In  Hie  second  stage,  is  implanted  a  sencible  capacitance 
device  to  pressure  measurement  on  the  contact  point, 
where  the  change  of  capacitance  is  proportional  to  the 
pressure. 

So  for  the  pressure  measurement  we  need  a  Analogical 
Digital  Convert  “D.A.C”  to  evaluate  the  magnitud  of  the 
analogical  signal,  with  this  new  function  the  diagram 
system  change,  like  shown  the  figure  #8. 

Due  this  modifications  the  83C51KB  controller  no  is  the 
only  device  that  work  in  the  hand,  now  we  have  to  desig 
a  new  circuit  device  to  solve  the  next  problems: 

a)  Calibration  of  the  Sensitive  Device. 

b)  Capacitance  Measurement. 

c)  Combination  between  the  Analogical  and  Digital 
Signals 


Figure  #8,  Diagram  Block  of  the  New  System 


To  solve  this  problems,  we  applicated  a  capacitive  circuit 
to  isolate  the  signals  for  each  point  of  the  matrix,  so  one 
side  we  get  the  digital  signal  and  in  the  other  hand  the 
analogical  signal,  to  make  ease  the  pressure 
meassurement  process,  because  we  use  the  capasitive 
efect  in  a  filter  when  the  noise  doesn't  affect,  now  Hie 
circuit  used  is  in  the  figure  #9. 


Get  the  Analogical  Signal 

Were  the  resistance  is  1  MOhm  and  the  total  impedance 
is  too  high,  so  doesnt  afect  the  digital  pulse  magnitud, 
and  the  output  circuit  get  the  analogical  signal  that  is 
characteriside  by  the  equation  of  this  circuit  (1). 

2  tvvCR.V, 

V  = - —  (1) 

0  1+2  mCR2  K) 

that  represent  the  analogical  output  magnitude  in 
function  of  the  capacitive  variation  in  a  range  of 
0.2951pf  to  8.854pf  in  a  frecuency  of  6  MHz,  like  shown 
in  the  graphic  #2. 
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Graphic  #2,  Variation  Output  Voltage  with 
Capacitance  Variation 


With  this  circuit,  we  have  a  frecuency  too,  where  our 
principal  objetive  is  the  magrutud  messearument  of  the 
analogical  signal,  for  this  case  we  need  a  enbolment 
detector  circuit  by  improving  the  system,  that  is 
charectized  by  the  next  equation  (2). 

V(t)  =  V0e-'IRC  (2) 


-  - —  V 

Figure  #1 1 ,  Diagram  Block  of  the  Sensitive  Tactile 
System 
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Where  x  RC  and  x  —  10/v,  now  we  can  see  all  this 
funtions  in  the  complete  circuit,  figure  #10. 
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Figure  #10,  Total  Circuit  to  Obtain  the  Analogical 
Signal 


4.  Conclusions 

In  the  prototype  construction,  we  use  digital  devices  that 
were  not  design  to  this  functions,  but  the  adjust  fulfill 
the  goal. 

Besides  the  uses  the  analogical  circuit  to  pressure 
meassurement  in  any  point  of  the  matrix,  permits  a  full 
integration  in  a  mechanical  hand  like  a  human  hand  in 
the  perseptible  tactile  function,  all  that  show  the  function 
of  all  the  system  and  the  possibility  to  made  complex 
systems,  see  figure  #11. 
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Abstract 

Piezoelectric  actuators  have  found  many  applications  in 
mechatronics  systems  because  of  their  ability  to  provide 
large  forces  in  very  short  times.  To  overcome  the  limits  of 
the  tiny  displacements  available ,  different  amplification 
mechanism  have  been  envisaged,  namely  integration  of 
multiple  steps  (‘inchworm ),  direct  mechanical 
amplification  (simple  lever,  frames)  and  transfer  of 
impulse  to  a  projectile.  Especially  in  the  last  two 
approaches,  the  dynamic  characteristics  of  the 
piezoelectric  actuator  are  of  primary  importance. 
Manufacturers'  catalogues  for  commercially  available 
piezo  stacks,  whilst  providing  important  electrical  and 
basic  mechanical  compliance  data,  usually  do  not  give 
sufficient  information  on  the  dynamic  behaviour  of  these 
devices.  This  paper  provides  experimentally  observed 
dynamic  performance  data  for  several  piezo  stacks  of 
differing  sizes  and  from  different  manufacturers.  The 
observed  motion  is  compared  with  that  predicted  by  a 
simple  Matlab-Simulink  model  and  suitable  model 
parameters  estimated. 

1.  Introduction 

Piezoelectric  stack  type  actuators  are  beginning  to  be 
used  in  a  wide  variety  of  applications  on  account  of  their 
ability  to  generate  large  forces  very  quickly  [1].  However, 
they  generate  micron-scale  displacements.  In  order  to  use 
them  in  practical  machinery  applications  it  is  usually 
necessary  to  amplify  their  output  motion  in  some  way. 
There  are  three  commonly  applied  approaches  to 
increasing  output  motion. 

The  first  is  to  integrate  many  cycles  of  motion  as  is 
accomplished  by  the  ‘inchworm’  linear  motor  [2]  or,  in  a 
different  way,  by  travelling  wave  (ultrasonic)  piezo¬ 
motors  [3,  4].  The  piezo  devices  are  usually  driven  in  a 
resonant  mode.  The  second  method  is  to  use  a  mechanical 
amplifier,  typically  a  flexure  hinged  structure  [5].  The 
third,  and  currently  least  exploited  approach  is  to  use  a 
‘ballistic’  or  ‘impulse’  approach  in  which  the  piezo  stack 


exchanges  energy  with  a  ‘projectile’  which  provides  the 
output  effort  by  virtue  of  its  kinetic  energy  and  is  capable 
of  moving  through  a  useful  distance  [6,  7]. 

In  all  three  displacement  increasing  approaches  the 
dynamics  of  the  piezo  stack  are  important.  However,  for 
the  latter  two  approaches  (direct  amplification  and 
‘impulse’  actuation)  a  good  knowledge  of  the  dynamic 
characteristics  of  the  piezo  stack  becomes  essential.  For 
amplifiers  it  is  needed  in  order  to  optimise  the  combined 
piezo-amplifier  structure,  whilst  for  impulse  devices  the 
energy  transfer  mechanism  needs  to  be  carefully 
optimised,  which  requires  piezo  dynamics  information. 

It  is  well  known  that  piezoelectric  devices  of  the 
previous  generation  had  to  be  fed  with  very  high  voltages, 
which  required  special  care  in  the  design  of  the  power 
supply  system.  Exciting  voltages  have  significantly 
dropped  (typically  down  to  100  V)  in  piezoelectric  multi¬ 
layer  actuators,  where  many  (60-150)  layers  of  reduced 
thickness  (in  the  order  of  100  pm)  are  mechanically 
assembled  in  series  to  form  a  stack  while  being 
electrically  connected  in  parallel.  Thanks  to  the  reduced 
thickness  of  each  active  layer,  the  electric  field  induced 
by  a  given  potential  is  greatly  increased.  ‘Soft’ 
piezoelectric  materials  (like  PZT  5)  are  usually  employed 
in  actuator  construction,  because  they  provide  higher 
maximum  displacements  (although  with  higher 
hysteresis)  -  the  piezoelectric  coupling  constant  d33  is 
typically  between  5  and  7  (xlO10  m/V)  and  the  electric 
field  can  be  as  high  as  1-2  MV/m. 

The  piezoelectric  multilayer  actuators  examined  in  this 
paper  are  commercially  available;  they  can  be  seen  in 
figure  1.  Actuators  A  and  B  are  nominally  identical 
except  for  the  cross  sections  (A  is  5x5  mm2  and  B  is 
2x3  mm2);  they  are  10  mm  in  height  with  active  layers 
100  pm  thick.  Therefore  they  are  expected  to  have  the 
same  resonant  frequency  and  maximum  displacement  but 
different  output  force  and  capacitance  (which  affects  the 
dynamic  response).  Actuator  C  is  5x5x14  mm3  and  has 
140  active  layers  about  100  pm  thick.  Stack  D  is 
5. 2x5. 2x3. 5  mm3  and  has  slightly  thicker  layers 
(125  pm). 
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pS  1:  Ph°togr*ph  of  some  commercial  stack  piezoelectric  actuators  from  various  manufacturers. 
Experimental  results  on  stacks  A  to  D  are  reported  in  this  paper. 

A  simple  computational  model  has  been  developed  the  mechanical  and  the  electrical  sub-systems  separately 

"h  the  am,  of  prod.chng  the  dynamic  rexponee  of  the  and  to  taking  into  account  tor  intmeto. 

ac  a  ora,  given  some  technical  data  (m  figure  2  the  approximation,  the  piezoelectric  multi-layered  atructure  is 

f”  .*  »  reproduced;  if  dte  replaced  in  the  mechanical  aub-syato  ty TmSS 

stack  is  fixed  at  one  end,  the  mass  is  multiplied  by  4).  The  system  -  where  the  measured  mass  o/the  actuator  is 

S  “Ied  r?  Simulink®  (a  toolbox  of  **  multiplied  by  a  suitable  constant.  The  parametric  values 

atlab®  software  package)  with  the  idea  of  representing  for  the  model  have  been  obtained  from  manufacturers’ 
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Figure  2:  Schematic  of  the  Simulink®  model  for  a  stack  free  at  both  ends. 
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Figure  3:  Experimental  results  on  stack  A.  Both  end  faces  are  free,  driving  voltage  is  50  V.  The  figures 
near  each  curve  represent  the  values  of  the  current  limiting  resistor. 


data  or  deduced  from  measured  quantities  (like  the 
capacitance).  In  the  mechanical  system  the  parameter  m0 
is  added  to  take  into  account  the  mass  of  inactive  layers 
(typically  present  at  the  top  and  bottom  ends  of  the 
piezoelectric  actuator). 

The  electrical  sub-system  is  the  top  part  of  figure  2.  A 
ramp  function  with  saturation  at  the  maximum  exciting 
voltage  simulates  the  action  of  the  electronic  switch.  For 
the  MOSFET  used  in  the  experiments  this  ramp  is 
effectively  a  step  and  so  a  ramp  rate  of  107  V/s  is  used  in 


the  simulations  shown  here.  Rcharge  represents  a  series 
resistor  which  limits  the  current  flow  into  the  piezo 
actuator  which  is  modelled  as  a  perfect  capacitor, 
although  resistive  characteristics  can  be  accommodated 
by  modification  to  the  values  of  Rcharge  and  Rbleed 
employed  (Rbieed  nominally  models  a  discharge  resistor  on 
our  drive  circuit,  in  parallel  with  the  actuator).  D  is  the 
electric  displacement  resulting  from  the  charge  on  the 
actuator  capacitance.  To  enable  Matlab®  to  solve  the 
algebraic  loop  a  starting  value,  close  to  zero,  is  imposed 


000.E0  50.E-6  100.E-6  150.E-6  200.E-6  250.E-6  300.E-6 

Time  (s) 

Figure  4:  Model  output  for  stack  A  with  both  surfaces  free  and  a  driving  voltage  of  50  V.  The  figures 
near  each  curve  represent  the  values  of  the  current  limiting  resistor. 
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as  MyGuess.  The  electric  field  developed  is  fed  from  the 
electrical  sub-system  to  the  mechanical  one  by  the  link 
between  the  EPiezo  and  Strain  blocks.  The  gain,  d, 
represents  the  d33  value  of  the  piezo  material  which 
converts  the  electric  field  to  strain  and  this  is  transformed 
to  stress  via  the  mechanical  compliance,  s.  This  is  used  to 
determine  the  force  accelerating  the  lumped-mass  of  the 
actuator  (S  is  the  cross-section  area  of  the  actuator).  After 
integration  twice  to  yield  the  displacement,  this  is 
converted  into  strain  and  fed  back  into  the  mechanical 
sub-system.  The  feedback  link  between  the  two  sub¬ 
systems  is  completed  by  transducing  the  stress,  via  d,  and 
adding  it  to  the  electric  displacement.  Some  features  of 
the  model,  like  the  bleeding  and  the  current-limiting 
resistors  and  the  voltage  ramp,  are  specific  for  the  driving 
circuit  used. 

In  order  to  collect  experimental  data,  the  piezoelectric 
stacks  were  driven  by  an  electronic  circuit,  featuring  a 
MOSFET  with  a  specific  driver  IC  controlled  by  a  TTL 
signal,  designed  to  provide  very  fast  switching  times.  A 
power  unit  with  very  large  reservoir  capacitor  and  low 
impedance  supplied  a  controllable  high  voltage. 


The  motion  of  the  top  surface  of  the  actuator  was 
monitored  using  a  Polytec  Laser  Doppler  Vibrometer 
(OFV  512  interferometer  +  OFV  3001  controller).  A 
piece  of  retro-reflective  tape,  of  negligible  mass,  was 
stuck  on  the  top  surface  of  the  actuator  to  optimise  the 
signal  to  noise  ratio.  Velocity  range  varied  from 
5  mm/s/V  to  1000  mm/s/V  depending  on  the  actuator  and 
on  the  current  limiting  resistor,  while  filtering  was  always 
off,  leading  to  maximum  bandwidths  ranging  from 
250  kHz  to  1.5  MHz.  The  signal  from  the  vibrometer  was 
fed  into  a  20  MHz  oscilloscope  triggered  by  the  same 
TTL  signal  which  drove  the  MOSFET  driver  IC;  the 
acquired  oscilloscope  traces  were  transferred  to  a  PC  for 
further  analysis. 

Piezoelectric  actuators  A  to  C  were  tested  at  50  V  in 
completely  free  conditions  (by  placing  them  on  a  very 
low  stiffness  material)  and  at  100  V  with  the  bottom 
surface  glued  onto  a  20x20x1  mm3  alumina  insulating 
plate,  which  was  in  turn  glued  to  a  7  mm  thick  steel  base 
plate.  Piezoelectric  actuator  D  was  tested  at  155  V. 


Frequency  (Hz) 

o.?w  L1°Ur!er  transform  °f  the  curves  in  figures  3  and  4.  Experimental  data  (black)  and  model 
output  (grey  -  lower  curve  of  each  pair). 
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Time  (s) 

Figure  6:  Experimental  data  on  all  the  tested  stacks,  driven  at  the  voltages  shown,  with  a  current 
limiting  resistor  of  5  Q  and  with  one  end  fixed. 

2.  Results  and  Discussion  surfaces  free,  50  V  maximum  excitation).  It  is  clear  that 

the  model  can  predict  the  general  behaviour  of  the 
Figures  3  and  4  show  the  experimental  results  and  the  piezoelectric  stack  quite  well,  except  for  slightly  larger 

model  output  for  stack  A  under  the  same  conditions  (both  overshooting  in  the  model  which  could  be  related  to 


OOO.EO  50.E-6  100.E-6  150.E-6  200.E-6  250.E-6  300.E-6 

Time  (s) 

Figure  7:  Model  output  for  all  the  tested  stacks  with  a  current  limiting  resistor  of  5  Q  and  with  one  end 
fixed  (the  same  conditions  as  in  figure  6). 


465 


having  neglected  any  mechanical  frictional  losses.  From 
figures  3  and  4  the  effect  of  the  current-limiting  resistor 
(Retrace)  can  be  seen:  the  lower  this  value,  the  faster  the 
response,  but  at  the  expense  of  a  higher  stressing  of  the 
stack.  In  figure  5  the  Fourier  transforms  of  these  data  are 
shown.  The  parameter  mo  of  the  model  was  calibrated  to 
match  the  resonant  frequency  observed  at  Rc,la^e=  5  Q 
and  set  to  0.8  g  (the  mass  of  the  stack  is  2.17  g).  It  is 
interesting  to  notice  how  the  model  can  predict  the  shift 
toward  higher  values  of  the  main  resonant  frequency  with 
increasing  current-limiting  resistor  (Rctefge),  although  it 
overestimates  this  effect  (which  is  much  more  evident  in 
the  experimental  results  on  other  stacks,  not  reported 
here). 

Figures  6  and  7  show  the  experimental  results  and  the 
output  from  the  model,  respectively,  for  all  the  actuators 
tested,  with  Rc!laT.e=5  fi  (note  that  the  curves  have  been 
vertically  displaced  to  avoid  overlap).  Again,  a  good 
agreement  between  measured  and  predicted  data  can  be 
noticed,  with  the  aforementioned  overshooting. 

3.  Conclusions 

The  model  can  be  seen  to  provide  a  reasonable 
approximation  to  the  behaviour  of  the  piezo  stacks. 
However,  the  simplifying  assumption  of  a  lumped-mass 
model  for  the  multilayer  actuator  in  the  mechanical  sub¬ 
system  prevents  the  model  from  predicting  the  higher 
harmonics  which  are  found  in  the  Fourier  transforms  of 
the  experimental  data.  For  many  engineering  applications 
this  will  not  be  problematic  and  the  model  will  provide 
sufficiently  accurate  results  for  initial  design  of  dynamic 
piezoelectric  actuation  systems. 

It  is  hoped  that  this  work  will  be  of  use  to  anyone 
developing  high  speed  piezo  actuators  for  machinery 
applications. 
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Abstract 

A  micromanipulator  is  a  precision  positioning  device  that 
often  uses  piezoelectric  actuators.  Piezoelectric  actuators 
have  disadvantageous  properties  such  as  hysteresis 
nonlinearity  and  drift.  This  paper  discusses  a  hybrid  model 
to  describe  these  properties.  The  model  consists  of  a 
dynamic  linear  part  and  a  static  nonlinear  part.  The 
dynamic  part  describes  the  dynamic  response  and  the  drift 
property.  The  static  nonlinear  part  adapts  the  model  to  the 
hysteresis  nonlinearity  and  gain  nonlinearity.  Simulation 
and  experimental  results  show  that  the  model  is  able  to 
predict  the  drift  and  the  nonlinear  properties  of  the 
actuator,  including  the  minor  loop  property  of  hysteresis. 
This  actuator  model  serves  the  basis  for  the  nonlinear 
micromanipulator  model. 

1.  Introduction 

A  micromanipulator  is  a  precision  positioning  device 
having  applications  in  biotechnological  operations,  the 
assembly  of  microelectro-mechanical  systems,  and  the 
testing  of  microelectronics  circuits  [1].  It  requires  actuators 
having  displacement  resolution  of  nanometers,  high 
stiffness  and  fast  frequency  response.  These  requirements 
can  be  fulfilled  by  using  piezoelectric  actuators.  Therefore, 
piezo  actuators  have  been  widely  used  in  many 
micromanipulators  and  other  precision  positioning  devices. 
However,  piezo  actuators  have  such  properties  as 
hysteresis  nonlinearity  and  drift,  which  remarkably  reduce 
their  accuracy.  As  the  micromanipulation  applications 
become  increasingly  demanding,  requirements  for  better 
control  design  are  also  raised.  A  good  manipulator  model 
will  help  the  control  design.  Such  a  model,  in  turn,  requires 
proper  dynamic  models  for  the  piezo  actuators. 

The  properties  of  piezoceramic  actuators  have  been 
studied  for  many  years  and  different  approaches  have  been 
used.  IEEE  Ultrasonics,  Ferroelectrics  and  Frequency 


Control  Society  formulated  linearized  constitutive  relations 
describing  piezoelectric  continua,  published  in  1966  and 
revised  in  1987  [2].  Leigh  and  Zimmerman  discussed  an 
implicit  algorithm  for  predicting  the  hysteresis  behaviour 
of  piezoceramic  actuators,  employing  the  trapezoidal  rule 
for  stepping  the  equations  forward  in  time  [3].  Jung  and 
Kim  presented  a  feed-forward  control  method  with  three 
different  deterministic  models  to  reduce  the  hysteresis 
effect  in  piezoceramic  actuators  [4].  Ge  and  Jouaneh 
modelled  the  hysteresis  nonlinearity  of  a  piezoceramic 
actuator  using  numerical  Preisach  models  [5],  [6]. 
Goldfarb  and  Celanovic  developed  a  generalized  Maxwell 
model  of  a  piezoelectric  stack  actuator  where  the  static 
hysteresis  was  identified  as  energy  storage  coupled  to  rate- 
independent  dissipation  and  was  represented  by  a 
generalized  elasto-slip  model  [7].  The  emphasis  has  been 
on  hysteresis  nonlinearity:  models  are  often  static,  and  drift 
modelling  is  largely  ignored. 

In  this  paper,  we  use  a  hybrid  approach  to  model  the 
piezohydraulic  actuators  of  a  micromanipulator  developed 
at  the  Tampere  University  of  Technology  and  the  Helsinki 
University  of  Technology.  The  model  consisting  of  a 
dynamic  part  and  a  static  part  is  based  on  the  method 
presented  in  [9].  The  dynamic  part  of  the  model  describes 
the  dynamic  behaviour  of  the  actuator  including  the  drift 
property.  The  hysteresis  and  gain  nonlinearities  are 
modelled  statically  using  neural  networks.  Here,  we  extend 
the  hysteresis  model  such  that  it  is  able  to  take  into  account 
not  only  the  major  hysteresis  loop  but  also  minor  loops, 
which  is  very  important  for  any  practical  model. 

In  the  next  section,  the  actuator  and  the 
micromanipulator  under  study  will  be  briefly  discussed. 
Section  3  presents  the  structure  of  the  hybrid  nonlinear 
model.  The  simulation  and  experimental  results  are 
presented  and  discussed  in  section  4.  Section  5  concludes 
the  paper. 
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2.  The  micromanipulator 


A  micromanipulator  is  a  device  that  facilitates  the 
remote  handling  of  microscopic  objects  under  computer- 
assisted  human  control.  The  operator  obtains  visual 
information  about  the  end-effector  and  micro  objects  using 
a  microscope  and  a  CCD  camera,  as  shown  in  Fig.  1.  The 
micromanipulator  can  be  controlled  either  by  using  a 
joystick  or  a  PC  keyboard.  Automatic  operations,  such  as 
automatic  injections,  can  be  activated  using  the  keyboard. 


Figure  1:  Concept  of  micromanipulation. 


2.1  Actuation  system 

The  actuation  system  of  the  micromanipulator  consists 
of  a  piezoelectric  actuator,  a  small  tank  and  a  metallic 
bellows,  as  illustrated  in  Fig.  2  [8].  The  piezo  actuator  is 
placed  in  the  tank  filled  with  hydraulic  oil.  When  a  voltage 
is  applied  to  the  piezo  actuator,  it  deforms.  When  the 
actuator  buckles,  oil  flows  from  the  tank  to  the  bellows 
which  elongates,  and  vice  versa:  when  the  actuator  gets 
straightened,  oil  flows  from  the  bellows  to  the  tank.  Since 
the  effective  area  of  the  bellows  is  smaller  than  that  of  the 
actuator,  the  displacement  is  magnified.  The  results  of 
displacement  experiments  have  shown  that  the  movement 
range  of  the  actuator  is  about  ±  250  pm. 

The  piezoelectric  actuator  (RAINBOW™)  is 
structurally  similar  to  unimorph  type  of  elements.  The 
RAINBOW™  actuator  is  a  single  element  structure 
consisting  of  a  PZT  side  and  a  metallic  side  [10].  When  a 
voltage  is  applied  opposite  to  the  poling  field,  the  wafer 
buckles  and  when  the  voltage  is  parallel  to  the  poling  field, 
the  wafer  straightens. 

The  bellows  is  a  spring  type  of  passive  component 
where  the  force  required  to  deform  the  bellows  is  directly 
proportional  to  the  displacement.  The  bellows  can  be 
considered  as  a  linear  system. 

2.2  Manipulator  structure 

The  manipulator  has  a  Stewart  platform  [11]  type 
parallel  structure  consisting  of  three  identical 


Figure  2:  Overview  of  the  actuation  system. 


1 .  End-effector  2.  Mobile  platform  3.  Bellows 

4.  Magnet  mechanism  5.  Piezo  actuator 


Figure  3:  Overview  and  photograph  of  the 
micromanipulator. 

piezohydraulic  actuation  systems  as  described  in  [12].  The 
actuators  are  connected  by  a  mobile  platform  resulting  in  a 
tripod-like  parallel  configuration,  Fig.  3.  The  mobile 
platform  is  fixed  to  each  bellows  using  a  venting  screw  and 
a  small  nut.  The  end-effector  (needle)  is  mounted  on  the 
platform. 

This  manipulator  differs  from  the  conventional  Stewart 
platform  type  of  structures,  since  the  bending  of  the 
bellows  facilitates  a  joint-free  structure.  Using  the  bending 
character  of  the  bellows  instead  of  spherical  joints 
simplifies  the  structure  and  the  manufacturing  process, 
since  especially  the  manufacture  of  miniaturised  spherical 
joints  is  difficult.  By  changing  the  lengths  of  the  bellows 
the  orientation  of  the  mobile  platform,  and  thus  the 
position  of  the  end-effector  can  be  controlled. 
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3.  The  hybrid  model  of  the  piezohydraulic 
actuation  system 


In  the  following,  we  develop  a  dynamic  model  of  the 
piezohydraulic  actuator  for  its  z-axis  displacement.  Since 
the  bellows  and  the  hydraulic  system  can  be  considered  as 
a  linear  amplifier,  the  specific  properties  of  the 
piezohydraulic  actuator  inherit  mainly  from  the  properties 
of  the  piezoelectric  actuator  -  hysteresis  and  drift,  as 
shown  in  Fig.  4  and  Fig.  5,  respectively.  From  Fig.  4,  it  can 
be  noticed  that  the  gain  of  the  actuator  is  nonlinear.  Due  to 
the  hysteresis  and  drift,  a  linear  dynamic  model  is  not 
sufficient  to  describe  the  system  behaviour.  Therefore,  the 
model  parameters  have  to  be  changed  to  cope  with  the 
nonlinear  features.  This  paper  uses  a  model  structure  where 
parameters  of  a  properly  formulated  linear  dynamic  model 
are  scheduled  by  neural  networks.  The  model  structure 
presented  in  [9]  is  improved  to  take  into  account  the  minor 
hysteresis  loops. 

3,1  The  dynamic  part  of  the  actuator  model 


The  step  response  shown  in  the  Fig.  5  can  be  described 
by  a  linear  model  containing  two  time  constants,  one 
describing  the  fast  behaviour  and  the  other  describing  the 
slow  behaviour  -  the  drift  -  of  the  system.  As  shown  in 
Fig.  6,  the  model  consists  of  two  first  order  transfer 
functions  having  time  constants  Tx  and  T2,  and 
respectively  two  gains  kx  and  k2 .  The  gains  determine  the 
contribution  of  the  two  parts  to  the  system  output.  The 
system  can  be  described  by  a  transfer  function  having  two 
poles  and  one  zero: 


G(5)  = 


(kxa  +  k2b)s  +  ab(kl  +  k2) 
(5  +  a)(j  +  b) 


(1) 


where  a  -  \/Tx  and  b  -  l/T2. 

However,  the  linear  transfer  function  with  fixed 
parameters  is  only  locally  valid  due  to  the  nonlinearities  of 
the  system.  To  describe  the  actuator  behaviour  over  the 
entire  workspace,  model  parameters  must  be  identified  for 
each  operation  context. 


3.2  The  static  part  of  the  actuator  model 


The  static  nonlinearity  part  of  the  hybrid  model  is 
responsible  for  describing  the  hysteresis  of  the  actuator.  It 
also  takes  into  account  the  gain  nonlinearity.  One  of  the 
classical  models  for  hysteresis  is  the  Preisach  model  [13]: 

x(t)  =  [[  n-(a,  p)ya|3[tt(0]^adp  (2) 

J  Ja>p 


Hysteresis  property  of  the  piezohydraiiic  actuator 


Figure  4:  Hysteresis  and  gain  nonlinearity 
properties  of  the  piezohydraulic  actuator. 


Step  response  of  the  piezohydraulic  actuator 


Figure  5:  Step  response  of  the  piezohydraulic 
actuator,  the  drift  being  obvious. 


where  *(0  is  the  output  response  of  a  piezoceramic 
actuator;  p.(a,  P)  is  a  weighing  function  in  Preisach 
model;  a  and  p  correspond  to  up  and  down  switching 
values  of  the  input,  as  shown  in  Fig.  7;  and  yap [u(t)] 
is  a  binary  hysteresis  operator  whose  value  is 
determined  by  the  input  operation. 


The  classical  Preisach  model  can  only  represent 
hysteresis  nonlinearity  which  satisfies  the  wiping-out 
property  and  the  congruency  property.  The  wiping-out 
property  means  that  the  hysteresis  curve  does  not  depend 
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200) 


Input  voltage  u(t) 

Figure  7:  Hysteresis  loops. 

on  how  it  was  approached.  The  congruency  property  refers 
that  the  change  in  displacement  should  be  the  same  for  the 
same  control  input  change.  This  property  is  not  satisfied 
e.g.  in  piezoceramic  actuators.  A  detailed  discussion  of 
these  properties  can  be  found  in  [13]. 

If  we  regard  the  Preisach  model  as  a  nonlinear  function, 
it  could  be  generalized  as: 

x(t)  =  /(a,  p,  u(t),y[u(t)])  (3) 

where  a  and  (3  correspond  to  up  and  down  switching 
values  of  the  input,  u(t)  is  the  input  signal  of  the 
actuator,  and  y [«(r)]  is  the  direction  of  the  input 
signal. 

Thus,  nonlinear  mappings  such  as  feed  forward  neural 
networks  can  be  used  to  describe  the  hysteresis 
nonlinearity.  Moreover,  the  dynamic  model  is  only  valid 
locally  because  it  depends  on  the  input  voltage  and  history 
of  the  inputs.  Therefore,  the  parameters  of  the  dynamic 
model  should  be  adapted  to  the  hysteresis  and  gain 
nonlinearities. 

3.3  The  hybrid  model  structure 

The  dynamic  model  (1)  can  be  modified  to  take  into 
account  the  system  nonlinearities  by  replacing  the 
parameters  with  functions: 

G(s)  =  A,, (a,  «,y)a  +  A2(a,  P»  “>Y) 

(s  +  a)(s  +  b(a,  p,  u,  y)) 

where 

A,  =  £,(a,  p,  u,  y )a  +  k2( a,  P,  u,  y )b(a,  p,  u,  y) 
and 

A2  =  ab{ a,  p,  u,  y)[fc,(a,  p,  u,  y)  +  k2( a,  p,  u,  y)]  . 
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Figure  8:  Variation  of  the  parameter  a  in  different 
models. 


Figure  9:  Principle  of  the  hybrid  model  structure. 

The  response  time  of  the  piezohydraulic  actuator  is 
consistency  despite  the  hysteresis  nonlinearity.  Thus,  the 
parameter  a  is  kept  constant.  This  has  been  proven  in 
experiments  and  is  demonstrated  in  Fig.  8  which  shows  the 
values  of  a  for  50  different  models.  Those  models  are 
determined  from  50  step  responses  covering  the  whole 
workspace  of  the  actuator.  The  parameters  b  ,  kx  and  k2 
are  functions  of  up  and  down  switching  values  a ,  p ,  input 
signal  u(t)  and  its  direction  y .  The  functions  are  nonlinear 
and  implemented  using  neural  networks.  Other  methods 
can  also  be  used.  The  hybrid  model  structure  is  shown  in 
Fig.  9. 

4.  Simulation  and  experimental  results 

The  system  response  can  be  divided  into  multiple 
windows  to  determine  the  parameters  of  an  individual 
linear  dynamic  model.  Such  determination  results  in  a 
group  of  dynamic  models  G((s)  and  groups  of  model 
parameters  (afy  bif  ku,  k2i)  . 

A  multi-stage  step  signal  is  designed  to  cover  the 
workspace  of  the  actuator,  as  shown  in  Fig.  10.  The  signal 
can  be  divided  into  multiple  windows  such  that  each 
window  contains  one  step  signal.  As  a  result,  316  windows 
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Figure  10:  The  multi-stage  step  test  signal. 

are  obtained  for  the  test  signal.  The  a  parameter  is 
relatively  constant  for  all  ARX  models  and  has  determined 
based  on  the  main  loop  model  [9],  which  has  a  mean  value 
of  117.4  and  a  standard  deviation  of  13.9,  Fig.  8.  After 
determining  the  gains  ku ,  the  fast  dynamic  response  is 
simulated  and  the  difference  between  the  measured  signal 
and  simulated  fast  response  is  computed  for  each  window. 
The  resulting  residue  is  used  for  identifying  an  ARX  model 
with  one  pole  for  each  step  response  and  thus  determining 
bi  and  k2i .  After  the  model  parameters  for  each  window 
have  been  computed,  they  are  used  as  the  training  data  for 
neural  networks.  Three  two-layer  feedforward  neural 
networks  are  used  for  computing  the  functions  kY(  ), 
k2{  )  and  b{  ) .  Input  variables  for  the  networks  are  a ,  p , 
u{t)  and  y. 

The  model  is  built  using  the  test  signal  shown  in  Fig. 

10.  The  simulated  output  of  the  model  and  the  testing  data 
match  quite  well.  A  segment  of  the  signal  is  shown  in  Fig. 

1 1 ,  as  the  entire  signal  contains  too  many  steps  to  be  seen 
the  details  in  a  single  figure.  Fig.  12  compares  simulated 
and  experimental  results  of  a  linear  decaying  ramp  signal 
and  shows  almost  indistinguishable  difference.  When  the 
results  are  plotted  as  a  hysteresis  curve  in  Fig.  13,  the  small 
difference  between  the  experimental  results  and  simulated 
outputs  can  be  observed.  Fig.  14  shows  the  comparison  of 
an  experimental  and  a  simulated  response  for  a  signal 
generated  using  a  joystick,  i.e.  directly  by  a  human 
operator.  The  model  output  is  able  to  follow  the  measured 
curve  generally.  The  modelling  errors  seen  in  Fig.  14  come 
from  several  aspects.  Firstly,  the  measurement  signal  is 
noisy,  which  brings  errors  in  the  parameter  determination. 
The  errors  in  the  parameters  transfer  to  the  neural  network, 
and  in  the  meanwhile,  make  the  training  of  the  network 
more  difficult.  Secondly,  the  used  training  signal  may  not 
sufficiently  excite  the  actuator. 


Figure  11 :  Model  matching,  measured  (gray)  and 
simulated  (black)  output. 
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Figure  12:  Measured  (gray)  and  simulated  (black) 
output  for  a  linear  decaying  ramp  signal. 


Experimental  and  simulated  hysteresis  curve 


Figure  13:  Measured  (gray)  and  simulated  (black) 
hysteresis  curves. 
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Overall,  our  proposed  model  describes  well  the 
dynamics,  the  drift  and  the  nonlinear  properties  of  the 
actuator.  The  results  can  be  further  improved  by  reducing 
measurement  noise  and  using  more  complex  pattern  in 
training  signal. 


Figure  14:  Measured  (gray)  and  simulated  (black) 
response  of  a  free  movement  test. 

The  developed  actuator  model  can  be  utilized  in 
modelling  the  entire  micromanipulator  by  combining  three 
actuator  models  together.  Since  the  actuator  model 
describes  the  dynamics,  hysteresis  nonlinearity  and  drift  of 
the  micromanipulator,  the  identification  of  the  manipulator 
kinematics  completes  the  micromanipulator  model. 

5.  Conclusions 

This  paper  presented  a  nonlinear  dynamic  model  of  a 
piezohydraulic  actuator.  The  model  is  able  to  describe 
minor  hysteresis  loops.  The  structure  of  the  model  can 
conceptually  be  separated  in  two  parts:  a  linear  dynamic 
part  to  model  the  dynamics  and  the  drift  property  of  the 
actuator,  and  a  static  nonlinear  part  to  adapt  the  dynamic 
model  to  the  hysteresis  and  gain  nonlinearities.  The  hybrid 
model  was  generated  using  a  multi-stage  step  signal.  The 
resulted  model  was  verified  on  a  linear  decaying  ramp 
signal  and  a  free  movement  test.  The  results  have  shown 
that  the  nonlinear  dynamic  model  is  capable  of 
compensating  the  hysteresis  and  gain  nonlinearities  of  the 
actuator  as  well  as  the  drift  property. 

The  hybrid  model  presented  here  has  been  tested  on 
both  the  major  loop  and  the  minor  loop  of  the  hysteresis. 
By  improving  the  measurement  and  trying  other  types  of 
test  signal,  it  is  possible  to  improve  the  model.  The 
presented  nonlinear  dynamic  model  can  also  act  as  a  part  of 
a  micromanipulator  model.  A  micromanipulator  model 


based  on  the  actuator  models  will  be  developed  in  the 

future. 
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Abstract 

The  development  of  a  human  upper  limb  prosthesis  is  a 
multidisciplinary  approach  involving  mechanics, 
electronics  and  control  in  a  concurrent  manner.  As  a 
consequence,  better  solutions  are  obtained  if  a 
mechatronic  approach  is  adopted.  The  present  paper 
outlines  the  work  done  in  the  framework  of  the  project 
MANUS  so  far.  In  particular,  problems  related  to  the 
development  of  a  natural  HMI  based  on  EMG  sensors 
under  a  specific  electronic  architecture  are  addressed. 
Likewise,  we  focus  on  the  particular  solution  of  hand 
mechanics,  electronic  architecture  and  actuators  design 
so  that  overall  functionality  is  met  in  accordance  with 
user  requirements. 

1.  Introduction 

The  topic  of  robotics  hand  design  has  received  interest 
from  the  scientific  community  for  more  than  three 
decades.  Initially  it  was  a  mean  of  providing  dextrous 
manipulation  capabilities  to  robotics  devices  looking  for 
reusability  of  components,  i.e.  more  general  tools.  The 
thorough  state  of  the  art  achieved  in  the  field  of 
mechanical  design  and  control  of  such  robotics  hands 
made  it  possible  focusing  on  the  application  of  these 
devices  to  technical  aids  for  disabled,  in  particular,  for 
prosthetic  hands. 

The  problem  becomes,  however,  much  more  complex. 
On  one  hand,  while  in  robotic  hands  actuator  packages 
can  be  placed  far  from  the  hand  itself  and  therefore  can  be 
bulky,  prosthetic  applications  require  high  miniaturisation 
in  order  to  integrate  all  the  components  in  the  same  room 
a  human  hand  would  take.  This  imposes  the  necessity  of 
developing  high  power  density  actuator  packages.  On  the 
other  hand,  the  prosthesis  has  to  be  commanded  by  the 


amputee  and  thus  a  natural  interface  should  be  provided, 
allowing  for  enough  fast  command  channels. 

Taking  the  whole  implementation  into  consideration, 
specific  sensing  devices,  actuators,  mechanics  and  control 
has  to  be  provided  in  a  concurrent  engineering  problem. 
As  a  consequence,  a  number  of  mechatronic  topics  are 
inherently  involved  in  such  a  device.  In  particular, 
supporting  electronics  is  to  be  specifically  designed  in 
order  to  improve  actuator  and  sensing  device 
functionality.  Moreover,  advanced  motion  control 
schemes  are  also  of  paramount  importance  given  the  fact 
that  some  autonomous  reactive  behaviour  of  the  hand  in 
response  to  external  disturbances  (mainly  slippage  and 
unknown  forces)  is  necessary. 

The  present  paper  introduces  the  approach  followed  by 
the  MANUS  consortium  under  the  European  Community 
project  “Modular  Anthropomorphous  User-adaptable 
Hand  Prosthesis  with  Enhanced  Mobility  and  Force 
Feedback”  under  contract  DE-4205.  The  MANUS 
consortium  includes  scientific  technology  developers, 
medical  firms  as  well  as  user  organisations  and  has  been 
set  up  in  order  to  provide  support  to  the  mechatronic 
approach  proposed  in  the  development  of  the  prosthesis. 

In  the  following  sections  we  will  attend  to  main 
subsystems  of  the  hand  prosthesis,  namely,  Electronics 
and  Control,  Hand  mechanism  and  Actuation  technology 
focussing  in  the  latter. 

2.  Electronics  &  Control 

The  aim  of  the  project  is  increasing  the  functionality  of 
the  hand  prosthesis  by  providing  a  threefold  modular 
approach.  The  prosthesis  should  be  modular  according  to 
the  number  and  kind  of  grasping  modes  provided  to  the 
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amputee;  it  has  to  be  modular  in  the  sense  of  allowing  a 
stepwise  extension  to  the  shoulder  (in  this  first  version 
only  hand  and  wrist  are  addressed);  the  third  modularity 
aspects  accounts  for  the  residual  capabilities  of  the 
amputee. 


In  order  to  solve  the  problem,  an  electronic  subsystem, 
referred  to  as  EMG  Processor  in  figure  1  has  been 
implemented.  The  EMG  processor  filters,  samples  and 
treats  the  signal  to  produce  a  three  bit  ternary  logic  code 
to  command  the  prosthesis.  Figure  2  shows  the  EMG 


Oiyope  1.  E^e^tpovix  cnj/GTep 


Regarding  this  last  issue,  a  natural  human-machine 
interface,  HMI,  has  to  be  provided.  Several  alternatives 
have  been  considered.  Among  them  two  are  being 
evaluated  in  the  project:  surface  mioelectrical  pickups  and 
mioelectrical  implanted  sensors.  In  both  cases,  the  signal 
genarated  by  muscle  contraction  are  registered  and  coded 
to  command  the  prosthesis.  Since  a  large  number  of 
grasping  modes  is  desired,  a  high  number  of  codes  is 
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signal  from  a  healthy  muscle  of  a  patient  with  amputation. 
It  has  been  experimentally  checked  that,  with  a  short 
period  of  training,  the  amputee  can  produce  desired 
commands  by  controlling  the  muscle  contraction  is  short 
periods  of  0.5  to  1  s. 

Following  this  approach,  we  have  detected  18  valid 
codes  for  commanding  the  signal.  All  codes  starting  with 
1  or  2  (medium  or  high  muscle  contraction)  are  associated 
with  valid  commands.  Since  electrical  potential  signal 
vary  from  user  to  user,  when  the  prosthesis  is  fitted  a 
calibration  process  starts  in  which  the  user  is  asked  to 
produce  a  pattern  of  codes  so  that  signal  processing 
parameters  are  optimised.  It  has  been  checked  out  that  the 
amplitude  of  EMG  signals  corresponding  to  the  same 
patient  vary  as  a  function  of  muscle  fatigue,  sweating  and 
a  number  of  other  causes.  In  order  to  avoid  wrong 
command  coding,  calibration  processes  are  evenly 
distributed  in  time  so  that  processing  parameters  are 
always  kept  updated. 

High  level  control  of  the  prosthesis  is  performed  by 
the  host  microcontroller,  as  shown  in  figure  1.  One  of  the 
requirements  as  specified  by  users  and  therapists  for 
increasing  the  functionality  of  the  prosthesis  is  involving 
the  users  in  the  control  loop  of  grasping  operations.  This 
can  be  achieved  by  providing  a  feel  of  the  grasping  force 
being  exerted.  The  host  microcontroller  is  in  charge  of 
computing  the  hand  kinematics  transformation  according 
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to  the  exact  kinematics  of  the  hand  mechanism  and  the 
particular  control  command  generated  by  the  user. 

3.  Hand  Mechanism 

When  designing  an  anthropomourphous  hand  for  a 
upper  limb  prosthesis,  a  number  of  general  design 
requirement  arise.  Designing  a  hand  prosthesis  is  a  clear 
example  of  user  oriented  concurrent  design  in  which 
ergonomic,  functional  as  well  as  cosmetic  issues  must  be 
addressed. 

In  a  first  stage  of  our  project,  we  have  identified  all 
these  requirements  in  terms  of  must  and  minimum  criteria. 
The  definition  of  requirements  is  based  on  a  survey 
conducted  on  hand  prosthesis  users  belonging  to  several 
countries  and  cultures  as  well  as  on  prosthesis  therapists, 
[1].  From  this  study  a  number  of  functional  requirements 
arose  that  can  be  summarised  as  follows: 

•  Desired  5  grasping  modes  according  to  figure  2: 
lateral  grasp,  hook  grasp,  precision  grasp,  cylindrical 
grasp  and  spherical  grasp.  A  minimum  of  4  grasps  are 
acceptable  (spherical  grasps  are  not  so  useful  as  the 
other). 

•  Maximum  grasping  time  of  1  s. 

•  Motion  range  of  90  degrees  for  finger  motion 
(medium  and  index). 

•  Desired  grasping  force  of  100  N. 

•  Maximum  overall  weight  of  800  gr  (for  the  entire 
prosthesis  device  including  electronics  and  actuators), 
desired  weight  of  500  gr. 


Tip  grasp  Spherical  grasp  Lateral  grasp 
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According  to  these  specifications  several  kinematics 
configurations  have  been  considered.  One  of  the  issues 
addressed  at  this  stage  is  the  use  of  adaptive  vs.  non- 
adaptive  mechanisms.  Adaptive  mechanisms  are 
mechanisms  with  a  number  of  degrees  of  freedom  higher 
than  the  number  of  actuators  in  the  hand.  As  a 


consequence  it  is  an  underactuated  mechanism  that  relies 
on  differential  devices  to  obtain  shape  adaptation,  i.e.  the 
enveloping  surface  of  the  fingers  can  be  adapted  to  the 
shape  of  the  object  being  grasped.  A  first  advantage  of 
these  approach  is  the  increased  manipulability  of  the  hand. 

On  the  other  hand  non-adaptive  mechanisms  have 
exactly  the  same  number  of  actuators  as  the  number  of 
degrees  of  freedom.  Generally,  for  these  devices  the 
motion  of  the  different  phalanxes  of  a  finger  is  linked  in  a 
fix  manner  to  the  motion  of  the  previous  phalanxes.  A 
number  of  mechanisms  for  doing  so  are  currently  well 
known.  They  are  based  on  mechanical  linkages  based  on 
bars  [2],  on  gears  and  on  tendons  [3].  Even  when  no 
possibility  of  shape  adaptation  is  present  in  this  type  of 
mechanism,  resulting  devices  are  much  simpler  than 
adaptive  devices. 

In  the  framework  of  MANUS  we  have  analysed  both 
adaptive  and  non-adaptive  mechanisms.  Due  to  severe 
restrictions  on  weight  and  available  size  (see  above)  we 
have  finally  selected  a  non-adaptive  mechanism  with  four 
degrees  of  freedom.  The  mechanism  comprises  three 
active  fingers:  index  medium  and  thumb  and  additionally 
the  wrist  is  provided  with  a  rotational  joint  providing 
prono  supination. 


The  motion  of  index  and  medium  is  linked  by  a 
differential  mechanism  so  that,  when  one  of  the  fingers 
contacts  the  object  to  be  grasped,  the  other  follows  flexing 
until  contact  and  thus  providing  an  adaptation  to  object 
geometry.  After  studying  different  kinematics 
configurations,  it  was  found  that  an  opposition  type  of 
hand  would  provide  the  basic  four  grasping  modes  above 
introduced.  Designing  a  two-degree  of  freedom  thumb 
(opposition  and  flexion)  solves  this. 
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Two  basic  mechanisms  for  linking  the  motion  of  all 
phalanxes  for  a  given  finger  are  being  considered: 
coupling  by  bars  and  tendons.  Coupling  with  bars  could 
result  in  a  mechanism  with  singular  positions  in  its  range 
of  motion,  therefore  a  careful  design  is  needed.  On  the 
other  hand,  coupling  through  tendons  avoid  the  problem 
of  singularities  in  the  motion  range  but  increases  the 
number  of  components  to  be  included  in  a  very  small 
room,  a  trade-off  should  be  reached.  Also  weight  is  of 
major  concern.  Having  into  account  that  the  overall 
weight  should  not  increase  800  gr,  no  more  than  100- 
200gr  are  left  for  the  hand  mechanism. 
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4.  Actuation  technology 

It  was  already  on  April,  1982,  in  the  context  of  a 
workshop  held  at  MIT,  [4],  where  it  was  pointed  out  that 
“the  current  actuation  technology  provides  perhaps  the 
most  serious,  long  term  impediment  to  artificial  hand 
design”.  The  same  statement  holds  17  years  after  this 
meeting.  In  fact,  current  actuation  technologies  fail  to 
provide  efficient,  high  power  density  actuators  suitable  for 
artificial  hand  design.  The  lack  of  adequate  actuation 
technologies  directly  affects  the  design  of  dextrous  hands. 
This  impediment  becomes  even  stronger  if  the  artificial 
hand  is  intended  for  prosthetics. 

In  the  framework  of  MANUS  a  number  of  possible 
alternatives  have  been  considered.  Initially  a  number  of 
“traditional”  technologies  were  proposed  as  backup 
technologies,  while  a  number  of  “non-traditional” 
actuation  technologies  were  proposed  for  research,  ie. 
Shape  Memory  Actuators,  and  Ultrasonic  Motors. 


In  the  first  stage  of  the  project  we  have  thoroughly 
studied  and  compared  all  of  them  in  terms  of  efficiency, 
bandwidth,  controllability,  available  torque  and  velocity, 
robustness  and  specific  power.  As  a  result  we  have 
concluded  that  both  US  motors  and  Brushless  DC  motors 
are  the  most  suited  for  a  hand  prosthesis  application. 

While  Brushless  DC  motors  is  a  well  known 
technology,  US  motors  are  still  in  the  state  of  the  art  of 
research  in  actuation  technologies.  Therefore  our 
consortium  is  conducting  research  towards  improving  US 
motors  performance  mainly  in  providing  a  good  trade-off 
between  delivered  torque  and  rotation  speed  and 
efficiency.  We  have  fully  designed  a  number  of  prototypes 
according  to  our  theoretical  models  to  check  the  effect  of 
a  number  of  parameters  on  the  performance  of  the 
actuator,  one  of  them  can  be  seen  in  figure  5. 

Ultrasonic  piezoelectric  motors  are  motors  of  simple 
structure  that  use  elastic  ultrasonic  waves  to  drive  a  rotor 
by  frictional  forces.  There  are  several  types  of  ultrasonic 
motors,  which  can  be  classified  depending  on  its  shape 
and  the  type  and  number  of  vibration  modes  they  use  to 
produce  the  movement.  Among  all  types  of  piezoelectric 
motors,  the  ring  type  travelling  wave  ultrasonic  motor 
(TWUM)  has  been  the  most  successful  and  it  is  being 
used  in  commercial  applications  [5]. 

Ultrasonic  motors  driven  by  travelling  waves  are  based 
on  the  fact  that  when  a  travelling  wave  propagates  along 
elastic  solid,  each  particle  on  its  surface  moves  describing 
an  ellipsis.  The  ultrasonic  travelling  waves  are  produced 
by  electric  excitation  of  a  stator  made  of  a  piezoelectric 
material.  This  motion  is  then  imposed  by  direct  contact  to 
the  rotor. 

Ultrasonic  piezoelectric  motors  exhibit  several 
features  that  are  very  convenient  for  a  prosthetic  hand 
application.  Compared  to  a  normal  electromagnetic  D.C. 
motor  which  has  a  high  efficiency  but  a  low  torque  at  high 
speed,  the  ultrasonic  motor  can  produce  high  torque  at  a 
low  speed  with  a  high  efficiency.  In  addition  in  these 
lightweight  motors,  the  torque  per  unit  weight  is  high  and 
since  the  inertia  of  the  rotor  can  be  made  veiy  small,  the 
control  characteristics  at  start  and  stop  are  also  good. 
Moreover,  since  the  rotor  is  always  pushed  against  the 
stator  to  maintain  frictional  forces,  when  no  electric  power 
is  applied  the  torque  and  position  is  held.  This  last 
characteristic  increases  the  total  efficiency  of  ultrasonic 
motors  for  this  application,  since  no  power  would  be 
needed  when  the  hand  is  in  its  final  position. 

A  double  stator  travelling  wave  ultrasonic  motor 
(TWUM),  as  schematically  shown  in  figure  5,  has  been 
designed  to  obtain  a  higher  operating  torque  for  a  given 
diameter.  As  shown  in  the  figure,  the  rotor  is  made  of  two 
outer  disks,  which  are  both  fixed  to  the  motor  shaft  so 
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they  move  together.  Each  stator  has  its  own  input  signal, 
so  there  are  two  independent  energy  inputs  to  the  two-disc 
rotor.  Thus,  the  final  output  torque  delivered  by  the  motor 
is  due  to  the  forces  acting  upon  both  disks  in  the  contact 
interface.  In  addition,  since  torque  and  efficiency  depend 
on  the  pre-load,  see  [5],  the  pressure  between  rotor  and 
stators  for  this  prototype  can  be  adjusted  by  pressing 
against  the  wave  springs  also  shown  in  the  figure.  This 
force  is  applied  by  tightening  the  upper  disk  until  the 
desired  position  is  reached  and  then  locking  it  with  the 
nut. 

The  other  components  of  the  motor  are  the 
piezoelectric  and  the  engineering  plastic  rings.  Each 
piezoelectric  ring  is  divided  into  18  sectors  with 
alternating  polarity.  A  ring  is  stuck  to  each  of  the  stators 
with  a  very  strong  adhesive.  On  the  other  hand, 
engineering  plastics  are  stuck  to  each  disk  to  provide  a 
wear  resistant  surface  for  the  contact  interface  with  the 
stators. 

An  interesting  issue  addressed  by  our  design  is  related 
to  the  possibility  of  optimising  the  driving  voltage  to  the 
piezoelectric  discs  independently.  We  have  to  take  into 
account  that  even  when  piezoelectric  ceramics  are  very 
similar,  in  general  there  will  be,  due  to  manufacturing 
process,  some  differences  in  the  resonant  frequency  of  the 
discs.  If  discs  are  excited  at  different  frequencies  there 
would  be  a  net  difference  of  tangential  velocities  at  the 
contact  point  where  torque  and  speed  is  applied  to  the 
rotor.  But  if  frequencies  are  equal,  the  amplitude  of 
vibration  will,  in  general,  differ  between  stators,  causing 
again  the  same  problem  of  inefficient  torque  transmission. 

In  order  to  avoid  so,  amplitude  of  vibration  has  to  be 
controlled.  This  is  solved  by  placing  embedded  sensors  in 
the  piezoelectric  ceramic.  These  embedded  sensors  are 
circular  sectors  of  the  piezoelectric  disc  that  are  used  to 
measure  the  amplitude  of  vibration.  Amplitude  of  both 
piezoelectric  discs  is  compared  and  controlled  so  that  it 
remains  equal  during  operation  and  thus  minimising 
sliding  in  the  contact  surface. 

Future  research  includes  the  optimisation  of  the  stator 
design,  specially  the  tooth  shape,  and  the  rotor  stiffness. 
Such  study  would  require  a  better  understanding  of  the 
non-linear  contact  problem  between  rotor  and  stator. 

This  prototype  can  be  also  considered  as  two  parallel 
coupled  actuators.  Such  configuration  has  some 
advantages  in  force  control  applications.  Beside 
increasing  the  output  torque,  this  actuation  system  can  be 
used  to  expand  the  dynamic  range  and  reduce  impact 
forces  [6].  For  instance,  if  different  stators  are  used,  the 
resolution  would  be  bounded  by  the  minimum  controllable 
force  exerted  by  the  smaller  stator  and  the  maximum  force 
of  the  larger  stator.  In  addition,  if  instead  of  a  rigid  link 


between  the  disks  a  compliant  transmission  is  used,  the 
forces  due  to  unexpected  contact  transitions  would  be 
reduced. 


5.  Conclusions 

In  this  paper  we  have  presented  an  overview  of  the 
activities  being  carried  out  in  the  framework  of  the 
MANUS  project  to  develop  a  hand  prosthesis.  As  a 
whole,  the  prosthesis  can  be  considered  as  a  complex 
mechatronic  system  in  which  system  functionality  is 
enhanced  by  specifically  interrelating  mechanisms, 
electronics  and  control. 

At  a  level  of  component  or  subsystem  the  same 
mechatronic  approach  can  be  seen.  Taking  as  an  example 
the  implementation  of  Ultrasonic  Motors  we  see  how  the 
integration  of  embedded  sensors  can  be  used  to  optimise 
the  performance  of  the  motor  by  modifying  the  vibration 
pattern  of  the  stator  and  thus  allowing  a  uniform  contact 
surface. 

It  is  still  soon  to  evaluate  the  performance  of  the 
proposed  hand  prosthesis,  in  particular  more  research  has 
to  be  conducted  on  actuation  to  improve  efficiency  and 
speed. 
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Abstract 

In  this  paper  we  present  an  active  video  module  that  con¬ 
sists  of  a  miniature  video  sensor,  a  wireless  video  trans¬ 
mitter  and  a  pan-tilt  mechanism  driven  by  micromotors. 
The  video  module  is  part  of  a  miniature  mobile  robot  that 
is  projected  to  areas  of  the  environment  to  be  surveyed.  A 
single-chip  CMOS  video  sensor  and  miniature  brushless 
D.C.  gearmotors  are  used  to  comply  with  restrictions 
imposed  by  the  robotic  system  in  terms  of  payload  weight, 
volume  and  power  consumption.  Different  types  of  actua¬ 
tion  are  analyzed  for  compatibility  with  a  mesoscale 
robotic  system.  Applications  of  an  active  video  module  are 
discussed. 

1.  Introduction 

The  use  of  robots  to  remotely  monitor  hazardous  envi¬ 
ronments  is  a  primary  application  for  autonomous  robotic 
systems.  A  joint  project  between  the  University  of  Minne¬ 
sota,  MTS  Systems  Corp.,  and  Honeywell  Inc.  is  develop¬ 
ing  a  new  approach  to  this  robotic  task  through  the  use  of  a 
novel  distributed  system  of  miniature  mobile  robots. 
These  miniature  robots  are  distributed  throughout  the 
environment  through  two  separate  methods  of  locomotion. 
A  gross  positioning  method  launches  the  robots  through 
the  air  to  an  approximate  location.  After  the  robots  land, 
fine  motion  capabilities  that  the  robots  posses  allow  them 
to  move  into  appropriate  reconnaissance  positions.  These 
miniature  robots,  called  Launchable  Reconnaissance 
Robots  (LRR),  contain  various  types  of  sensors  as  pay- 
loads  and  a  wireless  transmitter/receiver.  In  this  paper  we 
describe  the  active  video  module  that  was  designed  as  a 
payload  for  an  LRR.  This  is  a  challenging  task  due  to  the 
limitations  required  on  size,  weight,  and  power  consump¬ 
tion.  We  discuss  the  various  design  issues  and  new  tech¬ 
nologies  that  enabled  us  to  achieve  our  goal  of  providing 
live  video  information  using  active  video  sensors. 


1.1.  Launchable  reconnaissance  robots 


Figure  1.  Launchable  reconnaissance  robot 


LRRs  are  cylindrical  in  shape  with  an  outer  diameter  of 
40  mm  and  length  of  80  to  100  mm.  These  dimensions 
allow  the  robot  to  be  launched  using  standard  equipment. 
There  are  two  wheels  at  both  ends  of  the  cylinder  that  are 
driven  by  separate  D.C.  gear-motors.  Another  motor  is  used 
to  retract  a  spring  arm  located  outside  the  body.  By  quickly 
releasing  the  spring  a  hopping  action  of  the  robot  is 
achieved.  This  locomotion  method  is  intended  to  rescue  the 
robot  from  obstacles  that  are  too  big  to  move  over  using  the 
wheels.  Figure  1  illustrates  a  prototype  LRR. 

The  most  important  components  of  the  LRRs  are  the  var¬ 
ious  sensors  that  are  carried  as  payloads  within  the  shell. 
LRRs  have  a  modular  design  so  that  different  payload  mod¬ 
ules  can  be  attached  to  the  base  robot  for  alternative  func¬ 
tionality.  These  modules  include  vibration  and  toxic  gas 
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sensors  using  MEMS  technology,  microphones,  and  an 
active  video  module. 

LRRs  can  be  deployed  either  by  individuals  or  by  more 
sophisticated  and  larger  mobile  robots.  They  receive  com¬ 
mands  and  transmit  information  through  an  RF  data  link. 
This  allow  the  robots  to  form  a  distributed  sensory  network 
over  the  area  of  surveillance.  Figure  2  illustrates  this  con¬ 
cept. 

1.2.  Active  Video  Module 

The  active  video  module  consists  of  a  miniature  video 
camera,  a  wireless  video  transmitter  and  a  pan-tilt  mecha¬ 
nism.  The  camera  is  normally  concealed  inside  the  body  to 
conserve  the  tubular  form  of  the  shell.  It  comes  out  of  the 
body  by  opening  a  hatch  and  retracts  when  the  robot  is  to 
be  moved.  However,  the  camera  can  still  see  through  the 
transparent  body  of  the  robot. 

The  payload  volume  available  for  the  video  module  is  a 
semi-cylinder  along  the  tubular  body,  approximately  35 
mm  in  diameter  and  18  mm  in  length  with  a  total  volume 
of  8.7  cm3.  To  fit  inside  such  a  small  volume  each  compo¬ 
nent  of  the  module  must  be  miniaturized.  Additionally,  the 
maximum  power  available  for  payloads  from  the  lithium 
batteries  of  the  robot  is  0.9  W  (100mA  @  9V). 

In  the  remainder  of  this  paper  we  discuss  individual  ele¬ 
ments  of  the  active  camera  module  surveying  the  available 
technologies. 


Operator 


Figure  2.  Distributed  robotic  system 


2.  Video  camera  and  transmitter 

Video  is  a  valuable  information  source  for  reconnais¬ 
sance  and  surveillance  purposes.  Live  or  still  images  may 
be  captured  and  sent  back  to  a  human  operator.  A  video 


camera  generates  signals  according  to  the  light  intensity  on 
its  sensor.  Light  rays  from  the  scene  are  focused  on  the 
sensor  plate  by  a  lens  system.  Early  video  sensors  were  all 
tube-type  devices  and  were  expensive.  An  enormous 
reduction  in  cost,  size  and  power  consumption  was 
achieved  by  the  invention  of  the  CCD  sensor. 

Another  type  of  video  sensor  technology,  the  CMOS 
sensor,  has  emerged  recently.  Both  CMOS  and  CCD  sen¬ 
sors  are  solid-state  devices  made  from  silicon.  They  are 
based  on  the  same  principle  of  photoconversion  to  repre¬ 
sent  incident  photons  by  charge.  Unlike  the  CCD,  the 
CMOS  sensor  detects  the  integrated  charges  in  the  pixels  at 
the  spot,  without  transferring  them,  using  charge  amplifiers 
made  from  CMOS  transistors.  CMOS  is  a  well  developed 
technology  and  all  necessary  circuitry  for  the  camera  can 
be  integrated  in  a  single  chip  at  a  reduced  cost  and  power 
consumption  [7]. 

An  important  feature  of  the  sensor  is  on-chip  automatic 
exposure  control  circuit.  This  circuit  adjusts  the  integration 
time  of  the  pixels  (the  duration  while  the  photons  hit  the 
pixels  and  charges  are  collected  before  they  are  sampled 
and  flushed)  and  eliminates  the  need  for  external  mechani¬ 
cal  shutter  components.  In  other  words,  the  camera  elec¬ 
tronically  adjusts  to  ambient  lighting  conditions  and  no 
mechanical  aperture  in  the  lens  system  is  needed.  Since  the 
video  module  will  be  used  both  indoors  and  outdoors  this 
functionality  is  essential. 

The  power  consumption  of  single  chip  monochrome 
CMOS  video  sensors  on  the  market  are  typically  between 
100-200  mW.  The  power  consumption  of  CCD  sensors  is 
typically  3  to  5  times  this  figure.  The  sensor  we  use  is  the 
OV5016  by  OmniVision  and  consumes  20  mA  at  5  V. 

Color  sensors  are  also  available  for  both  CCD  and 
CMOS  types.  Color  images  do  not  contain  considerably 
more  information  than  grayscale  images  and  in  the  case  of 
the  video  module  the  increased  power  consumption  makes 
this  option  unattractive. 

A  pinhole  lens  with  5.7  mm  focal  distance  is  used  to 
focus  the  image  on  the  video  sensor.  The  resulting  sensor- 
lens  package  is  approximately  15x15x16  mm  in  size  and 
weighs  less  than  5  gr. 

CMOS  vision  sensors  are  also  sensitive  to  near-infrared 
wavelengths.  Using  suitable  LEDs  for  illumination,  these 
sensors  are  useful  for  nighttime  applications. 

Table  1  summarizes  the  specifications  of  the  video  cam¬ 
era  used  in  the  video  module. 


Table  1:  Video  camera  specifications 


Sensor  type 

Single-chip  monochrome  CMOS 
sensor  with  320x240  pixels 

Size 

15  x  15  x  16  mm 

Power  consumption  ! 

20  mA  at  6-9  VDC 
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Table  1:  Video  camera  specifications 


Output 

Composite  video  signal,  2  V  p-p  at 

30  frame/s 

Lens 

Pinhole  lens  5.7  mm  focal  length 

There  are  a  number  of  wireless  video  transmitters  avail¬ 
able  on  the  market,  however,  only  those  intended  for 
covert  video  applications  and  hobby  use  are  small  enough 
to  fit  within  the  payload  constraints.  We  use  a  miniature 
transmitter  by  Micro  Video  Products,  Canada  that  trans¬ 
mits  in  the  900  MHz  ISM  (Industrial,  Scientific  and  Medi¬ 
cal)  band  and  consumes  30  mA  at  9V.  The  circuit  board  is 
about  24  x  17  x  8  mm  in  size.  Its  range  was  tested  to  be 
150-200  ft  line  of  sight  indoors.  However,  the  structure  of 
the  building  will  affect  this  figure. 

3.  Actuators 

Development  of  actuators  for  MEMS  is  an  important 
research  area.  Several  different  actuators  utilizing  various 
physical  phenomena  have  been  developed.  The  effect  of 
miniaturization  on  these  actuators  is  dependent  on  the  type 
of  forces  involved  in  actuation  [15], 

Common  microactuators  can  be  classified  as  actuators 
using  electromagnetic  and  electrostatic  forces  and  actua¬ 
tors  using  a  functional  element  [10],  Examples  of  actua¬ 
tors  with  a  functional  element  are  piezoelectric  and  shape 
memory  alloy  (SMA)  actuators. 

Many  of  these  microactuators  may  be  applied  to  mesos- 
cale  systems  millimeter  to  centimeter  size.  However,  their 
effectiveness  in  this  size  may  be  different  than  it  is  in  the 
micro  domain.  Additionally,  some  actuators  may  require 
high  voltages  or  currents  which  limits  their  use  in  minia¬ 
ture  mobile  robots.  Below,  common  types  of  actuators  are 
analyzed  from  this  perspective. 

3.1.  Electrostatic  and  electromagnetic  actuators 

Electrostatic  force  between  two  electrodes  is  propor¬ 
tional  to  the  surface  area  of  electrodes  and  inversely  pro¬ 
portional  to  the  square  of  the  distance  between  them.  Since 
these  two  scale  equally  but  opposite  to  each  other  electro¬ 
static  forces  are  not  effected  from  miniaturization.  When 
electrostatic  forces  are  compared  to  gravitational  forces,  as 
in  the.  case  of  micro  systems,  they  are  considered  suitable 
for  actuation.  However,  high  voltages  (over  100  V)  are 
typically  needed  to  drive  electrostatic  actuators  [10],  For  a 
mesoscale  system  electrostatic  forces  are  usually  too  weak 
to  generate  mechanical  action. 

Unlike  electrostatic  forces,  electromagnetic  forces,  com¬ 
monly  utilized  in  all  types  of  electric  motors  are  effected 
from  scaling  by  the  square  of  the  linear  dimension.  How¬ 


ever,  electromagnetic  actuators  may  still  be  a  good  choice 
for  mesoscale  systems  if  the  magnetic  field  density  is  high. 
Motors  with  rare-earth  permanent  magnets  are  typically 
used  in  such  drives.  As  an  example,  a  brushless  D.C. 
motor  by  RMB  has  dimensions  of  3  mm  diameter  and  is 
approximately  10  mm  length.  Torques  of  25.10'6N-m  at 
20000  rpm  are  achievable  with  this  motor  [5], 

A  gearbox  at  the  output  of  the  electromagnetic  actuator 
is  often  necessary  to  increase  the  torque  while  reducing  the 
speed.  Typical  reduction  rates  for  commercially  available 
gearmotors  with  a  diameter  below  5  mm  are  from  1 :3.6  to 
1:125  [6]  [12],  A  planetary  micro  gear  system  is  often 
employed  for  increased  reduction  in  a  small  volume.  The 
elements  of  the  gear  box  are  too  small  to  be  machined  by 
traditional  methods.  Wire  Electro  Discharge  Machining 
(W-EDM)  technology  allows  tooth  modulus  down  to  20 
microns  using  any  conductive  material.  Gears  made  of 
Nickel  manufactured  by  the  LIGA  process  are  also  used  in 
commercial  motors  [12],  The  rotating  shafts  are  usually 
made  of  steel  and  use  jewel  bearings. 

3.2.  Piezoelectric  actuators 

Piezoelectric  elements  generate  strain  due  to  an  applied 
voltage  across  them.  Nanometer  resolution  and  large 
forces  can  be  generated  at  frequencies  of  several  kHz. 
However,  the  strain  generated  is  around  0.1%  and 
mechanical  amplification  of  displacement  is  generally 
required.  A  mechanism  working  close  to  a  kinematic  sin¬ 
gularity  may  be  used  to  create  large  displacements  from 
the  small  strain  of  the  piezo  element  [4], 

Another  problem  is  the  requirement  of  high  voltages, 
typically  around  150  V.  Although  power  consumption 
may  be  low,  special  power  electronics  is  required  to  gener¬ 
ate  these  high  voltages  from  typical  battery  supply  volt¬ 
ages  of  mobile  robots. 

One  distinct  type  of  actuator  using  piezoelectric  ele¬ 
ments  is  the  ultrasonic  motor  [13].  These  types  of  motors 
have  a  rotor  that  rests  on  a  stator  made  of  piezoelectric  ele¬ 
ments.  The  stator  is  excited  by  a  voltage  signal  to  create 
travelling  waves  and  cause  a  rubbing  movement  between 
the  stator  and  the  rotor.  Typical  characteristics  of  these 
motors  are  high  torque  at  low  speed  and  high  holding 
torque  due  to  friction  between  stator  and  rotor.  They  are 
also  suitable  for  hazardous  environments  since  no  sparks 
are  produced.  The  inherent  high  torque  at  low  speeds  elim¬ 
inates  the  need  for  complex  gear  boxes  in  many  cases. 

3.3.  Shape  memory  alloy  actuators 

Shape  memory  alloy  (SMA)  material  is  a  metal  alloy 
(commonly  TiNi)  with  a  shape-recovery  characteristic. 
When  the  material  is  plastically  deformed  and  then  heated 
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above  a  certain  temperature,  it  recovers  its  original  shape. 
This  property  is  utilized  to  create  various  kinds  of  actua¬ 
tors.  The  SMA  material  is  usually  strained  by  a  bias  force 
and  upon  heating  recovers  its  original  shape  by  acting 
against  the  bias  force.  Stresses  of  170  MPa  and  more  can 
be  generated  this  way.  The  bias  force  is  adjusted  to  cause 
4%  maximum  strain  to  minimize  the  decrease  in  the  mem¬ 
ory  effect  after  many  cycles.  Tens  of  millions  of  cycles  are 
possible  at  low  strain  [3], 

The  SMA  provides  simple  and  robust  actuation  within  a 
small  volume  and  weight.  It  is  intrinsically  an  on/off  type 
of  actuator  with  two  positions  for  high  and  low  tempera¬ 
ture  states.  However,  research  has  been  done  to  implement 
electric  resistance  feedback  control  in  a  SMA  servo  sys¬ 
tem  [9]. 

One  disadvantage  of  SMA  is  its  relatively  slow  response 
especially  during  the  cooling  phase  which  is  usually  not 
forced.  Bandwidths  of  approximately  4  Hz  have  been 
achieved  by  differential  heating  and  using  SMA  wire  both 
as  actuator  and  as  mechanical  bias  for  restoration  [8]. 
Another  disadvantage  for  mobile  systems  with  limited 
power  supply  is  the  typical  current  of  several  hundred  mil- 
liamps  required  to  heat  the  SMA  material. 

In  the  case  of  the  active  video  module,  the  most  restric¬ 
tive  requirements  from  the  chosen  actuation  type  are  small 
volume,  low  current  (100  mA  peak),  and  low  voltage  (9  V 
max).  The  camera  weighs  less  than  5  gr.  and  enough 
torque  can  be  generated  for  the  necessary  pan  and  tilt 
action  by  any  of  the  three  actuation  types  mentioned 
above.  An  ultrasonic  motor  has  good  torque,  speed,  and 
holding  torque  specifications  for  this  purpose,  however  the 
need  for  power  electronics  to  increase  the  voltage  and 
driver  circuitry  to  generate  appropriate  signals  does  not 
comply  with  the  small  volume  available. 

A  mechanism  driven  by  a  shape  memory  alloy  actuator 
would  have  the  advantage  of  simple  and  thus  reliable  oper¬ 
ation.  However,  the  camera  is  to  be  tilted  and  panned 
within  a  range,  and  intermediate  positions  must  be  held 
without  consuming  power.  SMA  actuation  can  still  be  use¬ 
ful  for  simple  mechanisms  like  bistable  latches  for  locking 
and  releasing  spring  actuated  hinges. 

An  electromagnetic  actuator  was  chosen  to  drive  the 
pan-tilt  mechanism  of  the  active  video  module.  It  is  a 
brushless  D.C.  gearmotor  by  RMB.  The  motor  has  a  diam¬ 
eter  of  3.4  mm  and  length  of  approximately  15  mm.  A  3 
stage  planetary  gearbox  provides  1:125  reduction  and  a 
continuous  output  torque  of  2.2  mNm  [6].  The  total  gear- 
head  efficiency  is  60%.  Since  the  motor  is  brushless,  com¬ 
mutation  is  done  externally  by  a  microprocessor  based 
drive  circuit,  also  supplied  by  the  company.  However,  the 
on  board  processor  of  the  robot  is  likely  to  take  over  this 
job.  Peak  power  consumption  is  70  mA  at  5  V. 


4.  Pan-tilt  mechanism 

The  usual  design  of  a  pan-tilt  mechanism  has  two  actua¬ 
tors  for  each  axis  of  motion.  Usually  the  pan  motor  carries 
the  tilt  motor  and  the  camera.  These  types  of  pan-tilt  actu¬ 
ators  are  frequently  used  for  security  monitoring.  They  are 
also  used  by  computer  vision  and  robotics  researchers  for 
active  vision.  These  systems  are  generally  big,  heavy  and 
slow.  Additionally  they  do  not  incorporate  any  position 
feedback  sensor.  Some  alternative  designs  were  made  [1], 
for  example  a  linear  stepper  motor  controlled  platform 
pan-tilt  actuator  and  a  spherical  pointing  motor  (SPM)  The 
latter  consists  of  a  miniature  camera  with  a  permanent 
magnet  mounted  on  a  gimbal.  Three  sets  of  coils  are 
wounded  outside  the  gimbal  in  orthogonal  directions.  By 
controlling  the  individual  currents  to  each  coil  a  magnetic 
field  vector  of  desired  orientation  is  produced.  The  perma¬ 
nent  magnet  on  the  gimbal  (and  thus  the  camera)  aligns 
itself  with  this  vector.  The  camera  can  be  rotated  by  step 
sizes  of  0.011°.  However,  the  SPM  weighs  160  gr.  and 
requires  about  1A  current. 

The  active  video  module  transmits  live  images  back  to  a 
human  operator  and  the  pan-tilt  action  is  also  controlled  by 
this  operator.  Therefore  highly  accurate  motion  or  position 
feedback  is  not  essential.  On  the  other  hand,  the  camera 
should  normally  be  concealed  inside  the  robot  body,  come 
out  when  needed,  and  retract  before  the  robot  moves. 

The  general  design  of  the  pan-tilt  mechanism  is  shown 
in  Figure  3.  A  tendon  attached  to  a  drum  at  the  base  con¬ 
trols  the  tilt  action.  The  camera  is  attached  to  a  sliding  col¬ 
umn  and  is  constantly  pushed  up  by  a  compression  spring. 
Additionally,  a  torsion  spring  at  the  upper  drum  exerts  a 
continuous  moment  to  tilt  the  camera  towards  its  maxi¬ 
mum  tilted  position.  When  the  tilt  motor  releases  the  ten¬ 
don  the  camera  first  raises  up  to  gain  clearance  for  pan 
action  and  then  rotates  90  to  180  degrees  under  the  action 
of  the  torsional  spring.  The  reverse  happens  when  the  ten¬ 
don  is  wound  back.  The  whole  setup  is  mounted  on  a  plat¬ 
form  which  is  rotated  by  a  second  motor  for  the  pan 
action.  The  portion  of  the  transparent  shell  of  the  robot 
which  is  directly  above  the  camera  is  separate  from  the 
rest  and  is  attached  to  the  camera.  Figure  4  shows  the 
operation  of  the  mechanism.  Figure  5  shows  an  early 
design  of  the  active  video  module.  The  camera  and  the 
motor  are  visible. 
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Figure  3.  Pan-tilt  mechanism 


Figure  4.  Mechanism  operation 


Figure  5.  Active  video  module 


5.  Applications  and  future  work 

The  primary  application  of  the  launchable  reconnais¬ 
sance  robot  is  surveillance  and  especially  detection  of 
humans.  Currently  the  images  acquired  from  the  robots  are 
inspected  by  human  operators  but  the  goal  is  to  bring  more 
autonomous  behavior  using  advances  in  technology  and 
computer  vision. 

Image  processing  is  by  its  nature  a  computationally 
expensive  task.  However,  using  digital  video  cameras  and 
powerful  microprocessors  it  is  possible  to  have  embedded 
vision  systems  suitable  for  miniature  mobile  robotic  appli¬ 
cations.  One  example  is  the  Eyebot  from  the  University  of 
Western  Australia  [2].  This  platform  employs  a  digital 
camera  with  80x60  pixels  and  a  Motorola  68332  32-bit 
microcontroller  for  control  of  mobile  robots  and  processing 
of  visual  data. 

Digital  transmission  with  image  compression  is  another 
advantage  of  using  digital  cameras.  A  micro  camera  sys¬ 
tem  compromising  a  CMOS  grayscale  sensor  with  312  x 
287  pixels,  A/D  converter,  processing  interface  and  pipe¬ 
lined  processing  architecture  was  built  into  a  package  size 
of  20.6  x  15.75  xl4.7  mm  [11].  The  total  processing  power 
of  the  camera  is  70  MIPS  (million  instructions  per  second). 
It  can  be  programmed  to  perform  real-time  image  enhance¬ 
ment,  image  encoding  or  motion  triggered  acquisition. 

Active  camera  systems  have  been  used  for  motion  track¬ 
ing.  Motion  based  tracking  systems  have  the  advantage  of 
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being  able  to  track  any  moving  object  regardless  of  shape 
and  size  [14].  Unlike  recognition  based  systems  they  can 
be  used  effectively  in  uncontrolled  environments. 

Our  future  goals  include  digital  image  acquisition,  on¬ 
board  image  processing  and  implementing  active  vision 
techniques  with  the  vision  module. 

6.  Conclusion 

A  miniature  active  video  module  for  a  launchable 
mobile  robot  was  designed.  Different  types  of  video  sen¬ 
sors  were  inspected  and  various  forms  of  micro  actuation 
were  analyzed  for  their  compatibility  in  a  mesoscale 
robotic  system.  Applications  and  future  improvements  of 
the  video  module  were  discussed. 
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ABSTRACT 

We  present  a  localization  and  navigation  system  which 
determines  the  position  and  orientation  of  a  mobile  robot  and 
navigates  it  safely  using  the  odometry  system,  a  gyro  and  a 
compass  measurements.  The  estimating  of  the  localization  using 
these  sensors  information  was  performed  by  an  extended 
Kalman  filter  (EKF).  Using  this  kind  of  sensor  fusion,  the 
precise  localization  information  of  the  mobile  robot  updated  and 
path-planning  can  be  realized  safely  by  Generalized  Voronoi 
Diagrams  (GVD). 

1. INTRODUCTION 

Precise  determination  of  the  actual  position  and  orientation  of 
an  autonomous  mobile  robot  is  essential  in  control  and  path 
finding  [1,2].  This  important  task  can  be  accomplished  by  using 
dead  reckoning  [3,  4],  landmarks  [5-7]  or  beacons  [8,  9]  to 
correct  errors  in  position.  In  the  dead  reckoning  system  the 
robot’s  position  and  orientation  are  calculated  from 
measurements  of  wheel  motion,  also  called  odometry.  Dead 
reckoning  control  has  some  problems  like  poor  mechanical 
alignment  of  wheels,  slop  in  gears,  noise  in  sensor  signals, 
errors  in  sensor  signals,  wheel  slippage  and  trajectory  variations 
due  to  surface  unevenness.  To  reduce  the  problems  with  dead 
reckoning,  recently,  a  position  and  orientation  determining 
strategies  are  developed  by  using  an  ultrasonic  ring  around  an 
autonomous  mobile  robot  and  two  ultrasonic  position 
referencing  beacons  in  [10],  and  only  one  beacon  in  [1 1]. 

In  this  work,  we  introduce  an  approach  to  combine  the  observed 
sensor  data  coming  from  dead  reckoning  system,  a  fiber-optic 
gyroscope  also  called  “laser  gyroscope”  (Autogyro  Navigator) 
[12]  and  a  magnetic  compass  (KVH  Fluxgate  Compass)  [13] 
sensors  for  exact  determination  of  the  position  and  orientation  of 
a  mobile  robot  in  a  predefined  environment.  An  extended 
Kalman  filter  (EKF)  [14-16]  algorithm  is  employed  to  realize 
this  approach.  In  addition,  measurements  data  gathered  from  the 
ultrasonic  and  infrared  sensors  are  integrated  to  avoid  the 
mobile  robot  from  the  known  /  unknown  obstacles  and  plan  the 
target-path  using  the  Generalized  Voronoi  diagrams  (GVD)  [17, 
18]  concept.  We  apply  the  our  approach  to  the  problem  of 


location  estimation  and  navigation  in  mobile  robotics  using  and 
experimenting  with  the  our  laboratory  mobile  robot  Nomad-200. 
The  Nomad-200  is  an  integrated  mobile  robot  system  and  has  an 
on-board  multiprocessor  system  consisting  of  a  Pentium  133  and 
multiple  slave  microcontrollers.  It  has  also  an  incremental 
odometry  system,  one  laser  gyroscope  and  one  magnetic 
compass  to  localize  itself,  and  sixteen  infrared  and  ultrasonic 
sensors  to  obtain  a  360-degree  sweep  horizontally. 

The  use  of  gyros  in  mobile  robot  applications  has  become  more 
attractive  for  position  estimation.  Komoriya  and  Oyama  used  the 
OF6-3  (Hitachi)  optical  fiber  gyroscope  in  order  to  fuse  with 
odometry  information  [19].  They  used  a  Kalman  Filter  to  fuse 
these  two  different  sensors  information.  Barshan  and  Durrant- 
Whyte  conducted  the  gyroscopes  for  the  inertial  navigation  of 
the  mobile  robots  [20].  They  evaluated  the  performance  of  the 
ENV-05S  Gyrostar  from  (Murata)  and  the  other  Solid  State 
Angular  rate  Transducer  “Start”  gyroscope  manufactured  by 
(Gee).  They  found  that  these  gyros  had  relatively  large  drifts 
like  5°  to  15°/min.  However,  The  applying  EKF  showed  that  the 
drift  can  be  reduced  the  rate  of  1°  to  3°/min. 

Because  of  its  technical  maturity  and  commercial  availability, 
the  magnetic  compass  is  an  useful  altematif  to  inertial  sensor 
systems  provided  it  is  widely  used  in  mobile  robotics 
applications.  Due  to  the  influence  on  accumulated  dead¬ 
reckoning  errors,  to  provide  a  measure  of  absolute  heading  of 
the  mobile  robot  is  very  important  to  solve  the  navigation 
problems  safely.  One  magnetic  compass  has  an  important 
disadvantage  that  the  magnetic  field  of  the  earth  is  often 
distorted  near  high  voltage  system  or  big  ferromagnetic 
structures.  Also  the  robot  itself  distorts  this  magnetic  field.  In 
order  to  filter  these  internal  and  external  magnetic  field 
perturbations,  we  have  developed  a  method. 

The  Kalman  Filter  approach  is  an  essential  data  fusion  method 
to  reduce  the  drift  and  distortions,  and  fuse  odometry,  gyro  and 
compass  data  in  both  of  the  above  studies.  To  produce  a  detailed 
model  of  the  sensors  is  important  to  use  the  Kalman  Filter 
efficiently.  In  our  study,  we  have  developed  the  EKF  algorithm 
(presented  in  section  3)  to  process  the  sensor  data  sequentially. 
Each  sensor  information  was  taken  into  consideration 
separately.  So  it  made  the  algorithm  more  flexible  and  the 
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measurements  could  be  easily  integrated  with  each  other  in  the 
EKF  algorithm. 

In  the  navigation  part  in  section  4,  mobile  robot  can  navigate 
with  a  given  set  of  path  data  from  an  off-line  path  planner  which 
uses  GVD  diagrams  and  calculates  its  shortest  target  path, 
shown  in  Fig. 2  and  3.  In  our  system  there  is  an  active  matching 
algorithm  in  order  to  update  the  robot  environment  with  new 
obstacles  or  to  delete  some  known  obstacles  that  are  removed 
anywhere.  If  any  change  in  the  robot  environment  is  sensed,  the 
off-line  path  planner  produces  its  new  path  data  using  the  GVD 
diagrams. 


2. SENSING  SYSTEM 


2.1  Incremental  Odometry  System 

Our  robot.  Nomad  200  has  three  wheel  synchronous  drive  non- 
holonomic  system  with  zero  gyro-radius  (i.e.  the  robot  can  rotate 
around  its  center)  and  equipped  with  optical  shaft  encoders.  The 
three  wheels  controlled  by  one  motor  translate  together  and 
rotate  all  together  with  an  other  motor.  Its  shaft  encoders  give 
pulses  per  sampling  period.  The  number  of  counted  pulses  are 
converted  to  the  distance  and  given  as  a  x,  y  coordinates  of  the 
robot.  The  robot  can  only  translate  along  the  forward  and 
backward  directions.  This  situation  is  the  non-holonomic 
constraint  and  similar  to  that  of  a  car.  The  reference  point  of  the 
mobile  robot  is  the  midpoint  of  the  wheels  as  shown  in  Figure  1 . 


However,  this  sensor  is  often  distorted  near  high  voltage  lines 
and  big  ferromagnetic ,  structures  that  corrupt  the  earth’s 
magnetic  field. 

In  the  mobile  robot  applications,  the  best  compass  is  the  flux- 
gate  compass.  It  measures  the  horizontal  components  of  the 
earth’s  magnetic  field  strength  ; 

H  =  (  hx,  hy,  hz  )T 

Where  hx  is  measured  in  forward  direction,  hy  in  the 
mathematical  positive  cross-direction  and  h2  in  the  vertical 
direction.  H-vector  in  the  plane  x,  y  describe  a  circle.  So.  the 
robot  heading  is  given  by  the  transformed  x,  y  components; 

^compass  =  tan  1  (hy  /  hx  )  -  B0  (4) 

Where  9P  is  the  initial  heading  angle  of  the  robot  measured  by 
the  magnetic  compass  in  the  first  reading. 

2.3  Fiber-Optic  (Laser)  Gyroscope 

Gyroscopes  are  used  to  compensate  for  the  foremost  weakness 
of  odometry  in  the  mobile  robot  applications.  Any  small 
orientation  error  in  the  odometry  system  will  give  a  growing 
cumulative  position  error.  In  order  to  prevent  this  weakness, 
highly  accurate  gyros  can  be  used  to  detect  the  orientation  errors 
and  correct  them  immediately. 

In  our  system  we  used  a  laser  gyro  which  is  the  "‘Autogyro 
Navigator”  from  Andrew  Corp.  It  is  a  single-axis  inter¬ 
ferromagnetic  fiber  optic  gyroscope  to  support  the  dead 
reckoning  and  GPS  receiver  systems. 

The  device  measures  angular  rate,  and  allows  robot-turning 
angle  to  be  measured  accurately.  The  autogyro  produces  an 
output  in  digital  format  (16-bit)  proportional  to  angular  rotation 
rate  over  the  range  ±100  degrees  per  second.  One  sample  (0.1 
second)  represents  10  degrees  of  rotation  at  ±100  degrees  per 
second  rotation  rate.  With  the  gyro  stationary,  the  indicated  rate 
is  less  than  0.2  degree  per  second.  The  stationary  drift  rate 
depends  on  the  operating  temperature  range  and  drifts  with 
time.  Using  the  drift  rate  “edrift”  the  angular  velocity  of  the 
mobile  robot  can  be  calculated  as  follows: 

Wgyro  —  Cgyro  (  egyro  —  edrift  )  (5) 


Figure  1 .  Position  of  the  synchronous  mobile  robot. 

In  two-dimensional  Cartesian-coordinates  system,  the  robot 
position  and  orientation  at  the  t=kT  instant  can  be  given  as  Xk, 
yk  and  0k  respectively.  Using  the  trigonometric  relation,  the 
robot  position  and  orientation  can  be  updated  as  follows: 


6k+l=  6k  +  A0k 

(1) 

Xk+i=  Xk  +  Ask  Cos(0k+i) 

(2) 

yk+i=yk  +  ASkSin(0k+]) 

(3) 

Where  Ask  (  =T.vk )  is  the  distance  traveled  and  A0k  (  =T.Wk  ) 
the  steering  rate  between  the  two  sampling  point  pk  and  pk+i  . 

If  we  examine  the  equations  (1),  (2)  and  (3),  we  can  realize  that 
orientation  errors  will  cause  large  lateral  position  errors  and  this 
errors  will  increase  proportionally  with  the  distance  Ask  traveled 
by  the  mobile  robot.  For  this  reason,  we  used  gyroscope  and 
compass  to  compensate  this  foremost  weakness  of  odometry. 

2.2  Flux-gate  Magnetic  Compass 

In  our  mobile  robot,  we  have  C-100  (from  KVH)  compass 
system  that  uses  a  state  of  the  art  microprocessor  controlled 
flux-gate  compass.  In  the  convenient  conditions,  especially  in 
outdoor  applications,  accuracy  of  the  1  degree  can  be  achieved. 

In  terms  of  its  influence  on  accumulated  odometry  errors,  the 
robot  heading  0k  is  the  most  important  between  the  localization 
parameters.  In  an  other  word,  the  correct  measure  of  absolute 
heading  is  very  important  to  navigate  the  mobile  robot  safely. 


Where  egyro  is  the  gyro  output  voltage,  edrift  the  drift  voltage,  and 
Cgyro  transformation  constant. 

3.  LOCALIZATION  PROCESS 
3.1  Robot  Model  and  Kalman  Filtering 

In  our  study,  we  assume  that  the  angular  velocity  is  constant 
during  one  sampling  period  which  is  T=0.1  sec.  in  our  tests.  In 
this  case,  any  acceleration  in  the  robot  will  be  taken  as  system 
noise.  The  state  space  variables  are  the  position  x,  y  and  the 
heading  0.  We  also  add  the  vehicle  translation  velocity  “v”  and 
angular  velocity  “w”  to  the  state  vector  Xk  as  follows: 


X 

xk  +  Ask.Cos(0k  +  A0k) 

y 

yk  +  Ask  .Sin(0k  +  A0k ) 

0 

= 

0k+A0k 

V 

V 

w 

k-t-1 

w 

Which  can  be  written  as: 

Xk+i  =  f  (  xk ,  uk )  (6b) 

For  each  sampling  period  (T=0.1  sec.)  the  measurements  xQdo  , 
y0do  are  obtained  from  the  wheels  encoder,  the  rotational  Wg^o 
from  the  gyroscope  and  the  robot  heading  0m  computed  based  on 
A0Compass  or  ABgyro  or  A0odometry  using  this  simple  if  statement : 
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if  (  A6c  <  AGthi  )  &  ( |ABg_c|  >  A0th2 )  then  0m  =  ©compass 
else 

if  (  A0g  <  AOthi  )  &  ( |A0g.o|  >  A0th2 )  then  0m  =  0gyrO 

else  0m  —  0odometry 


Where  A0C 

(A0compass)k  “  (0compass)k  “  (0m)k*l 

A0g 

A 

' — ' 

1 

J 

II 

g 

& 

<X> 

< 

II 

A0g.c 

0gyro  “  0compass 

A0g-o 

—  0gyro  ~  ©odometry 

AGthi 

=  1 0  /  T  (Experimental  threshold  level- 1 ) 

A0th2 

=  0. 145°  /  T  (Experimental  threshold  level-2) 

If  we  want  to  summarize  these  measurements,  the  observation 
equation  can  be  expressed  as  follows: 

1  0  0  0  Ojx" 

0  1  0  0  0  y 

0  0  1  0  0  0  (7a) 

0  0  0  1  0  v 

0  0  0  0  1  w  . 

jl  Jk 

yk  =  g(xk)  (7b) 

Our  state  space  model  have  some  inaccuracies  otk  and  pk  which 
are,  respectively,  system  and  observation  noises.  In  this  case, 
the  stochastic  system  representation  can  be  expressed  by  the 
following  equations: 

Xk+i=  f(  Xk ,  Uk )  +  ak  (8a) 

Yk  =g(xk)+Pk  (8b) 

We  assume  that  these  noises  are  non-correlated,  zero-mean 
Gaussian  white  noises  with  covariance  matrices  Qa  and  Rp  , 
respectively. 

3.2  Estimation  Phase  of  EKF 

The  mean  of  the  estimated  state  vector  and  related  covariance 
matrix  are  the  probability  distribution  of  the  robot  state 
estimation.  Given  state  space  model  of  the  system  in  equation 
(8a)  is  in  the  non-linear  form.  We  should  linearize  this  equation 
in  the  form  of  equation  (9)  in  order  to  express  the  uncertainty  of 
his  structure. 

XkH-i=<|>k.Xk+  Ok  (9a) 

yk  =  Xk  +  Pk  (9b) 

Now,  let  us  choose  some  nominal  value  of  x  and  call  it  xn0m,  and 
constitute  the  state  space  transition  matrix  (jv 


Expanding  xk+i  in  a  Taylor  series  about  x  gives  us; 

Xk+i  ~  f(  Xnom.k,  Uk)  +  (Xk  —  Xnom.k)  .  (11) 

Where  we  have  left  off  the  higher  order  terms.  This  equation  is 
linear  in  x  in  this  form  without  the  higher  order  terms. 
Therefore,  the  optimum  estimate  of  Xk+i  is  given  as: 

xk+l  “  ^(xnomJc^kj+^k^k  “  xnom.k  )  (12) 

Where  as  usual  X  is  the  best  estimate  of  xk  inherited  from  the 

previous  computing  cycle.  If  we  assume  that  xnom.k  =  Xk,  then  it 
will  give  us: 

Xk+l=f(*k,Uk)  (13) 

The  equation  for  the  covariance  matrix  of  the  error  in  this 
estimate  is  as  follows: 

Pk+l^kPk-'t'k +Qcx  (14) 

In  our  experiments,  we  saw  that  standard  deviation  for  the 
odometry  is  1  cm.  in  100  cm.  and  2  degree  in  100  degree. 

3.3  Updating  the  State  Estimation 


We  assume  that  sensor  measurements  are  obtained  from 
statistically  independent  sources.  Therefore,  the  covariance 
matrix  of  the  output  noises  has  a  diagonal  form.  Since  state 
values  obtained  from  our  odometry  system,  the  observation 
matrix  is  equal  to  unity  matrix.  Therefore,  yk  in  observation 
equation  is  exactly  equal  to  state  vector  Xk.  The  optimal  estimate 
with  our  assumption  Xnom  k  =  x^  is  given  as: 

Xk+l=Xk+Kk+i[yk+l-Xk]  (15) 

Where  Kk+i  is  the  gain  matrix  and  can  be  calculated  using  the 
equation  (15); 

Kk+1  =flk(flc  +Rp)_1  (16) 

Let  us  constitute  the  Pk+i  using  his  equation: 

Pk+l  =  Pk  -  (Pk  +  Rp  )Kk+1.KT+1  (17) 

In  our  observations,  standard  deviation  in  compass  is  0.02  and 
in  gyro  0.01.  All  variables  of  the  above  model  can  not  be 
observed  using  our  sensor  system  and  then  the  estimation 
uncertainty  of  these  variables  will  diverge  unless  direct  x,  y 
measurements  are  taken  into  consideration  to  reduce  these 
uncertainties. 

4.  TRAJECTORY  GENERATION 

In  the  navigation  part,  our  mobile  robot  can  navigate  with  a 
given  set  of  path  data  from  an  off-line  path  planner  which  uses 
GVD  diagrams  [15]  and  calculates  its  shortest  target  path, 
shown  in  Figure  2  and  Figure  3. 

In  our  system  there  is  an  active  matching  algorithm  in  order  to 
update  the  robot  environment  with  new  obstacles  or  to  delete 
some  known  obstacles  that  are  removed  anywhere.  If  any  change 
in  the  robot  environment  is  sensed,  the  off-line  path  planner 
produces  its  new  path  data  using  the  GVD  diagrams. 


Figure  2.  All  possible  paths  with  GVD. 


Figure  3.  The  calculated  shortest  path. 
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5.  EXPERIMENTAL  RESULTS 


We  implemented  and  tested  the  study  on  our  mobile  robot 
Nomad  200.  The  robot  has  been  primarily  based  for  use  as  an 
autonomous  robot  operating  in  a  structured  laboratory  or  office 
environment.  It  has  also  served  as  a  test  platform  for  studies  in 
real-time.  The  entire  autonomous  robot  is  self-contained  (all 
processing  being  performed  onboard).  It  has  a  maximum  travel 
speed  on  61  cm/sec.  Our  laboratory  area  was  surveyed  out  to  a 
10  cm.  grid  with  accuracy  better  than  about  1  cm.  To  extract  the 
mapping,  a  start  and  target  point  were  presented,  and  an 
absolute  localization  was  presented  with  incremental  odometry, 
gyro,  and  compass  sensors  using  the  EKF.  In  addition  to,  a  real¬ 
time  path  planning  was  generated  using  the  GVD  diagrams. 

In  our  localization  study,  we  placed  the  robot  at  the  initial 
position  Xo  =  1.8  m.,  y0  =  -1.8  m.,  and  0O  =  0°  in  the  laboratory 
workspace  which  is  limited  to  an  area  of  5. 5x3. 5  meters.  A 
special  robot  trajectory  was  planned  using  the  polar  form  for  the 
elliptical  equation; 

x  =  Xc  +  ri .  Cos0  ,  y  =  yc  +  r2 .  Sin0 
Where  Xc  ,  yc  is  the  ellipse  center,  ri  semimajor  axis,  and  r2 
semi-minor  axis  as  shown  in  Figure  4. 

The  length  (p)  of  this  trajectory  is  approximately  13.7  meters. 


Figure  4.  Generated  robot  trajectory. 

In  the  first  part  of  the  localization  test,  we  started  the  robot  from 
the  initial  position  (x®,  y®,  0o)T  =  (L8,  -1.8,  0)T  and  turn  it  in  the 
counterclockwise  direction  along  the  elliptical  rotation  in  a 
travel  speed  on  35  cm/sec.  as  shown  in  Figure  4.  In  the  second 
part,  this  scenario  was  realized  in  the  other  direction  and  at  the 
same  speed.  After  these  turns,  obtained  position  and  orientation 
errors  on  the  final  point  are  tabulated  in  table  1  and  table  2.  In 
the  tests,  all  sensors  were  used  to  estimate  robot  localization 
and  track  the  generated  elliptical  path.  However,  the  sensor 
outputs  were  stored  separately  to  find  out  the  best  estimate  of 
the  robot  localization  with  different  sensor  combination. 

The  error  (e)  in  estimating  the  position  is  calculated  as; 

%  =  llp-p=ll2  (9) 

That  is,  the  error  in  estimation  is  the  geometric  distance  (in 
meters)  between  the  real  position  and  the  estimated  position. 
The  expected  value  of  this  error  will  provide  us  the  standard 
deviation  of  the  estimated  position. 

The  experimental  results  obtained  with  EKF  algorithm  in 
relation  to  the  sensor  groups  for  this  elliptical  path  are 
summarized  in  Table  1  and  Table  2.  Here,  £p  is  the  deviation  of 
the  robot  location  in  the  x,  y  coordinates  (Euclidean  distance 
between  the  starting  and  final  points),  £e  deviation  in  the  robot 
heading  angle.  We  also  calculated  the  relative  distance  errors 
£p/p  in  order  to  express  the  localization  errors  with  respect  to 
the  sensor  groups.  Our  test  results  show  that  if  the  odometry 
system  is  supported  with  a  gyroscope  and  a  compass,  the 
localization  errors  can  be  reduced  to  minimum  level. 


Table  1.  Localization  errors  after  CCW  turn. 


Errors 

Sensors  | 

Odo. 

Odo.& 

Gyro 

Odo.& 

Comps 

All 

£P 

30.7 

10.8 

7.5 

2.8 

£e 

-12.7 

-7.6 

3.8 

0.7 

Ep/P 

2.25 

0.79 

0.55 

0.20 

Table  2.  Localization  errors  after  CW  turn. 


Errors 

Sensors  j 

Odo. 

Odo.& 

Gyro 

Odo.& 

Comps 

All 

Ep 

42.7 

12.3 

6.5 

2.1 

£e 

-15.1 

-8.2 

0.7 

0.4 

Ep/p 

3.12 

0.90 

0.47 

0.15 

In  our  laboratory  area,  to  realize  the  navigation  of  the  robot,  a 
start  and  target  point  were  presented  and  an  off-line  path 
planning  was  generated  using  the  GVD  diagrams  as  mentioned 
before.  The  shortest  target  path  was  easily  calculated  with  this 
method,  shown  in  Figure  5. 


Figure  5  The  calculated  shortest  path  using  GVD  in  the 
laboratory  environment. 


6.  CONCLUSIONS 

A  complete  data  fusion  and  processing  system  was  presented  for 
application  in  the  robotics  localizing  and  navigation  field.  Our 
experimental  results  demonstrated  that  excellent  positioning  of 
low-speed  mobile  robots  can  be  achieved  using  an  incremental 
odometry,  a  fiber-optic  gyro  and  a  compass  sensors  inside  any 
working  environments  with  significant  cluttered  and  noise. 
System  performance  comes  into  view  to  be  close  to  the  best 
achievable  with  the  current  sensor  and  system  models.  The 
obtained  result  is  used  to  navigate  mobile  robot  safely.  In  our 
study,  GVD  diagrams  are  used  to  effectively  determine  a 
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sequence  of  robot  motion  such  that  the  robot  can  arrive  at  its 

goal  position  in  the  shortest  time  while  avoiding  obstacles. 
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